AI Evening Update: Anthropic Caught in Pentagon Storm, OpenAI Builds Internal AI Workforce, and Vibe Coding Grows Up

As the day winds down, your AI Evening Update covers the stories that defined Tuesday in artificial intelligence. From Anthropic’s extraordinary entanglement with the US military to a fascinating look at how OpenAI is using AI to transform its own operations — tonight’s AI News Today recap is one you will not want to miss.

Anthropic AI Used in Iran Strikes Hours After Trump Banned It

In one of the most dramatic ironies of the week, US forces carried out major air strikes against Iran using Anthropic’s Claude AI for intelligence assessments and target identification — just hours after President Trump signed an order banning federal agencies from using Anthropic’s technology. The Wall Street Journal reported that the planning for Saturday’s military operation was already underway when Trump’s order was issued, and the operational reliance on Claude meant an immediate cessation was impossible.

Trump subsequently softened the ban to a six-month phaseout, citing operational realities. The episode has intensified the debate around AI’s role in military operations, with protesters gathering outside OpenAI’s San Francisco offices and former Trump advisor Dean Ball calling the original designation of Anthropic as a “supply chain risk” tantamount to “attempted corporate murder.” The story is far from over — it sits at the intersection of AI capability, national security, and corporate politics in ways that will reverberate for months.

Advertisement

OpenAI’s Two-Engineer Data Agent Now Serves Thousands

OpenAI shared a compelling internal success story: a data agent built by just two engineers is now serving thousands of employees, dramatically accelerating internal analytics workflows. Finance analysts who previously spent hours hunting through 70,000 datasets and writing complex SQL queries can now type a plain-English question into Slack and receive a finished, accurate chart in minutes.

The company says the system is replicable and is sharing details publicly to help other organisations build similar tools. It is a vivid demonstration that the most valuable AI deployments are often not the flashiest — they are the ones quietly solving real operational bottlenecks inside large organisations.

Alibaba’s Qwen3.5-9B: A Laptop-Sized Model That Beats a 120B Giant

Alibaba’s Qwen team made waves earlier this week with the release of Qwen3.5-9B, a small open-source model that reportedly outperforms OpenAI’s gpt-oss-120B on key benchmarks. The model is compact enough to run on standard consumer laptops, representing a significant step in the democratisation of high-performance AI. Whether it runs as a 0.8B model on a smartphone or a 9B model powering a coding terminal, the Qwen3.5 series is designed with the “agentic era” in mind — responsive, efficient, and deployable anywhere.

Vibe Coding Gets a Sober Assessment

As AI-assisted coding becomes mainstream, practitioners are beginning to push back on uncritical enthusiasm. Writing in VentureBeat, Doug Snyder explored the lessons learned from treating Google AI Studio as a full coding teammate. The key takeaway: generative AI is a powerful collaborator for prototyping and exploration, but production systems still demand human oversight, determinism, and rigorous testing. The “vibe coding” trend is maturing — which is exactly what the industry needs as AI-generated code quality comes under greater scrutiny.

That wraps up today’s AI Evening Update. The themes of the day — military AI ethics, internal AI transformation, open-source model efficiency, and the limits of autonomous coding — paint a picture of an industry grappling seriously with the implications of its own rapid progress. Stay tuned to AI Stack Digest for tomorrow’s AI News Today.

This article was produced with the assistance of AI tools and reviewed by the AIStackDigest editorial team.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top