Sonnet 5 propagates across every distribution surface the week it ships, while assistants pry open their model layer.

The week in ai-assistants

The defining pattern this week was distribution speed, not raw capability. Anthropic's Sonnet 5 shipped June 30, and within seventy-two hours it was generally available inside GitHub Copilot, live on AWS Bedrock, and typed into the Anthropic TypeScript SDK as a first-class model id. A frontier model no longer lands at one endpoint; it lands across the entire assistant supply chain in the same news cycle. For operators, the takeaway is that the model itself is increasingly commoditized plumbing, and the contested surface has moved to who controls the picker, the entitlements, and the routing in front of it.

That is the second, quieter move: assistants are opening their model layer. Copilot added Kimi K2.7 Code as the first open-weight option in a picker that until now held only proprietary frontier models. OpenHands cloud shipped an Agent Client Protocol picker with bring-your-own-key gating and multi-model discovery. OpenRouter pushed aggregation into agent tooling with an MCP server. Model choice is being treated as a managed fleet — added, gated, and deprecated (Copilot flagged Gemini 2.5 Pro and Gemini 3 Flash for retirement the same week). The differentiation is migrating from the weights to the governance and the retrieval primitives wrapped around them.

Leaders

GitHub Copilot had the densest real week: Kimi K2.7 Code as the first selectable open-weight model, Copilot vision GA (images and PDFs as first-class prompt inputs), and Sonnet 5 promoted to a default-eligible option. The picker is now an actively managed fleet, with two Gemini tiers queued for deprecation alongside the additions.

Claude drove the week's upstream signal with Sonnet 5, pitched as its most agentic mid-tier model with gains over 4.6 in reasoning, tool use, and coding. Paired with Claude Tag pushing task delegation into Slack, the run shows both a fast frontier cadence and an agentic surface expanding into where teams already work.

Gemini widened its efficiency tiers with Nano Banana 2 Lite for fast, cheap image generation and Omni Flash for video and conversational editing, plus computer use landing in 3.5 Flash. Google is competing on cost-per-call and multimodal breadth rather than a single frontier headline.

Firecrawl shipped its Research Index, a curated index over 3M+ arXiv papers and their code claiming 53.3% recall on arXivQA against 45.4% for the next provider. It is Firecrawl's first specialized vertical index rather than a general scraper — a move to own a retrieval surface, not just the pipe to it.

Exa introduced Exa Agent, a web-research agent exposed through its API. It marks the pivot from returning search results to running the research itself, with MCP distribution positioning Exa as a retrieval backend inside other agent stacks.

Wildcards

Qodo published a rare architectural reversal: in Qodo 2.4 it tore out most of the state-of-the-art RAG code-review system it spent a year building, arguing the smarter move is to index almost nothing and remember the right context instead. A public teardown of a core product architecture is unusual and worth watching as a signal on where retrieval for code is heading.

Sourcegraph moved Agentic Batch Changes to public beta — an agent that scopes, edits, and drives PRs to mergeable across hundreds of repositories. It reframes the search index from a place to find code into an execution layer that ships migrations autonomously.

Themes that compounded

Sonnet 5 propagated to Copilot, AWS Bedrock, and the Anthropic SDK within days of release, compressing the model-to-distribution lag to near zero.
The model picker became a managed fleet: open-weight additions (Kimi K2.7), BYOK gating (OpenHands), and scheduled deprecations (Gemini tiers in Copilot).
Retrieval primitives climbed the stack — Firecrawl's vertical index, Exa Agent, and Sourcegraph's autonomous migrations all move from fetching to acting.
Agentic delegation kept spreading into existing workspaces (Claude Tag in Slack, Dosu's Libraries and Agents, OpenHands sub-agent visualizer).
MCP recurred as the distribution channel of choice, showing up in OpenRouter, Exa, and OpenHands releases as the way to embed inside third-party agents.

Watch this week

The near-zero lag between Sonnet 5's release and its appearance across Copilot, AWS, and the SDK sets the pattern to watch: the next frontier model will be judged as much on how fast it saturates distribution surfaces as on its benchmarks. Expect continued churn in the selectable-model lists as providers are added and retired, and watch whether the open-weight opening (Kimi K2.7 in Copilot, BYOK in OpenHands) widens into a real second track alongside the proprietary defaults. On the retrieval side, Qodo's public RAG teardown and Firecrawl's vertical index are early data points on a shift from broad indexing toward selective, curated context — a thread worth tracking across code and agent products in the coming weeks.