LiveKit Agents vs Google AI
Side-by-side trajectory, velocity, and editorial themes.
LiveKit Agents added Answering Machine Detection — voice agents are becoming a serious telephony runtime.
LiveKit Agents is releasing roughly twice a week along the 1.5.x line, accumulating telephony-grade primitives around its voice loop. The headline is Answering Machine Detection in 1.5.9 — an LLM-classified detector for what kind of endpoint an outbound call hit. Surrounding work is split between reliability (barge-in cooldown, interruption guards, preemptive-generation tuning, observability retries) and provider breadth (Perplexity Responses, Soniox, Speechmatics, Cerebras, xAI, Rime WebSocket TTS). The mcp_servers parameter was also deprecated on Agent and AgentSession.
The product is converging on a real contact-center runtime, not just a realtime meeting agent. AMD, warm transfer, DTMF handling, recording retries, and avatar join/playback metrics are the feature surface phone deployments demand. The provider plugin universe keeps widening; LiveKit positions itself as the neutral broker between voice models and the actual network. Internal cleanups (mcp_servers deprecation, instruction parts, AvatarSession base class) suggest a tidying pass before a 1.6 cut.
Expect more telephony primitives — supervisor barge-in, richer DTMF flows, call-recording controls — and a unified MCP configuration surface across Agent and Session as the mcp_servers deprecation lands fully.
I/O 2026: Gemini 3.5 lands as Google bets the stack on agentic action.
Google's consumer AI surface is mid-I/O 2026 announcement burst. Gemini 3.5 ships as a frontier model framed around 'action' rather than chat, alongside a $100/mo AI Ultra tier, expanded AI Mode in Search, voice-native Workspace tools, and a Beam group-meeting experiment. The framing across launches is consistent: Gemini is no longer positioned as a model but as an agent embedded in Search, Workspace, and devices.
The product line is converging on two arcs: agentic action and tiered monetization. Capability releases (Gemini 3.5, agentic Workspace, AI Mode) are arriving in lockstep with a steeper subscription ladder, and Search is being openly reframed away from the keyword model. Side bets like Beam and community investments suggest Google is willing to fund longer-horizon hardware and brand work while the model layer carries the revenue narrative.
Expect Gemini 3.5 'action' capabilities to surface in Workspace agents and Android over the next two quarters, with AI Ultra positioned as the gating tier. Watch for a developer-facing agent runtime to follow the consumer rollout.
See more alternatives to LiveKit Agents →
See more alternatives to Google AI →