Comparison · ai-assistants

D-ID vs Gemini

Side-by-side trajectory, velocity, and editorial themes.

AI-ASSISTANTS

2.5

D-ID's update stream is almost entirely blog content — the real product news is the LiveKit plug-in and V4 Visual Agents.

◆ Current state

What's flowing through the changelog reads more like a content-marketing calendar than a release feed: Sora alternative listicles, G2-rating posts, AI agents comparison pieces. The two genuine product items are the LiveKit plug-in that turns D-ID avatars into real-time visual agents and the earlier V4 Expressive Visual Agents launch positioned for product-grade scale.

◆ Where it's heading

D-ID is positioning at the intersection of real-time agent frameworks (LiveKit) and avatar generation, betting the interactive-avatar category (digital humans you can interrupt and challenge) will eclipse static AI video. The volume of best-of-X listicles suggests an SEO-driven top-of-funnel strategy more than a product-led one — the real momentum signal is the LiveKit integration, not the blog cadence.

◆ Prediction

Expect further real-time-frameworks integrations beyond LiveKit (Daily, Pipecat, or Twilio Voice) and a V5 or feature-named follow-up to V4 Expressive that adds direct emotion-control inputs.

Gemini

AI-ASSISTANTS

10.0

I/O 2026 ships Gemini 3.5, an agentic Gemini app, and Gemini for Science in a single keynote.

◆ Current state

Google's I/O 2026 consolidated the next phase of Gemini into a single news cycle. Gemini 3.5 lands as the new model family combining frontier reasoning with action. The Gemini app becomes proactive and 24/7 in posture. Gemini for Science launches as a vertical scientific-tooling product. Gemini Omni unifies multimodal creation and natural-language editing. Android picks up Gemini Intelligence for proactive on-device features, and a new $100 AI Ultra tier joins the subscription lineup. Content provenance tooling rounds out the safety side.

◆ Where it's heading

Google is no longer positioning Gemini as a model — it is positioning an agentic surface that crosses scientific research, Android, the consumer app, and creative production. The 'action' framing on Gemini 3.5 is the central technical bet; the multi-SKU and vertical product moves stack on top of it. The content-provenance work is the safety counterpart aimed at keeping the deployment story defensible.

◆ Prediction

Expect Gemini 3.5's 'action' capability to be the bar against which Anthropic and OpenAI are compared in the next quarter. More vertical products are likely to follow Gemini for Science (legal, code, finance), alongside deeper Android default-AI integrations that put real pressure on Samsung's and Apple's own assistant stories.

See more alternatives to D-ID →
See more alternatives to Gemini →