Google I/O 2026: Gemini 2.0 Ultra and the Search Cannibalisation Bet

Written byRhys Donnelly

PublishedJun 2, 2026

UpdatedJun 4, 2026

10 min read

Google I/O 2026: Gemini 2.0 Ultra, Android 16, and the Search Reinvention That Puts Google’s Core Business at Risk

Google I/O 2026, held in mid-May at the Shoreline Amphitheatre, was simultaneously Google’s most impressive technical showcase in years and the clearest public statement yet of the company’s central tension: how to deploy the AI capabilities that could make Google Search obsolete without making Google Search obsolete.

The announcements — Gemini 2.0 Ultra, AI Overviews’ expansion, Android 16’s deep Gemini integration, Project Astra’s progress toward persistent multimodal AI, and the continued evolution of NotebookLM — were technically impressive across the board. But the strategic subtext beneath each announcement was the same: Google is trying to turn the threat of AI-disrupted search into a durable advantage before someone else does it to them.

Gemini 2.0 Ultra: The Benchmark Leader That Matters Less Than It Should

Gemini 2.0 Ultra debuted at I/O 2026 — building on the agentic shift previewed earlier in the I/O keynote — with benchmark scores that establish it as the leading publicly-available foundation model on several major evaluations. On the MMLU Pro reasoning benchmark, Gemini 2.0 Ultra scores 91.4 — above GPT-4.5’s 89.7 and Claude 3.7 Opus’s 90.1. On coding benchmarks including HumanEval and LiveCodeBench, Gemini 2.0 Ultra similarly leads the pack — and the Flash tier compression that has played out earlier in 2026 means the price-performance advantage extends down the model stack. On multimodal benchmarks, it holds a more commanding lead: Google’s investment in video and audio understanding, built on top of its YouTube training data advantage, produces measurable capability improvements on video comprehension tasks that text-focused models cannot match.

The benchmark victory is genuine. The commercial implication is more complicated.

Enterprise buyers increasingly understand that benchmark scores predict model capability on well-defined tasks but do not fully predict real-world deployment reliability, instruction-following consistency, or safety behaviour. The enterprise sales cycle for foundation model access runs through procurement teams that prioritise vendor stability, compliance documentation, and integration support over benchmark rankings. In this environment, being the benchmark leader is a marketing advantage, not a decisive commercial one.

Google’s distribution through Google Cloud’s Vertex AI platform is its more durable competitive advantage. Gemini 2.0 Ultra access through Vertex AI means enterprise buyers already on GCP — Google’s estimated 30% share of enterprise cloud deployments — can add Gemini access to their existing vendor relationship without new procurement processes. For Google, the benchmark win matters primarily as permission to be in the evaluation shortlist; the distribution advantage is what converts evaluations to contracts.

AI Overviews and the Search Revenue Question

The most consequential and most carefully managed announcement at I/O 2026 was the expansion of AI Overviews — Google’s AI-generated search summaries that appear above organic results. AI Overviews now trigger for approximately 40% of Google Search queries in the US, up from 25% at launch and the 10% in the experimental phase. The expansion includes more categories: shopping queries, local business queries, and multi-step research queries now routinely receive AI Overview summaries.

The commercial tension is explicit: when an AI Overview answers a user’s question directly in the search results page, that user has less reason to click through to a website. Fewer click-throughs mean fewer opportunities for Google’s cost-per-click advertising to generate revenue. AI Overviews that are monetised with ads embedded in the summary itself produce lower CPMs than traditional search ads (because the user is reading rather than actively seeking to transact). The revenue-per-query economics of AI-augmented search are structurally lower than the revenue-per-query economics of traditional search.

Google’s response to this tension has been to move fast and shape the market before anyone else can. If AI search summaries are inevitable — which Google’s own data suggests, given user satisfaction scores for AI Overview results — then it is better for Google to cannibalise its own click-through revenue than to allow a competitor to capture the AI search market and cannibalise Google’s entire revenue base.

The bet is that AI-augmented search, despite lower per-query revenue, increases total query volume and total user time in the Google ecosystem sufficiently to offset the per-query revenue decline. Early data from Google’s advertising team supports this: average queries per user per day increased approximately 18% in markets where AI Overviews have been fully deployed for more than six months. If per-query revenue falls 20% but queries grow 18%, the net revenue impact is manageable — and if query growth continues to compound while per-query revenue stabilises, the long-term economics improve.

Android 16: Gemini Everywhere

Android 16, previewed at I/O 2026 for release to Pixel devices in Q3 2026, ships with Gemini as the system-level AI — replacing Google Assistant throughout the operating system. The integration is materially deeper than previous Gemini rollouts: Gemini has access to all on-screen content, the device’s notification history, calendar, contacts, Gmail, and Google Photos, enabling the contextual awareness that Apple Intelligence’s Siri has been attempting to achieve.

The Android 16 Gemini integration is significant for two reasons beyond user experience. First, the scale: approximately 3 billion active Android devices will eventually run Gemini-integrated Android, giving Google a training signal and product feedback loop that no competitor can match. Second, the data advantage compounds over time — Gemini learning from billions of Android interactions (with appropriate privacy controls) builds a behavioural model of how people actually use AI-augmented mobile operating systems that will improve Gemini’s on-device performance in ways that are structurally difficult to replicate.

The competitive comparison to Apple Intelligence is inevitable and instructive. Apple’s on-device AI runs 3-7B parameter models; Google’s Pixel-native Gemini Nano (the on-device component) has been expanded to larger model sizes with the A19-class Tensor chip in Pixel 10. The on-device vs cloud-dependent architecture debate continues, but Android 16’s approach — a hybrid that runs common tasks on-device and escalates complex tasks to cloud Gemini — is more pragmatic than Apple’s privacy-first on-device purist position.

Project Astra: The Persistent Multimodal Assistant

Project Astra, Google DeepMind’s research project for a persistent, multimodal AI assistant, showed its most advanced capabilities at I/O 2026. The demonstration showed an AI that maintains persistent memory across conversations (remembering context from sessions days earlier), understands video in real time through a phone camera, and can navigate complex multi-step tasks by combining visual understanding, web access, and long-form reasoning.

Astra is not a shipping product — the full vision remains a research demonstration. But the components are real and progressively being deployed: Gemini Live (real-time voice conversation), camera-based contextual awareness in the Gemini app, and memory features that persist across conversation sessions. The I/O 2026 demonstration showed these components operating more fluidly than in any previous public demo, suggesting the gap between research vision and shipping product has narrowed.

The strategic importance of Project Astra is not its current state but what it signals about Google’s capability roadmap. If Astra’s full vision ships — a persistent AI that knows your history, understands your environment in real time, and can act autonomously on your behalf — it represents a shift from search as query-and-response to search as continuous ambient intelligence. Google’s position at the centre of that paradigm is more defensible than its position in a world of competing AI chatbots, because the data infrastructure required to make Astra work at scale is something only Google (with its combination of search history, Maps data, YouTube engagement history, and Android device penetration) can credibly build.

NotebookLM and the Knowledge Work Tool

NotebookLM — Google’s AI-powered research and note-taking tool — received substantial updates at I/O 2026 that move it from a consumer productivity tool toward enterprise knowledge management. The enterprise tier, introduced in GA at I/O, allows organisations to deploy NotebookLM on top of internal document repositories, enabling employees to query institutional knowledge the same way they would query a curated research corpus.

NotebookLM’s audio overview feature — which generates a conversational podcast-style summary of a document or research topic — has been particularly successful with enterprise learners who absorb information better through audio than text. The feature is technically trivial (text-to-speech over a structured summary) but commercially clever: it creates a usage pattern that is highly sticky and differentiates NotebookLM from generic AI summarisation tools.

The enterprise NotebookLM play is a direct challenge to Microsoft’s Copilot positioning in knowledge management. Both products do similar things — surface relevant organisational knowledge in response to natural language queries. Google’s advantage is the quality of its foundation model for information synthesis; Microsoft’s advantage is integration depth within the Microsoft 365 data graph. The competition will be decided in enterprise IT evaluation cycles over the next 12-18 months, with data sovereignty configuration and existing vendor relationships the primary decision criteria.

What I/O 2026 Reveals About Google’s Strategic Position

Google enters mid-2026 in a stronger AI position than the conventional narrative — which spent 2023-2024 focused on OpenAI’s lead and Google’s alleged fumbling — suggested. Gemini 2.0 Ultra’s benchmark leadership, Android 16’s deep integration, and the measured expansion of AI Overviews reflect a company that has caught up technically and is executing a coherent commercial strategy.

The existential risk that preoccupied Google’s leadership from early 2023 — that AI search alternatives would erode the advertising revenue base before Google could adapt — has not materialised at scale. Perplexity, you.com, and other AI search alternatives have not taken measurable market share from Google Search. The 40% AI Overviews penetration is Google’s own cannibalisation of its click-through revenue, but it is happening on Google’s terms, at Google’s pace, with Google’s advertising infrastructure capturing most of the value.

The medium-term risk is not displacement but margin compression. A world where AI Overviews handle 70-80% of queries with embedded, lower-CPM ads is a structurally less profitable search business than the pre-AI baseline. Google’s response — growing query volume through better user experience and expanding beyond search into Assistant, Cloud, Workspace, and device AI — is the right playbook. Whether the revenue diversification happens fast enough to offset the core search margin compression is the question that Google’s financial results over the next three years will answer.

I/O 2026 showed a company that knows what game it is playing. Whether it wins that game is a different question.

The Second-Order Case for Cannibalising Search

ShaneParrish’s framework: first-order thinking sees the obvious outcome. Second-order thinking asks what happens after that.

The first-order reading of Google’s AI Overviews strategy is that it eats its own search ad business. AI Overviews answer questions without making users click through to publisher sites. Fewer click-throughs means lower ad impression volume on publisher sites, which means lower Google ad revenue over time. The evidence for this reading is in the traffic data: multiple studies published in the twelve months after AI Overviews launched showed click-through rates on informational queries declining by 15 to 35 percent on search results pages where an AI Overview appeared.

The second-order reading is different. Google’s ad revenue doesn’t come primarily from informational queries. It comes from transactional and commercial queries. The user who asks Google “what is inflation” is not the user Google monetises at premium CPM. The user who asks “best credit card for travel points” or “buy MacBook Air M4” is. AI Overviews are concentrated in the informational query space because that’s where LLMs perform most reliably. Commercial-intent queries remain click-heavy because the user is making a purchase decision, and a summary paragraph doesn’t substitute for price comparison.

The deeper second-order question is what happens if Google doesn’t build AI Overviews. If Google concedes the informational query layer to ChatGPT’s Browse mode or Perplexity, it concedes the attention entry point for users who start their online sessions with a question. Those users don’t stay on Google for the follow-up commercial query — they stay where they are. AI Overviews are Google’s effort to ensure that the answer to every question, even questions that don’t generate ad revenue today, is something Google shows the user. That positions Google for the monetisation of those queries when the format evolves.

The Gemini 2.0 Ultra benchmark performance claim from I/O 2026 matters less than it appears and more than the stock price movement suggests. It matters less because benchmark leadership in AI has a half-life measured in months. It matters more because enterprise AI procurement decisions are being made right now, and procurement teams use benchmark data as a decision shortcut. A company that can demonstrate its model leads on the benchmarks that procurement teams are using has a meaningful short-term conversion advantage over a company whose model is comparable but harder to evaluate — and Google is competing for enterprise AI infrastructure spend at a moment when that spend is being locked in for multi-year horizons.

ShaneParrish would frame the central question this way: not whether AI Overviews hurt today’s ad revenue, but what Google’s competitive position looks like in 2028 if it had chosen not to build them. That counterfactual answer is worse than any traffic decline the Overviews have produced so far. The cost of inaction in platform competition is rarely visible until it’s irreversible. That’s the lesson from every search disruption cycle that preceded this one.

Rhys Donnelly

Rhys Donnelly studied electrical engineering at Trinity College Dublin before pivoting to journalism. He has visited semiconductor fabs in Taiwan, South Korea, and TSMC’s Arizona facility. Based in San Francisco, he covers the full stack from process node economics to platform strategy, with particular focus on where the AI infrastructure buildout creates genuine constraints versus vendor narratives.

Latest Posts

Zoe Kessler

Adobe Firefly Crossed 12 Billion AI Generations and the Enterprise Creative Market Has Shifted

AI·6 min read·Updated Jun 23, 2026

Victor Hale

Solana DEX Volume Surpassed Ethereum Mainnet in Q2 2026 and Jupiter Drove the Shift

Crypto·6 min read·Updated Jun 23, 2026

Jamie Rowe

Netflix’s Reality TV Bet Is Driving Subscriber Growth and the Unscripted Format Has Matured

Streaming·6 min read·Updated Jun 23, 2026