Updated May 8, 2026 | 14 min read

Why Isn't My Website Appearing in Google's AI Overview?

By Digital Strategy Force

Only 38% of pages cited in Google's AI Overview also rank in the organic top 10, down from 76% seven months earlier. Most websites earn zero AI Overview citations not because of organic rank but because they fail one or more of six measurable signals drawn from Google's Search Central documentation and independent measurement.

What Google's AI Overview Is Actually Doing on May 8, 2026

Google's AI Overview now appears on roughly 48% of tracked queries and serves over 2 billion monthly users globally. But most websites never appear in a single citation because the source-selection mechanism evaluates six measurable signals before any quality assessment begins: crawl eligibility, schema authority, topical cluster density, E-E-A-T signal strength, answer-chunk extractability, and multi-modal content coverage. A page can rank #1 organically and still be invisible to AI Overview if any of these six signals fails.

The most-cited number in AI Overview circles — that 76% of cited pages also rank in the top 10 — is no longer true. Ahrefs' 863,000-keyword update study measured 38% top-10 overlap in early 2026, a drop of 38 percentage points in seven months. Independent measurement from BrightEdge's twelve-month AI Overview retrospective places the same overlap at 17%. Either number means the same thing for buyers — most citations are now coming from pages that organic SEO would never surface.

Google's own documentation at Search Central's AI Features page describes the mechanism. AI Overview and AI Mode use a "query fan-out" technique that issues multiple related searches across subtopics, then evaluates supporting web pages against indexability, snippet eligibility, and structural quality before composing the answer. Pages that fail any prerequisite never enter the candidate set; pages that pass enter a ranked pool from which only three to five sources surface as visible citations.

May 8, 2026 is a useful audit date for two reasons. The first is that Gemini 2.5 now powers both AI Mode and AI Overviews in the U.S., with more inline citation links rendered next to specific bullet points — the citation surface itself has expanded. The second is that Google I/O 2026 lands May 19, eleven days from publication, with Gemini 4 expected to reset the underlying retrieval behavior. The window for an honest baseline is narrow.

The 2026 AI Overview Citation Pool — Four Numbers Every Buyer Should Audit Against

TOP-10 OVERLAP: 38% of AI Overview citations also rank in the organic top 10, down from 76% seven months earlier

QUERY TRIGGER: roughly 48% of tracked queries now surface an AI Overview, up 58% year over year

MONTHLY USERS: over 2 billion served by AI Overview across 200+ countries and 40+ languages

INDEPENDENT BENCHMARK: an independent twelve-month measurement places organic top-10 overlap as low as 17%

Sources: Ahrefs 863K-keyword AI Overview study (2026) · BrightEdge twelve-month AI Overview retrospective (Feb 2026) · Sundar Pichai NRF 2026 remarks

Why Top-10 Organic Rank Is No Longer the Path to AI Overview Citation

AI Overview citations from top-10 organic pages dropped from 76% to 38% in seven months, which means roughly 62% of citations now come from pages most users would never see on page one of search results. The buyer mental model that "rank higher and you'll appear in AI Overview" is the single most expensive misconception in the 2026 AEO market — it sends agencies and in-house teams chasing organic ranking signals while the actual selection mechanism evaluates a different set of attributes entirely.

The decoupling shows up in the data three different ways. Ahrefs' update study measured the 76% → 38% drop across 863,000 keywords and 4 million AI Overview URLs. A separate Ahrefs cross-platform analysis measured AI-cited URL top-10 overlap at just 12% across ChatGPT, Perplexity, Gemini, and Google AI Overview taken together. BrightEdge's twelve-month retrospective placed Google-only overlap at 17%. The three measurements, from two independent sources, differ in magnitude but point to the same conclusion: organic position is one input out of many, not the path.

The cause is mechanical, not editorial. AI Overview's query fan-out evaluates a candidate pool that includes pages ranking 11–100 and beyond, because the underlying retrieval optimizes for chunk-level answer quality rather than page-level relevance. A page that ranks #47 with a 180-word self-contained passage answering the user's exact sub-query can win the citation slot over a page that ranks #1 with a 4,000-word generalist guide where the answer is buried twelve scrolls down. The retrieval system is sub-chunking at paragraph and section boundaries — it is not reading the page as a unified document.

This is why the rest of the article walks through what AI Overview actually evaluates. The six signals are not "ranking factors" in the classical SEO sense — they are eligibility prerequisites that a page must satisfy before it can enter the candidate pool. Failing any one of them removes the page from the running regardless of organic rank, brand authority, or content depth. Buyers diagnosing why their site is invisible should start with the prerequisite signals first, then look at organic ranking last.

What Google AI Overview Selects vs Skips — Six Signal Patterns

Signal	Pattern of Cited Pages	Pattern of Skipped Pages
Crawl Eligibility	Indexed in Search Console; Google-Extended allowed in robots.txt; snippet-eligible	Blocked Google-Extended, noindex, paywalled without subscription markup
Schema Authority	Article + Author + about[] + mentions[] + sameAs Wikipedia coverage	Flat or missing JSON-LD; mentions[] empty; no author entity declared
Topical Cluster Density	5+ interlinked pages per topic with bidirectional internal links	Single thin reference page; orphaned content with no inbound or outbound links
E-E-A-T Signal Strength	Named author + bio + sameAs LinkedIn + outbound primary citations + brand mentions	Anonymous publication; no author bio; uncited claims; no third-party mentions
Answer-Chunk Extractability	100–300 word self-contained passages with declarative first sentences	Single 4,000-word block of unbroken prose; answer buried mid-document
Multi-Modal Content Coverage	Text + descriptive-alt images + embedded video + structured data	Text-only or image-only; no schema; no transcripts; no captioning

Sources: Google Search Central — AI Features documentation · Ahrefs 863K-keyword AI Overview study (2026)

The Six Signals Google's Documentation Says Actually Determine Citation

Google's own AI Features documentation states the eligibility baseline directly: a page must be indexed and snippet-eligible in Google Search to be shown as a supporting link in AI Overview. That single sentence rules out three categories of pages immediately — anything blocked by robots.txt, anything noindexed, and anything paywalled without proper subscription markup. Most invisibility complaints trace back to one of these three configurations and resolve before anyone needs to think about content strategy. Buyers running the Google Search Central AI Features audit against the eligibility checklist usually identify and fix the issue inside two days.
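The noindex and snippet part of that eligibility check can be sketched in a few lines of Python. This is a minimal illustration, not Google's own logic: the function name `snippet_blockers` is hypothetical, and it only looks at `noindex`, `nosnippet`, and `none` in robots meta tags and an optional `X-Robots-Tag` header value, assuming the page HTML has already been fetched.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {k.lower(): (v or "") for k, v in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for token in attr.get("content", "").split(","):
                self.directives.add(token.strip().lower())


def snippet_blockers(html, x_robots_tag=""):
    """Return the directives found that block indexing or snippets (hypothetical helper)."""
    parser = RobotsMetaParser()
    parser.feed(html)
    header = {t.strip().lower() for t in x_robots_tag.split(",") if t.strip()}
    blocking = {"noindex", "nosnippet", "none"}
    return sorted((parser.directives | header) & blocking)


print(snippet_blockers('<meta name="robots" content="noindex, follow">'))
```

An empty list means none of these three directives was found. It does not prove the page is indexed, which still has to be confirmed in Search Console.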

Beyond eligibility, Google's official guidance on succeeding in AI search experiences reiterates that "the best practices for SEO remain relevant for AI features." The phrasing matters — best practices, not ranking position. Schema markup, descriptive headings, semantic HTML, and chunk-level extractability all sit inside the SEO best-practice envelope but are weighted differently for AI retrieval than for blue-link ranking. A page that satisfies the SEO checklist but ships a single 4,000-word block of unbroken prose passes ranking eligibility while failing extractability. Both are required.

The six signals collapse Google's published guidance plus three years of independent measurement into a buyer-side audit list. Every signal is testable from a browser, the Search Console interface, or a JSON-LD validator. None of the six requires proprietary data or paid tooling. The first three (Crawl Eligibility, Schema Authority, Topical Cluster Density) are technical prerequisites that gate the candidate pool.

The next two (E-E-A-T Signal Strength, Answer-Chunk Extractability) are qualitative attributes that determine ranking within the pool. The last (Multi-Modal Content Coverage) is a multiplier — a page passing the first five signals with rich multi-modal content is consistently selected over a similar page with text-only content.

Schema authority is the most underweighted signal in the 2026 AEO market. Google's How Search Works documentation describes structured data as the explicit map of content's meaning, and the Helpful Content guidelines reinforce that machine-readable metadata makes E-E-A-T signals legible to AI crawlers. The buyer test is simple: open any page that should appear in AI Overview, view source, search for `application/ld+json` (the script type that marks a JSON-LD block), and count the populated fields. Pages with citation arrays, mentions arrays, sameAs Wikipedia URLs, and an author entity ID consistently win citations over pages that ship only a flat Article schema.
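The view-source test can also be scripted. The sketch below uses only the Python standard library to pull JSON-LD blocks out of a page's HTML and list the fields that are actually populated. The names `extract_jsonld` and `populated_fields` are illustrative, and the sketch assumes each block is a single JSON object rather than an array or `@graph`.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the parsed contents of <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.blocks.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # malformed JSON-LD is skipped, not repaired


def extract_jsonld(html):
    parser = JsonLdExtractor()
    parser.feed(html)
    return parser.blocks


def populated_fields(block):
    """Keys whose values are not empty (None, "", [] and {} count as empty)."""
    return sorted(k for k, v in block.items() if v not in (None, "", [], {}))


html = '<script type="application/ld+json">{"@type": "Article", "headline": "X", "mentions": []}</script>'
for block in extract_jsonld(html):
    print(populated_fields(block))
```

In this example the empty `mentions` array is not counted, which matches the article's point that a declared but empty field carries no signal.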

[Figure: AI Overview Prevalence Growth, February 2025 to February 2026. Source: BrightEdge twelve-month AI Overview retrospective (Feb 2026)]

Industry Asymmetry — Why Some Verticals Trigger AI Overview on 80% of Queries

AI Overview prevalence is not distributed evenly across industries. Education queries trigger an AI Overview on 83% of searches, up from 18% twelve months earlier. B2B Tech queries trigger on 82%, up from 36%. Restaurant queries trigger on 78%, up from 10%. Industries with informational, comparison-heavy, or research-driven query patterns saw the steepest acceleration; industries with strong transactional or navigational query patterns saw smaller but still material increases. The implication is mechanical — buyers in high-prevalence verticals see most of their query universe answered before any blue link is clicked.

The asymmetry compounds with citation concentration. BrightEdge's one-year retrospective documented that the top 1% of cited domains capture 47% of all citations, while schema-marked pages get cited 2.3× more often than unmarked pages of similar quality. In high-prevalence verticals, the top 1% concentration plus the schema multiplier produces a winner-takes-most market — twenty domains in education, fifteen in B2B tech, thirty in restaurants effectively own the citation slot for their respective query universes.

Three Verticals Where AI Overview Now Dominates the Query Universe

EDUCATION: 83% of education queries, up from 18% in February 2025

B2B TECH: 82% of B2B technology queries, up from 36%, more than doubled in twelve months

RESTAURANTS: 78% of restaurant queries, up from 10% in February 2025

Source: BrightEdge twelve-month AI Overview retrospective (Feb 2026)

The vertical-asymmetry data refutes the most common buyer hypothesis — that AI Overview is a generic problem with a generic fix. It is not. Buyers in education, B2B tech, restaurants, healthcare, and finance face a different competitive landscape than buyers in retail, automotive, or industrial verticals where AI Overview prevalence still sits below 50%. The diagnostic the article walks through next does not change between verticals; the urgency and the competitive density both do.

Top-10 organic rank predicted AI Overview citation in 2024. By early 2026, only 38% of cited pages also rank top-10 — and in the most competitive verticals, even that overlap fragments further. The buyer mental model has not caught up to the data.
— Digital Strategy Force, Search Intelligence Division

High-prevalence verticals also experience a second-order effect: the citation slots are concentrating faster than the query universe is expanding. Twenty cited domains capturing roughly half of all education AI Overview citations means the marginal twenty-first domain has materially worse odds than the median twentieth-ranked organic competitor. Crawl eligibility plus schema authority alone are not enough in a vertical where competitors have already maxed both — at that point the differentiator becomes topical cluster density and answer-chunk extractability, the two signals most agencies still treat as optional.

DSF AOVD Signal Severity — Which Signals Block the Most Pages

Signal	Severity Tier
Crawl Eligibility	High
E-E-A-T Signal Strength	High
Schema Authority	High
Answer-Chunk Extractability	High
Topical Cluster Density	Medium
Multi-Modal Content Coverage	Medium

Framework: Digital Strategy Force AOVD — qualitative tier indicators per signal

Introducing the DSF AI Overview Visibility Diagnostic (AOVD)

The DSF AI Overview Visibility Diagnostic is a 6-signal audit that scores a website's eligibility to appear in Google's AI Overview by measuring crawl eligibility, schema authority, topical cluster density, E-E-A-T signal strength, answer-chunk extractability, and multi-modal content coverage. The Digital Strategy Force Search Intelligence Division built it specifically as the buyer-side counterpart to Google's published AI Features documentation: a 14-day manual self-audit (or a 90-minute tool-assisted session) that produces a numbered diagnostic report any agency or in-house team can act on. The manual path needs no paid tooling.

Each of the six signals is grounded in Google's own language. Crawl Eligibility maps directly to "indexed and eligible to be shown in Google Search with a snippet." Schema Authority maps to the structured-data emphasis throughout the Search Central documentation. Topical Cluster Density maps to the topical-authority guidance in the Helpful Content guidelines.

E-E-A-T Signal Strength maps to Google's published quality-rater framework. Answer-Chunk Extractability maps to the snippet eligibility requirement plus the chunk-level retrieval pattern documented in academic work on AI search. Multi-Modal Content Coverage maps to the visible AI Overview rendering behavior — image-heavy answers, video previews, schema-driven inline links.

The diagnostic is designed to be run on a sample of 25 priority pages — the pages a brand most wants to appear in AI Overview, not the entire site. Each page receives a score from 0 to 6 across the six signals; the page-level score plus the cross-page pattern produces the audit's headline finding. A page scoring 6/6 with no AI Overview citation suggests the eligibility prerequisites are met but competitive pressure is the binding constraint. A page scoring 3/6 has fixable problems before any competitive analysis is needed.
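The scoring arithmetic is simple enough to sketch. The snippet below assumes one pass/fail point per signal, which is one reading of the 0-to-6 score described above. The signal keys and function names are illustrative, not part of any published AOVD tooling.

```python
SIGNALS = (
    "crawl_eligibility",
    "schema_authority",
    "topical_cluster_density",
    "eeat_signal_strength",
    "answer_chunk_extractability",
    "multi_modal_coverage",
)


def page_score(results):
    """Count passed signals for one page; a missing signal counts as a failure."""
    return sum(1 for s in SIGNALS if results.get(s, False))


def failure_counts(pages):
    """Map each signal to the number of audited pages that fail it."""
    return {s: sum(1 for r in pages.values() if not r.get(s, False)) for s in SIGNALS}


# Two hypothetical pages from a 25-page sample.
pages = {
    "/guide": {s: True for s in SIGNALS},
    "/pricing": {"crawl_eligibility": True, "schema_authority": True, "eeat_signal_strength": True},
}
for url, results in pages.items():
    print(url, f"{page_score(results)}/6")
print(failure_counts(pages))
```

The per-page score answers "is this page fixable?", while the per-signal failure count exposes the cross-page pattern, for example a schema gap that repeats across most of the sample.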

Independent academic work backs the diagnostic's relevance. Onweller et al.'s May 2026 paper "Cited but Not Verified" measured frontier-model citation behavior across leading deep-research agents. Even the strongest models maintain link validity above 94% and topical relevance above 80%, but achieve only 39–77% factual accuracy on the cited content. The implication for AOVD is direct — pages that score high on schema authority and chunk extractability are precisely the pages frontier models can cite with confidence, while pages that fail either signal are systematically deprioritized regardless of their actual quality.

[Figure: From Indexed Page to AI Overview Citation, the Six-Signal Attrition Funnel. Sources synthesis: Google Search Central AI Features documentation · Ahrefs (2026) · BrightEdge (Feb 2026) · framework: Digital Strategy Force AOVD]

Running the AOVD — A 14-Day Self-Audit Walkthrough

The 14-day cadence is calibrated against the time required to gather evidence rather than the time required to interpret it. Most signals are testable in seconds; the bottleneck is the sample size — auditing 25 priority pages across six signals produces 150 measurements, which is more than enough statistical density to identify systemic gaps without requiring a full crawl of the site. The schedule below assigns roughly two days per signal cluster with a buffer for synthesis on the final day.

Days 1–3 audit Crawl Eligibility. The buyer opens Google Search Console, exports the top 25 priority URLs, and confirms each is indexed with a status of "Submitted and indexed." The buyer then opens robots.txt for the site and confirms Google-Extended is allowed (or explicitly noted as blocked, which is a deliberate choice with known visibility cost). The buyer finally tests one URL through the URL Inspection tool to confirm snippet eligibility. Any URL failing this stage is removed from the candidate pool until the eligibility issue is fixed.
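The robots.txt part of the Days 1–3 check can be approximated with Python's standard `urllib.robotparser`. This is a rough sketch: the helper name `crawler_access` is hypothetical, the robots.txt text is assumed to have been downloaded already, and Python's parser applies rules in file order, so results may differ from Google's own matching on complex files.

```python
from urllib.robotparser import RobotFileParser


def crawler_access(robots_txt, url, agents=("Googlebot", "Google-Extended")):
    """Report, per user agent, whether robots.txt allows fetching `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


# Example robots.txt that opts out of Google-Extended only.
robots_txt = """\
User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
"""

print(crawler_access(robots_txt, "https://example.com/priority-page"))
```

Recording both agents side by side makes it explicit whether a Google-Extended block is a deliberate choice or an accident.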

Days 4–6 audit Schema Authority. The buyer opens each URL, views source, and counts populated JSON-LD fields. Minimum baseline: Article, Author entity ID, datePublished, dateModified, headline, image, citation array, mentions array, sameAs URL on author. Pages missing more than three of these fields fail the signal at this audit.
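The "missing more than three fields" rule can be expressed as a small check over a parsed JSON-LD object. The sketch below is one interpretation of the baseline list above. The function names are hypothetical, it treats `@type` as the plain string `Article`, and it inspects only the first author when several are declared.

```python
def missing_baseline_fields(block):
    """Return the baseline items that are absent or empty in one JSON-LD object."""
    author = block.get("author") or {}
    if isinstance(author, list):  # several authors declared: inspect the first
        author = author[0] if author else {}
    if not isinstance(author, dict):  # author given as a bare string
        author = {}
    checks = {
        "Article type": block.get("@type") == "Article",
        "author @id": bool(author.get("@id")),
        "author sameAs": bool(author.get("sameAs")),
        "datePublished": bool(block.get("datePublished")),
        "dateModified": bool(block.get("dateModified")),
        "headline": bool(block.get("headline")),
        "image": bool(block.get("image")),
        "citation": bool(block.get("citation")),
        "mentions": bool(block.get("mentions")),
    }
    return [name for name, ok in checks.items() if not ok]


def passes_schema_authority(block, max_missing=3):
    """Fail when more than `max_missing` baseline items are missing."""
    return len(missing_baseline_fields(block)) <= max_missing


block = {
    "@type": "Article",
    "headline": "Example headline",
    "datePublished": "2026-05-08",
    "dateModified": "2026-05-08",
    "image": "https://example.com/hero.png",
    "author": {"@id": "https://example.com/#author"},
}
print(missing_baseline_fields(block), passes_schema_authority(block))
```

Listing the missing items by name, rather than returning only pass or fail, gives the synthesis day a concrete fix list per page.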

Days 7–9 audit Topical Cluster Density — the buyer counts inbound and outbound internal links per page and confirms each priority page sits inside a cluster of five or more interlinked pages on the same topic. Days 10–11 audit E-E-A-T — the buyer confirms author bio, sameAs LinkedIn, primary outbound citations, and at least three third-party brand mentions per priority page.
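The Days 7–9 link count can be sketched as a check over an internal-link map. The snippet assumes the links have already been collected into a dictionary of page to linked pages, and it uses a deliberately strict one-hop reading of "cluster": the page itself plus the pages it exchanges links with in both directions. The function names are illustrative.

```python
def bidirectional_neighbors(links, page):
    """Pages that `page` links to and that also link back to `page`."""
    return {
        target
        for target in links.get(page, set())
        if target != page and page in links.get(target, set())
    }


def in_cluster(links, page, min_size=5):
    """True when `page` plus its reciprocal neighbors number at least `min_size`."""
    return len(bidirectional_neighbors(links, page)) + 1 >= min_size


# Hypothetical internal-link map: a hub page with four reciprocal spokes and one orphan.
links = {
    "hub": {"a", "b", "c", "d"},
    "a": {"hub"},
    "b": {"hub"},
    "c": {"hub"},
    "d": {"hub"},
    "orphan": set(),
}
print(in_cluster(links, "hub"), in_cluster(links, "orphan"))
```

Under this strict measure a spoke page such as `a` does not pass on its own, so a real audit may prefer to credit every member of a qualifying hub's cluster.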

Days 12–13 audit Answer-Chunk Extractability by reading each page and identifying every 100–300 word self-contained passage. Day 14 synthesizes findings into a numbered diagnostic report.
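The length part of the Days 12–13 check can be automated as a first pass. The sketch below splits plain text on blank lines and keeps paragraphs of 100 to 300 words. Whether a passage is self-contained and opens with a declarative sentence still needs a human read, and the function names and the three-passage threshold (taken from the checklist later in this article) are used here as assumptions.

```python
import re


def extractable_passages(text, min_words=100, max_words=300):
    """Return paragraphs whose word count falls inside the target range."""
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    return [p for p in paragraphs if min_words <= len(p.split()) <= max_words]


def passes_extractability(text, min_passages=3):
    """True when the page has at least `min_passages` passages in range."""
    return len(extractable_passages(text)) >= min_passages


# Synthetic page text: one short, one in-range, one over-long paragraph.
good = " ".join(["word"] * 150)
text = "\n\n".join(["Too short.", good, " ".join(["word"] * 400)])
print(len(extractable_passages(text)), passes_extractability(text))
```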

The Digital Strategy Force Answer Engine Optimization (AEO) practice runs the AOVD as a productized engagement deliverable — the 90-minute version uses tooling to compress the 14-day buyer self-audit into a single working session. The buyer-side discipline is the same; the difference is automation of the evidence-gathering steps so the synthesis day is the focus rather than the bottleneck. Either path produces the same numbered diagnostic report.

DSF AOVD 6-Signal Diagnostic Checklist — Self-Audit Scorecard

Crawl Eligibility (High): Confirm 25 priority URLs are indexed in Search Console; verify Google-Extended access in robots.txt

Schema Authority (High): Validate Article, Author, citation[], mentions[], sameAs Wikipedia coverage on every priority URL

Topical Cluster Density (Med): Map five or more interlinked pages per priority topic with bidirectional internal links

E-E-A-T Signal Strength (High): Confirm named author + bio + sameAs + primary citations + three third-party brand mentions

Answer-Chunk Extractability (High): Identify three or more 100–300 word self-contained passages with declarative first sentences per page

Multi-Modal Coverage (Med): Verify text + descriptive-alt images + embedded video or transcript + structured data per priority page

Framework: Digital Strategy Force AOVD, six-signal scorecard with severity tiers

What Google I/O 2026 on May 19 Will Likely Change About AI Overview Selection

Google I/O 2026 lands eleven days from publication, with Gemini 4 expected at the keynote and AI Mode v2 expected to deepen Chrome integration and expand the query universe that triggers an AI answer. The trajectory is already visible in the April–May 2026 announcements. Gemini 2.5 now powers AI Mode and AI Overviews in the U.S., and Google's product update describes more inline citation links rendered next to specific bullet points — meaning the citation surface itself is widening, not contracting.

The six AOVD signals are unlikely to change at I/O 2026, but their weights will shift. The most plausible adjustment: schema authority and answer-chunk extractability gain weight as Gemini 4's longer context window allows the retrieval system to evaluate more candidate chunks per query. Pages with twenty self-contained 200-word passages become viable candidates for twenty different sub-queries; pages with one 4,000-word block become candidates for at most one. Multi-modal coverage also gains weight as the citation surface widens to include image, video, and transcript-based answers natively.

[Figure: How AI Overview's Query Fan-Out Selects Sources. Source: Google Search Central AI Features documentation (query fan-out specification)]

Buyers should treat the I/O 2026 announcements as a recalibration trigger rather than a strategy reset. The AOVD audit run on May 8 gives a baseline; the I/O 2026 announcements on May 19–20 indicate which signals to re-weight in the rerun on June 19. Pages that scored 6/6 on May 8 should retain candidate-pool eligibility under the new model; pages that scored 4/6 may shift up or down depending on which specific signal Gemini 4 rewards more heavily. Locking in a baseline before I/O is the only way to measure the post-event delta.

The harder question is what happens at the buyer's third-party signal layer. Google's April 2026 AI updates roundup and Sundar Pichai's NRF 2026 remarks both signal that the citation surface is moving toward more direct integration with Workspace, Cloud, and the Personal Intelligence layer. That means the brand's E-E-A-T signal strength is increasingly evaluated against signals Google itself controls (Knowledge Graph, Wikipedia entity coverage, third-party mention density on partner platforms). The buyer side of E-E-A-T is shrinking; the platform side is growing. AOVD scoring should therefore weight the platform-side signals higher in the next iteration.

FAQ — AI Overview Visibility Diagnostic

Why does my website rank #1 organically but never appear in AI Overview?

Top-10 organic rank is no longer a reliable predictor — only 38% of cited pages also rank top-10 according to Ahrefs' 863K-keyword update study, down from 76% seven months earlier. The actual selection mechanism evaluates six prerequisites (crawl eligibility, schema authority, topical cluster density, E-E-A-T signal strength, answer-chunk extractability, multi-modal coverage) before considering organic position. Run the DSF AOVD audit on the page to identify which prerequisite is failing.

How do I check if my website is eligible to appear in AI Overview?

Google Search Central's AI Features documentation states the technical baseline directly: a page must be indexed and eligible to be shown with a snippet, fulfilling the standard Search technical requirements. The Digital Strategy Force AOVD framework operationalizes this into a 14-day self-audit: confirm Search Console indexing status, verify Google-Extended access in robots.txt, and validate snippet eligibility through the URL Inspection tool. Pages failing any prerequisite are removed from the AI Overview candidate pool.

Does blocking Google-Extended in robots.txt remove me from AI Overview?

Not on its own. Google-Extended governs whether content is used for Gemini model training and for grounding in Gemini Apps and the Vertex AI API; Google's crawler documentation states that it does not affect a site's inclusion in Google Search. AI Overview, as a Search feature, draws on the standard Googlebot crawl, so its citation surface is controlled by Googlebot access, noindex, and snippet controls. Blocking only Google-Extended preserves classic Google Search visibility while opting out of the AI training and Gemini grounding ecosystem. Most enterprises allow both by default; opting out is a deliberate choice whose visibility cost falls on Gemini-powered surfaces outside Search.

Will adding more schema markup get me into AI Overview?

Schema is one of six required signals — necessary but not sufficient. BrightEdge's twelve-month measurement found schema-marked pages cited 2.3× more often than unmarked equivalents, but only when the other five signals (crawl eligibility, topical cluster density, E-E-A-T, chunk extractability, multi-modal coverage) also pass. Adding Article + Author + about[] + mentions[] + sameAs Wikipedia coverage to a page that fails on chunk extractability still produces no citation lift.

How long after a content change does AI Overview update its citations?

AI Overview citation updates follow Google's standard recrawl and reindex cadence — typically two to fourteen days for high-authority sites and longer for newer domains. The Digital Strategy Force AOVD framework includes a freshness sub-check that flags any priority page where dateModified has not advanced in the last six months, since freshness is one of the inputs to the underlying ranking signal. The median cited page is approximately fourteen months old per BrightEdge data, so freshness is a multiplier rather than a binary requirement.
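The freshness sub-check reduces to a date comparison. The sketch below flags a page whose `dateModified` is older than roughly six months, approximated here as 183 days. The function name `is_stale` and the day count are assumptions for illustration.

```python
from datetime import date, timedelta


def is_stale(date_modified, today, max_age_days=183):
    """True when an ISO 8601 dateModified value is more than `max_age_days` old."""
    modified = date.fromisoformat(date_modified[:10])  # keep only YYYY-MM-DD
    return (today - modified) > timedelta(days=max_age_days)


audit_day = date(2026, 5, 8)
print(is_stale("2025-09-01", audit_day))            # older than six months
print(is_stale("2026-03-01T09:00:00Z", audit_day))  # modified recently
```

Passing the audit date in explicitly keeps the check reproducible when the audit is rerun later.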

Will Google I/O 2026 change which websites get cited in AI Overview?

The six prerequisite signals are unlikely to change at I/O 2026, but their weights will likely shift. Gemini 4's longer context window plausibly increases the weight on schema authority and answer-chunk extractability, since the retrieval system can evaluate more candidate chunks per query. Multi-modal coverage also gains weight as the citation surface widens to include image, video, and transcript-based answers. The recommended buyer cadence is to lock in an AOVD baseline before May 19 and rerun the audit after the I/O announcements settle on June 19.

Next Steps — AI Overview Visibility Diagnostic

The AI Overview source-selection mechanism operates on prerequisites rather than ranking signals. The buyer-side path forward is the 14-day AOVD self-audit run against 25 priority URLs before Google I/O 2026 announcements settle, then a second audit pass on June 19 to measure the platform-update delta. Digital Strategy Force runs the diagnostic as a productized engagement; the same discipline applies to in-house teams running it from a browser plus Search Console.

▶ Begin the 14-day DSF AI Overview Visibility Diagnostic (AOVD) on your top 25 query targets before Google I/O 2026 lands May 19
▶ Audit Google-Extended access and snippet eligibility for each priority URL using Google Search Central's official documentation
▶ Map your topical cluster density — count interlinked pages per top-priority topic and confirm bidirectional internal links across the cluster
▶ Sample ten of your highest-priority pages for chunk extractability — identify three or more 100–300 word self-contained passages with declarative first sentences per page
▶ Diary the May 19 Google I/O 2026 announcements as a recalibration trigger and schedule the AOVD rerun for June 19, one month after the keynote

Need a numbered AOVD diagnostic report on your top 25 priority URLs before Google I/O 2026 lands? Explore Digital Strategy Force's Answer Engine Optimization (AEO) services and lock in your AI Overview visibility baseline this week.
