CITED
Research12 min readMay 2026

How Do AI Search Engines Decide Which Sources to Cite?

AI tools don't rank websites — they verify entities. Here's the exact process ChatGPT, Perplexity, and Google AI Overviews use to decide who to cite in AI-generated answers.

W

Wahid Ryland

Founder, Cited · Sherman Oaks, CA

AI tools verify entities, not websites

When a user asks ChatGPT or Perplexity to recommend an accounting firm or a GEO agency, the model does not search Google. It cross-references its training data and, where applicable, live retrieval sources against a set of entity verification criteria. The businesses that get cited are the ones that have built clear, consistent authority signals across multiple trusted sources.

Understanding this process is the foundation of effective GEO. The citation decision follows a consistent pattern across platforms — entity verification, authority assessment, query matching, and recency weighting — and each step can be systematically improved.

Entity verification

The first question an AI model asks is whether this entity exists and can be verified. AI tools use Wikidata records, Wikipedia articles, and structured schema markup to resolve entity identity. A business with a clean Wikidata record, accurate schema markup, and consistent name and contact data across the web is significantly easier to verify — and therefore cite.

Businesses without a Wikidata entry exist in ambiguity. The model may know your name from web content, but it cannot verify whether the accounting firm mentioned on Reddit is the same entity as the website it found in training data. Ambiguous entities get cited less. Verified entities get cited more.

Authority assessment

Once an entity is verified, the model assesses authority by cross-referencing how many credible, independent sources mention this entity and in what context. The most-weighted sources are Reddit, LinkedIn, Wikipedia, review platforms including Trustpilot, G2, and Clutch, and editorial publications. As of March 2026, LinkedIn rose to the second most-cited domain overall for professional queries, with citation frequency doubling between November 2025 and February 2026.

Businesses that appear consistently and positively across these sources accumulate authority that AI tools recognise. The pattern is not about volume — it is about consistency and credibility. Ten genuine LinkedIn posts on your area of expertise, verified on Wikidata and mentioned in two editorial publications, outperforms 500 low-quality directory listings.

Query-to-entity matching

Citation is not just about authority — it is about relevance to the specific query. An AI model answering "best employment lawyer in London" needs to match that query to entities verified as employment lawyers operating in London. Businesses with clear, specific category positioning — reinforced through schema knowsAbout declarations, Wikidata property assignments, and consistent third-party descriptions — match queries more reliably.

Vague positioning hurts AI citation rates. A business described as "a full-service professional services firm" matches fewer queries than one consistently described as "a specialist employment law firm for technology companies in London." Specificity in your entity signals directly improves query matching accuracy.

Recency signals

AI tools weight freshness. Content under three months old is three times more likely to appear in AI citations than equivalent older content. Pages with recently updated dateModified schema, fresh content, and recent third-party mentions are cited more frequently than equivalent pages with stale signals.

This means GEO is not a one-time project. A Wikidata entry established in 2024 remains valuable, but the businesses winning category positions in AI citations are the ones continuously publishing fresh content, maintaining active LinkedIn presence, and earning new editorial mentions. Authority compounds. So does recency.

Frequently asked questions

Full Guide
AI Citation Strategy: The Full Guide
Read the guide

See where you stand in AI search.

Free AI visibility audit — 50+ queries across ChatGPT, Perplexity, Gemini, Claude, and Google AI Overviews. Delivered within 48 hours. No call required.