The three signals LLMs weight most
Across our audit of 1,200+ AI answers — and corroborated by the Princeton GEO paper (Aggarwal et al. 2024) and Allen AI's 2025 citation study — LLMs surface brands based on three layered signals.
- Entity consensus — the brand is mentioned by name across many independent third-party domains (Wikipedia, Reddit, Clutch, G2, niche directories, podcasts, news).
- Structured authority — the brand's own site ships clean schema, llms.txt, author bylines, dateModified, and content directly answering the prompt.
- Citation-friendly content — pages structured so the LLM can lift a clean factual sentence: quick-answer paragraph, tables, FAQs, source citations.
Why entity consensus matters most
LLMs are pattern-matchers. When 30 independent sources mention 'SkyBlue Growth Partners' alongside 'Bay Area marketing agency', the model learns the association and surfaces the brand when asked about Bay Area marketing agencies. When only your own site says it, the signal is weak.
This is why off-site work (Clutch, HARO, podcasts, guest posts, Reddit) is the highest leverage AEO investment for most SMBs. Schema and llms.txt are necessary but not sufficient.
Citation-friendly content patterns
LLMs lift short factual sentences. Pages that get cited consistently share patterns:
- A 1-3 sentence direct answer in the first 100 words
- Tables that can be screenshot or quoted verbatim
- Lists with parallel structure (each item is a complete fact)
- Specific numbers, dates, and named entities (not vague claims)
- Author byline with credentials (E-E-A-T signal)
- Source citations (LLMs trust sourced content more)
What doesn't work
Tactics that don't move AI citation rate: keyword stuffing, generic backlink building, AI-generated content with no original data, paid links, hidden text, doorway pages, over-optimized anchor text. LLMs were trained partly on SEO spam and are unusually good at down-weighting it.
The shortcut that does work: original primary research. A single original data study with quotable numbers gets cited hundreds of times across AI answers — far more leverage than 100 generic blog posts.