Research Synthesis

What the Data Actually Shows

Living research pages synthesized from patents, API leaks, court testimony, and large-scale citation studies. Each page is updated as new evidence emerges. Not opinions, not predictions. The accumulated weight of data.

How AI Search Decides What to Cite, and What It Ignores

Data-driven synthesis of AI citation mechanics across ChatGPT, Perplexity, Google AI Mode, and Gemini. What gets cited, why traditional SEO metrics barely matter, and what the pipeline actually looks like.

15+ studies across Ahrefs, Semrush, DEJAN AI, Profound, AirOps, and others

Read synthesis

The Reality Gap: What Google Says vs. What Google Measures

A clinical comparison of official search engine guidance against what their internal documentation, API leaks, and legal discoveries prove they actually measure. Quote vs. module vs. testimony.

Google API leak (14,014 attributes), DOJ antitrust testimony, spokesperson statements

Read synthesis

The Zero-Click Paradox: Fewer Clicks, Higher Conversion

Traffic volume is collapsing but traffic value is concentrating. Why the traditional CTR curve is dead and why the clicks that survive the AI filter convert at 5x the historical rate.

SparkToro, Semrush, Ahrefs, Seer Interactive, Superprompt, Pew Research Center

Read synthesis

Content Engineering: Building the Agent-Shaped Web

The shift from writing SEO content to building extractable data structures. How to architect pages for LLM ingestion, query fan-out, and agentic retrieval.

DEJAN AI, iPullRank, Semrush, Wellows, Growth Marshal, Seer Interactive, Surfer SEO

Read synthesis

Topical Authority: From SEO Folklore to Confirmed Signal

The 2024 API leak confirmed topical authority as a real, multi-signal system: siteFocusScore, siteRadius, topic embeddings, NsrChunks, and ClusterUplift. What the system measures, how it affects AI citation, and how to act on it.

Google API leak (14,014 attributes), Semrush, Surfer SEO, Graphite, iPullRank, Digital Bloom

Read synthesis

Programmatic SEO Architecture: When Data Is the Product

What separates programmatic SEO that ranks from programmatic SEO that gets suppressed. Data granularity, template architecture, Google's detection systems, rendering strategy, and why 96.55% of all indexed pages get zero traffic.

Google patents, API leak (14,014 attributes), Wise, Zillow, NerdWallet, G2, Ahrefs, Surfer SEO

Read synthesis

Content Decay: The 31% Refresh Threshold and the Dual Decay Curve

Content visibility half-life has collapsed to 3-6 months. A controlled 14,987-URL study shows only major content expansions (31%+ change) produce ranking gains. AI citations turn over 70% in 2-3 months.

RepublishAI (14,987 URLs), Seer Interactive, Authoritas, HubSpot, Google API leak, patent US8549014B2

Read synthesis

In-Page Information Architecture: Structuring Content for Three Audiences

Site architecture tells search engines how pages relate. In-page architecture determines how humans scan, how crawlers parse, and how AI systems extract. The patterns that serve all three overlap more than practitioners realize.

NN/g, Baymard Institute, DEJAN AI, Shashko (42,971 citations), iPullRank, WebAIM, Google API leak

Read synthesis

Unhelpful Content: The Patterns That Trigger Suppression

Google does not measure helpfulness. It detects unhelpfulness across 13 independent classifiers. Mapped from the 2024 API leak, patents, and HCU recovery data to the ranking pipeline stages where each pattern fires.

Google API leak (14,014 attributes), Panda/N-gram/NavBoost patents, SpamBrain taxonomy, Lily Ray HCU analysis

Read synthesis

Living Documents

These pages are continuously updated as new studies, patent filings, and platform changes emerge. Every claim is sourced. Certainty language reflects the weight of evidence, not conviction.