Someone just analyzed 21,000 ChatGPT citations to figure out exactly which pages AI picks as sources. The findings are wild, a little depressing if you are not in the top tier, and genuinely actionable if you move on them.
Here is what they found, what it means, and what to actually do about it.
AI Citation Is Winner-Take-Most
The study looked at 21,482 citation rows across 670 unique domains and 127 prompts. The top 10 domains captured 46% of all citations on any given topic. The top 30 domains captured 67%.
Thirty seats at the table. Everything else is essentially invisible.
This is not a temporary quirk of how these models work. It reflects a deeper structural reality: AI systems are trained on the internet as it exists, and the internet as it exists is heavily concentrated in authority. The same sites that dominate Google search tend to dominate AI citation. The feedback loop reinforces itself.
Who Is Actually Getting Cited
The citation breakdown across domains follows a predictable pattern. Established media properties, government and institutional sources, and platforms with massive content depth dominate across almost every topic category.
| Source Type | Citation Share | Why |
|---|---|---|
| Wikipedia | 47% of all citations | Structured, neutral, heavily interlinked |
| Major media (.com, .org) | Top 10 domain share | Domain authority, editorial standards |
| Government/institutional (.gov, .edu) | High per-topic | Trustworthiness signals |
| Industry-specific authorities | Topic-dependent | Topical depth and specificity |
| Original research publishers | Growing share | Unique data that cannot be paraphrased away |
The pattern across all of these: they are not just high-authority in general. They are specifically authoritative on the topics they cover. A site with 500 deeply researched articles on a narrow topic consistently outperforms a generalist site with 5,000 surface-level posts in citation frequency for that specific topic.
What This Means If You Are Not Wikipedia
The brands and agencies I work with are not Wikipedia. Most of them are not The New York Times either. So the question is whether any of this is actionable for a business that does not have a 20-year head start on domain authority.
It is. But the path is narrower than most content marketing advice suggests.
Topical depth beats topical breadth. A site that is the definitive resource on one specific subject — say, answer engine optimization for B2B SaaS brands — will get cited more often in that niche than a site trying to cover all of digital marketing at a surface level. The AI citation model rewards specificity.
Original data is the highest-value citation magnet. When a site produces research that cannot be found anywhere else, it becomes a primary source. Primary sources get cited not just by humans but by AI systems, which need authoritative data to back up claims. A single piece of original survey research or proprietary data analysis can generate more citation equity than 50 generalist blog posts.
Structured content outperforms prose. The study found that pages with clear FAQ sections, defined terminology, step-by-step processes, and comparison tables were cited at significantly higher rates than equivalent content written as flowing prose. AI systems need to extract specific answers, and structured content makes that extraction easier and more reliable.
The Three Moves That Actually Work
1. Pick a Lane and Own It
Stop trying to be the answer to everything in your industry. Pick the 3 to 5 topics where you have genuine, differentiated expertise and go three levels deeper than any competitor. Publish frequently on those topics. Link internally between all of them. Build a content cluster so deep that anyone who stumbles into it gets everything they need from your site alone.
This is exactly how smaller sites break into AI citation on specific topics. Not by competing with Wikipedia on general knowledge, but by becoming the definitive source on something specific enough that Wikipedia does not cover it well.
2. Produce One Piece of Original Research Per Quarter
A survey of 500 people in your industry. A proprietary analysis of publicly available data. A benchmark report based on your client base with the identifying details removed. Any of these creates data that only you have, which means any AI system that wants to make a specific factual claim on that topic has to cite you or produce nothing.
Original research is expensive to produce and cheap to distribute. It earns links, earns citations, and earns the kind of authority that no amount of well-written blog posts can manufacture.
3. Structure Everything for Extraction
Every piece of content you publish should be built assuming an AI system is going to try to extract specific answers from it. That means a direct answer in the first paragraph of every section. FAQ schemas on every page that answers questions. Numbered lists for processes. Comparison tables for anything that involves trade-offs. Clear definitions for any term you introduce.
The Google Structured Data guidelines are the right starting point for implementation. But the structural thinking should precede the technical implementation — write for extraction first, then mark it up.
The Honest Timeline
None of this happens overnight. The sites that dominate AI citation today built their authority over years. But the window for establishing topical authority in specific niches is open right now in a way it will not be in three years, when the market catches up to what is happening.
The brands that invest in original research, topical depth, and structured content in 2026 are building the citation authority that compounds through 2028 and beyond. The brands that wait are going to find themselves paying to advertise their way back into a game they could have won organically.
If you want help mapping out what a citation authority strategy looks like for your specific niche, reach out to the Woodside team. We have been tracking this space closely and can tell you quickly where the open seats at the table are in your market.
Joey Rahimi is the founder of Woodside Ventures and Aiken House, tracking generative engine optimization and AI citation strategy for brands building long-term search authority.

