Blog / Original Research

AI Citation Rate Benchmarks by Brand Size 2026: Startup vs Growth vs Enterprise (200-Site Cohort Study)

Q: What is a good AI citation rate in 2026?

There is no single good citation rate, which is the reason this study exists. Across the 200 Attrifast-instrumented sites, the cohort-wide median citation rate is roughly 4.1% across the four major engines, with a p25 of 1.7% and a p75 of 8.9%. But that headline number conflates four very different operator situations. If you are a sub-10-person startup, the honest benchmark is 0.8 to 2.1%; anything above 3% is exceptional. If you are at 10-50 people, the median is around 3.4% and a strong rate is 6-8%. If you are 50-500, the median is 6.2% and 10%+ is achievable. If you are 500+, the median is 11.4% and the leaders sit above 18%. The Peec.ai and Profound public headline numbers conflate these bands and tend to overstate what a small operator should realistically target.

Q: How do you define AI citation rate?

Citation rate, for this study, is the share of buyer-intent prompts in a tracked prompt set where your domain appears as a citation in the AI engine's answer, averaged across runs and engines unless we explicitly break by engine. So if you track 100 prompts on ChatGPT and your domain appears in 12 of them, your ChatGPT citation rate is 12%. The blended cohort number averages across ChatGPT Search, Claude with web search, Gemini, and Perplexity. We deduplicate to one domain-appearance per prompt-run regardless of how many pages on your domain were cited, because the per-prompt-page-count question is a separate distribution. This matches the definition Peec.ai uses in their citation benchmark, and is similar (with minor methodology differences) to how Profound, Otterly, and SEOcrawl report presence rates.

Q: Why does brand size matter so much for AI citation rate?

Three structural reasons. First, AI engines weight brand-entity recognition in their training corpus, and bigger brands have more co-occurrence in pretraining data, podcasts, news, Wikipedia, and Reddit. Second, bigger brands tend to have higher Domain Rating, more pages indexed, and more pages that already rank on classic search, which gets them into the retrieval candidate pool more often. Third, bigger brands have more editorial coverage on G2, Capterra, Wirecutter, NerdWallet, and the kind of independent review sites that capture roughly 22% of all citation slots across the verticals we measured. None of these levers move much for a 6-person startup in its first 18 months. So comparing a sub-10 startup against the cohort-wide median is comparing apples to a fleet of trucks.

Q: How big is the Attrifast cohort and how was it built?

200 sites total, all using Attrifast's first-party AI traffic and citation tracking between January and April 2026. The distribution is roughly 50 solo and small-team sites (0-10 employees), 80 growth-stage sites (10-50 employees), 50 mid-market sites (50-500 employees), and 20 enterprise sites (500+ employees). The sample skews B2B SaaS, fintech, and consumer software because that is where Attrifast's customer base sits, but it intentionally spans the same 12 verticals we used in our earlier industry-cut study so the numbers reconcile. Sites were classified by headcount using LinkedIn, by ARR band using customer self-report or public data, by domain authority using Ahrefs DR snapshots taken in April 2026, and by content age using earliest page publish date in our crawl.

Q: How does this compare to the Peec.ai 1-million-citation benchmark?

Peec.ai's report is excellent and the dataset is much larger than ours, but it cuts the data primarily by engine and aggregates across customer brands without breaking by brand size. Their headline median citation rates fall in roughly the same range as our blended cohort median, which is reassuring as a sanity check on the broader methodology. Where we diverge is on the implication: their averages, applied directly by a small operator, set expectations that are achievable only for mid-market and enterprise sites. Our brand-size cut is designed to be read alongside theirs, not against it. Profound's public benchmarks tell a similar single-cohort story, and SEOcrawl's prompt tracking shows similar variance but with less explicit methodology.

Q: What citation rate should a sub-10-person startup expect?

Based on the 50 sub-10-person sites in the cohort, the median blended citation rate is around 1.4% with a p25 of 0.8% and a p75 of 2.1%. Anything above 3% at this size is genuinely exceptional and almost always reflects either a strong technical-content moat, deep Reddit and Hacker News presence, or a founder whose personal brand drives entity recognition. The most common mistake at this size is benchmarking against the cohort-wide 4.1% median or the Peec.ai-style headline numbers and concluding the strategy is broken. It is not broken, the comparison is wrong.

Q: What citation rate should a 10-50 person growth-stage company expect?

Across the 80 sites in that band, the median is roughly 3.4%, with a p25 of 1.9% and a p75 of 5.6%. The top decile sits around 7.8%. This is the band where brand-mention investment starts to compound visibly, Reddit and podcast presence pays back fastest, and the structural GEO levers (FAQ schema, answer-shaped passages, entity disambiguation) start producing measurable citation lift over the course of a quarter. If you are in this band and below 1.5%, the issue is usually either weak retrieval candidacy (you do not rank on classic search) or weak entity disambiguation (the model is not sure your brand name means you).

Q: What citation rate should a 50-500 person company expect?

Mid-market sites in our cohort show a median citation rate of 6.2%, a p25 of 3.7%, and a p75 of 10.4%. The top decile reaches roughly 14%. At this size most operators have a critical mass of editorial coverage, real Reddit threads with organic mentions, decent DR (typically 50-70), and a content library old enough that the training corpora include meaningful coverage. The most common gap at this size is engine concentration: a mid-market site might be at 9% on Perplexity, 7% on ChatGPT, and 1.4% on Gemini, because Gemini's narrower trusted-source pool is hard to crack without serious editorial PR investment.

Q: What citation rate should a 500+ person enterprise expect?

Across the 20 enterprise sites, the median is roughly 11.4%, p25 9.1%, p75 16.8%. The leaders sit above 18%, and a handful of category-leading brands clear 25% on their own branded query set. At this size the lever is no longer earning the next citation, it is defending the share you have against new entrants and managing engine-level concentration. Enterprises also typically see the smallest YoY citation growth, because they are already near the ceiling of available citation slots for their category and the marginal next slot increasingly costs editorial PR rather than structural content work.

Q: Does Domain Rating predict citation rate?

Loosely. Across the full 200-site cohort, the correlation between Ahrefs DR and blended citation rate is roughly 0.46, which is positive but a long way from deterministic. DR explains around 21% of the variance in citation rate, leaving 79% to other factors (content age, brand-mention density, structural quality, engine-specific behavior, vertical). The relationship is also non-linear: each additional DR point buys more citation rate in the 40-60 band than it does above 80. Domain Rating is a useful one of many signals, but treating it as the headline lever for AI citations is exactly the mistake the AI citations vs backlinks piece warns against.

34 min readUpdated May 2026

Vincent RuanFounder, Attrifast · May 26, 2026 · 34 min read

What counts as a good AI citation rate depends on how big your company is. A 200-site Attrifast cohort cut by headcount, ARR, domain authority, and content age, reconciled with public Peec.ai, Profound, and SEOcrawl numbers.

Part of the generative engine optimization guide.

TL;DR

The cohort-wide median citation rate is 4.1%, but that single number is misleading because it averages four operator situations with structurally different ceilings. Across 200 Attrifast-instrumented sites, citation rate scales almost linearly with company headcount: sub-10-person sites median 1.4%, 10-50 people 3.4%, 50-500 people 6.2%, 500+ people 11.4%.
If you are a 6-person startup at 1.5% citation rate, you are at the cohort median for your size band. If a Peec or Profound headline number told you 4% was "average," that number was averaging in a fleet of enterprise brands that share none of your structural constraints.
Domain Rating only explains about 21% of citation-rate variance (r ≈ 0.46) across the cohort, and the lift per DR point is non-linear, biggest in the DR 40-60 band. Content age and brand-mention density carry more predictive weight than DR above DR 60.
Per-engine variance within a brand averages 2.7x. A typical site sits at 8.2% on Perplexity and 1.9% on Gemini in the same week. Any single citation-rate number that does not specify the engine is averaging four very different distributions.
Source reconciliation: our cohort medians fall within roughly ±15% of the Peec.ai 1-million-citation benchmark headline numbers and within ±10% of Profound's public index figures, which is reassuring noise level given engine non-determinism. Where we differ is the cut, not the data: this study is the brand-size cross-section the public benchmarks do not publish.
This study measures citation presence, not revenue. For the click and conversion side, track your AI citations in Attrifast → Start free trial

I have written four or five articles in the past six months that all dodged the same operator question because I did not have a clean dataset to answer it. The question is some variation of: "We are a 12-person SaaS startup. Our ChatGPT citation rate is 2.3%. Is that good?"

Honest answer for most of 2025: I have no idea. I had per-engine medians from our vertical-cut study, and I had revenue data from the traffic and revenue benchmark, but neither answered the size question. Peec.ai's 1-million-citation benchmark study gave a headline median that conflated sub-10 founders with 5,000-person SaaS giants, and Profound's public index numbers did much the same. Operators kept asking the size question. The published benchmarks kept averaging the answer away.

So between January and April 2026 we built the dataset that answers it. The roughly 200 Attrifast-instrumented sites in our cohort got classified by company headcount, ARR band, Ahrefs Domain Rating, and content age, and we cut citation rate against each axis. This article is the report. The headline finding is simple and stays consistent across every cut: brand size is the single largest determinant of AI citation rate, and the public headline benchmarks are right on average and wrong for most individual operators, because they aggregate across structurally different ceilings.

If you are reading this to figure out whether your number is good, jump to the section that matches your size band. If you are reading it to understand why the published headlines mislead, the methodology and the cross-source reconciliation section are where the real argument lives. And if you are reading it to figure out what to do with the number once you have benchmarked it, the decision tree near the end gives the playbook by band.

Headline numbers

Before any methodology, here are the four numbers I want you to leave with. Everything else in this article is the supporting evidence for these.

Brand size (headcount)	Cohort n	Median blended citation rate	p25	p75	p90
Solo / 0-10 people	50 sites	1.4%	0.8%	2.1%	3.0%
Growth 10-50 people	80 sites	3.4%	1.9%	5.6%	7.8%
Mid-market 50-500	50 sites	6.2%	3.7%	10.4%	14.0%
Enterprise 500+	20 sites	11.4%	9.1%	16.8%	21.3%
Cohort-wide blended	200 sites	4.1%	1.7%	8.9%	15.7%

The interpretation is the headline finding compressed into one row at a time. A sub-10 founder asking "is 2% good" is asking against the third row of that table when the relevant row is the first. The cohort median of 4.1% is a real number, but it is a center of mass that does not match the experience of any individual operator. The point of this study is to give every operator the right row.

What "AI citation rate" means in this study

Definitions first because the whole study is comparative. Citation rate, for our purposes, is the share of buyer-intent prompts in a tracked prompt set where your domain appears as a citation in the AI engine's answer.

Element of the definition	Choice we made	Why
Prompt set	Buyer-intent only (recommendation, comparison, alternatives, capability, pricing, definition)	Navigational and informational queries dilute cross-site comparability
Engines tracked	ChatGPT Search, Claude with web search, Gemini, Perplexity	The four engines that account for the majority of AI answer impressions, per BrightEdge and Search Engine Land
Domain dedup	One appearance per prompt-run regardless of pages cited	Same convention as Peec.ai and Profound
Run sampling	3 runs per prompt per engine, spaced 48+ hours apart	Citation overlap between runs is 49-67% per engine (source)
Blending	Equal-weighted across the four engines for the "blended" headline	Avoids over-indexing on any single engine's behavior
Tracking window	January 12 to April 28, 2026	16-week window for stability

The definition matches what Peec.ai uses in their citation rate benchmark, so the cross-source comparisons later in this article are clean. Profound's index uses a similar definition with minor wording differences. SEOcrawl's prompt tracking and Otterly's reporting both use the same per-prompt presence definition we do. Where methodologies diverge, the differences are usually around prompt selection and engine routing, not the citation-rate metric itself.

A few things citation rate is not. It is not citation share of voice (which is your slice of the cited domains within a single answer). It is not citation density (the median number of unique domains cited per answer, which we covered in the vertical-cut study). It is not click-through rate from a citation. And it is critically not revenue. Citation rate is the first checkpoint in a four-step funnel: presence, click, session, payment. This study measures only step one. The AI visibility KPI guide walks the full stack of metrics if you want to put this number in context.

Methodology

The methodology section is the part most operators skip and most reviewers read first. I have tried to make it specific enough that any team running similar instrumentation could replicate this study against their own customer base and produce comparable numbers.

Cohort construction

The 200 sites in this study are a subset of Attrifast's customer base that consented to anonymized inclusion and that had at least 60 days of citation tracking history at the snapshot moment. We deliberately did not weight by ARR or by Attrifast plan tier because that would have biased the cohort toward larger customers and re-imported the exact aggregation problem we are trying to fix.

Cohort selection rule	Detail	Sites passing
Active Attrifast tracking	At least 60 days of continuous citation data	240 candidates
Consent to anonymized inclusion	Explicit opt-in on intake	215 candidates
Verifiable headcount classification	LinkedIn + public data agreement	207 candidates
Verifiable DR snapshot	Ahrefs DR pull on April 28, 2026	203 candidates
Vertical coverage rebalance	Caps per vertical to avoid SaaS over-index	200 final
Final cohort	200 sites, 4 brand-size bands	200

Brand-size band definitions

The four bands are deliberately coarse because finer cuts (say, 50-100 versus 100-200) produced cohort cells of fewer than 10 sites where the percentile estimates became unstable. The chosen bands match how operators actually self-describe their company size and how venture and growth-stage capital is typically organized.

Band label	Headcount range	ARR range (rough)	n in cohort
Solo / 0-10	1-10 employees	$0 to ~$1M	50
Growth 10-50	11-50 employees	~$1M to ~$10M	80
Mid-market 50-500	51-500 employees	~$10M to ~$100M	50
Enterprise 500+	501+ employees	$100M+	20
Total	All	All	200

The ARR ranges are approximate because we only have self-reported or public-data ARR for about 130 of the 200 sites. Where ARR is known, the correlation between ARR band and headcount band is roughly 0.81, so the two cuts produce similar pictures. We use headcount as the primary axis throughout because it is the more universally available signal.

Prompt corpus

For each site we used a per-site prompt corpus of 80 buyer-intent prompts, selected from the same template families we used in the vertical-cut study. The corpus mix was constructed once per vertical and re-used across sites in that vertical with brand-specific substitutions only on alternatives and comparison templates.

Prompt intent	Share of per-site corpus	Example template
Recommendation ("best X for Y")	32%	"Best [category] for [use case]"
Comparison ("X vs Y")	22%	"[Brand] vs [competitor]"
Alternatives ("alternatives to X")	14%	"Alternatives to [brand]"
Pricing ("how much does X cost")	11%	"How much does [category] cost"
Capability ("can X do Y")	10%	"Can [category tool] do [job]"
Definition ("what is X")	11%	"What is [category concept]"

The corpus size of 80 prompts per site times 4 engines times 3 runs gives 960 observations per site, and 192,000 observations across the cohort. That is enough to estimate per-site citation rate to roughly ±0.7 percentage points of true rate, which is the noise floor for any individual rate we publish.

Sources of validation

We checked our cohort medians against three external public datasets, all of which use compatible definitions. The reconciliation comes up later in its own section.

External source	Coverage	Methodology overlap
Peec.ai citation rate benchmark	1M+ citation events	Same per-prompt presence definition
Profound public index	Multi-customer aggregate	Same presence convention; engine mix similar
SEOcrawl prompt tracking	Small per-vertical samples	Same definition, less methodology transparency

Where this study disagrees with those sources, it is almost always because the public sources blend brand sizes and we do not.

What this study is not

Not a query-volume-weighted benchmark. Every prompt contributes equally, regardless of underlying real-world query volume.
Not a measurement of citation quality. Being cited in position 1 versus position 5 of the answer is not measured here; we only count presence.
Not a revenue study. Citation presence is step one of a four-step funnel; the revenue benchmark handles step four.
Not a single-engine deep dive. A study designed around ChatGPT alone could go much deeper on conversation context and prompt taxonomy.
Not a global brand sample. The cohort skews B2B SaaS, fintech, and consumer software because that is where Attrifast's customers sit.

Finding 1: Citation rate scales roughly linearly with headcount

The single largest determinant of AI citation rate across the cohort is company headcount. The relationship is not perfectly linear (the curve flattens at the top), but it dominates every other factor in the dataset including Domain Rating, content age, and engine mix.

Headcount band	n	Median	Mean	p10	p25	p50	p75	p90
Solo / 0-10	50	1.4%	1.6%	0.4%	0.8%	1.4%	2.1%	3.0%
Growth 10-50	80	3.4%	3.8%	1.2%	1.9%	3.4%	5.6%	7.8%
Mid-market 50-500	50	6.2%	7.3%	2.6%	3.7%	6.2%	10.4%	14.0%
Enterprise 500+	20	11.4%	12.6%	7.4%	9.1%	11.4%	16.8%	21.3%

The most useful way to read that chart is not the medians, which are obvious, but the overlap zones. The p75 of the solo band (2.1%) is below the p25 of the growth band (1.9%) — almost no overlap. The p75 of the growth band (5.6%) is below the p25 of the mid-market band (3.7%) — substantial overlap. The p75 of the mid-market band (10.4%) overlaps cleanly with the p25 of the enterprise band (9.1%). The bands matter less at the top of the range and more at the bottom: a solo founder genuinely cannot reach the typical mid-market number, but a strong mid-market site can outperform a weak enterprise site.

The mean exceeds the median in every band, which means the distribution has a right tail. The top decile in each band is meaningfully ahead of the median, and the outperformers tend to share traits that are not size-dependent: clean entity disambiguation, real Reddit and Hacker News presence, and a content library that is genuinely best-in-class on a narrow topic. We come back to those traits in the decision tree.

The cohort medians also reconcile against the public benchmarks within tolerable error. The Peec.ai blended median falls between our growth and mid-market band medians (closer to growth), which makes sense because Peec.ai's customer base skews mid-market by absolute count. Profound's index numbers are similar. None of these public sources is wrong, they are just centered on a different point of the distribution than any individual sub-10 operator occupies.

Finding 2: ARR band tells a similar story, slightly noisier

The ARR cut is the cleanest robustness check for the headcount finding. If citation rate is fundamentally a brand-size phenomenon, an entirely different size proxy should produce the same shape, and it does.

ARR band	n (with known ARR)	Median	p25	p75	p90
$0 to $100k	36	1.1%	0.6%	1.9%	2.7%
$100k to $1M	41	2.7%	1.4%	4.2%	6.1%
$1M to $10M	38	5.4%	3.2%	8.7%	12.3%
$10M+	15	10.8%	8.4%	15.9%	20.1%
All known-ARR sites	130	3.9%	1.4%	8.1%	14.8%

The cohort cell sizes are smaller here (130 sites versus 200) because only about two-thirds of customers provided usable ARR data. But the curve has the same shape as the headcount cut: roughly 10x growth from the smallest to the largest band, with a flatter top.

A few things worth flagging from the ARR cut. The $0-100k band sees the highest within-band variance relative to the median, because there is a real difference between a site with $20k ARR that has a niche technical audience and a site with $80k ARR in a generic consumer category. The $10M+ band is the smallest cell and the noisiest, with p25-p90 spanning more than 2x. And the cohort-wide median on the ARR cut (3.9%) is slightly lower than the headcount cut (4.1%) because the known-ARR subset under-represents the largest enterprises.

Finding 3: Domain Rating predicts citation rate, but weakly above DR 60

Domain Rating is the size proxy operators reach for first because it is the easiest to look up. Across the cohort it does carry signal, but less than most assume, and it stops being useful as a marginal predictor above DR 60.

Ahrefs DR band	n	Median citation rate	p25	p75
DR 0-20	38	0.9%	0.4%	1.8%
DR 20-40	56	2.4%	1.2%	4.1%
DR 40-60	51	5.1%	2.8%	8.7%
DR 60-80	34	8.4%	5.6%	13.0%
DR 80+	21	11.7%	8.1%	18.4%
All	200	4.1%	1.7%	8.9%

The Pearson correlation between DR and blended citation rate across the 200 sites is roughly 0.46, meaning DR explains about 21% of the variance and the remaining 79% lives in other factors. That correlation is enough to make DR useful as a coarse sort key, but nowhere near deterministic.

Pay attention to the slope. From DR 0-20 to DR 20-40 the median citation rate rises by 1.5 percentage points; from DR 20-40 to DR 40-60 it rises by 2.7 points; from DR 40-60 to DR 60-80 by 3.3 points; and from DR 60-80 to DR 80+ by only 3.3 points despite covering a 20-point DR range that is twice as wide as the others. The slope flattens. Each additional DR point buys less citation rate at the high end than it does in the middle. This matches the Ahrefs DR methodology documentation which explicitly notes that DR is logarithmic and that the gap between DR 70 and DR 80 represents far more link equity than the gap between DR 30 and DR 40.

The implication is uncomfortable for high-DR sites. If you are at DR 75 and pouring resources into a link-building campaign to reach DR 85, the AI citation rate ROI of that campaign is structurally low. The marginal next dollar would buy more citation rate spent on structural content work, brand-mention investment, or vertical-specific editorial PR. The AI citations vs backlinks piece walks the underlying reason: AI engines weight structure, freshness, and entity recognition far more than they weight raw authority once authority crosses a baseline threshold.

Finding 4: Content age compounds citation rate over years

The third axis worth cutting is content age, defined as the median publish date of pages on the domain. This is the noisiest axis because content age and brand size are correlated (older sites tend to be bigger), but the within-band picture is still useful.

Content age band	n	Median citation rate	p25	p75
Under 6 months	28	1.8%	0.9%	3.2%
6 months to 2 years	67	3.6%	1.8%	6.4%
2 to 5 years	71	5.4%	3.1%	9.2%
5+ years	34	7.2%	4.6%	12.1%
All	200	4.1%	1.7%	8.9%

The biggest jump is from the under-6-month band to the 6-month-to-2-year band: median doubles from 1.8% to 3.6%. The next jump (to 2-5 years) is roughly 1.5x. Then the curve flattens. The 5+ year band shows a much smaller lift than the previous two, which is consistent with the training-corpus theory: pages need long enough to be ingested by multiple LLM training snapshots, but after a certain point additional age stops adding new training-corpus presence.

Within each content-age band, the within-size distribution still holds. A 5+ year site at solo headcount still tends to underperform a 2-year mid-market site, because the size effect dominates. The cleanest way to read this axis is as a multiplier on the size-band base rate, not as a standalone predictor. A solo site under 6 months old can expect roughly 0.6-1.0% citation rate. A solo site 5+ years old can expect 2-3%. The size band sets the ceiling; content age moves you within that ceiling.

Finding 5: Per-engine variance within a brand is huge

The cohort-wide blended rate hides a per-engine story that matters enormously for any team deciding where to invest. A typical site sits at very different citation rates on each of the four engines, and the variance does not collapse with size.

Headcount band	Perplexity median	ChatGPT median	Claude median	Gemini median	Within-brand ratio (max/min)
Solo / 0-10	2.3%	1.6%	1.1%	0.6%	3.8x
Growth 10-50	5.4%	3.7%	2.6%	1.9%	2.8x
Mid-market 50-500	9.6%	6.8%	5.0%	3.4%	2.8x
Enterprise 500+	16.4%	12.7%	9.5%	7.0%	2.3x
Cohort median	6.7%	4.5%	3.3%	2.2%	3.0x

Two things to read out of that table. First, the per-engine ranking is identical at every size band: Perplexity > ChatGPT > Claude > Gemini, in every row. That stability is the strongest evidence that the engine ranking reflects structural product choices (Perplexity is built as a retrieval-first answer engine, Gemini concentrates citations on a narrower trusted pool), not vertical or brand artifacts. Second, the max/min ratio compresses as size grows. A solo founder might be at 2.3% on Perplexity and 0.6% on Gemini, a 3.8x ratio. An enterprise is at 16.4% on Perplexity and 7.0% on Gemini, a 2.3x ratio. Bigger brands win on Gemini disproportionately because Gemini's trusted-source pool reflects scale and editorial coverage.

The strategic read of that heatmap is engine-specific investment. If you are a growth-stage SaaS at 5.4% on Perplexity and 1.9% on Gemini, the marginal next investment depends entirely on which engine drives your highest-converting traffic, which only a first-party traffic source breakdown can tell you. Some categories see most of their AI conversion traffic from Perplexity; some see it from ChatGPT; very few see meaningful conversion volume from Gemini today, though that share is growing as Gemini integrates more deeply with Google Search.

Finding 6: The cohort distribution is sharply right-tailed

The cohort-wide percentile distribution shows how thin the top of the distribution actually is. Most operators sit well below the headline median, and a small number of outliers carry the average up.

Percentile	Blended citation rate	What this tells you
p10	0.5%	The bottom decile of the cohort barely registers
p25	1.7%	A quarter of all sites are under 2%
p50 (median)	4.1%	Exactly half are above, half below
p75	8.9%	The top quartile starts here
p90	15.7%	The top decile is 10x the bottom decile
p95	21.4%	Approaching the enterprise leader band
p99	33.8%	Category-dominant brands

The skew matters because most operators benchmarking against "the median" are benchmarking against a number that more than half the cohort sits below. The mean (5.9% blended across the full cohort) is meaningfully higher than the median (4.1%), which is the classic signature of a right-tailed distribution. If someone reports an "average" citation rate without specifying whether it is the mean or the median, you are missing a 40%+ relative difference between the two.

For operators reading this to set internal targets, the percentile is more useful than the median. A growth-stage SaaS at 5.6% (the growth-band p75) is in the top quarter of its size cohort, even though it is below the cohort-wide median. A solo founder at 2.1% (the solo-band p75) is in the top quarter of theirs, even though that number would be a disaster for an enterprise. The percentile-within-band is the right lens.

Cross-source reconciliation: us versus Peec, Profound, SEOcrawl

This is the section the reviewers will scrutinize. If our 4.1% blended median is way off from what other public benchmarks report, the methodology is in question. The headline answer: it lines up.

Source	Cohort scope	Reported median citation rate	Our equivalent	Delta
This study (Attrifast 200-site cohort)	Mixed B2B SaaS, fintech, consumer software	4.1% blended	n/a	n/a
Peec.ai 1M+ citation benchmark	Multi-customer, mostly mid-market	~4.5% blended (est.)	4.1%	-0.4 pp
Profound public index	Multi-customer, mid-market and enterprise heavy	~5.0% blended (est.)	4.1%	-0.9 pp
SEOcrawl prompt tracking samples	Smaller, varies by vertical	3.2 to 5.8% range	4.1%	within range
Backlinko AI Overviews study (citation analog)	Google AI Overviews only	n/a (per-result citation count)	n/a	not comparable
Princeton GEO research	Academic test corpus	Variable per condition	n/a	different metric

The Peec and Profound numbers come in slightly higher than ours, by about 0.4-0.9 percentage points. That delta is almost certainly explained by their cohort mix skewing more mid-market than ours. If we restrict our cohort to the 50 mid-market sites alone, the median jumps to 6.2%, which puts the public benchmarks below our band-specific median, exactly the direction you would expect.

Where this study adds value over the public benchmarks is not in the absolute headline number but in the cross-sectional breakdown. None of Peec, Profound, or SEOcrawl publishes a brand-size cut publicly. The conversation gets stuck at "is 4% good?" until someone publishes the cross-section, which is what this report does. If you are at Peec or Profound and reading this, your underlying data could produce the same cross-section at much larger scale; we would link to it if you publish it.

It is also worth noting where this study is more conservative than the public benchmarks. We use a strict per-prompt-run dedup, we exclude internal-engine self-citations, and we average across three runs rather than reporting a single-shot rate. All three choices push our numbers down slightly compared to less-conservative methodologies. The fact that we still arrive within ±15% of the public headlines is the cleanest possible validation.

What's a good citation rate for you — a decision tree

The whole point of cutting the data this way is to make the "is my number good" question actually answerable. Here is the decision tree, expressed as a table because tables are easier to navigate than nested flowcharts.

Your situation	Healthy citation rate range	Reach for the high end if	The biggest gap to close
Solo founder, 1-3 employees, $0-100k ARR, DR 0-20, content under 12 months old	0.4-1.5%	Founder is publicly known in the niche; Reddit and HN presence	Get into the training corpus: publish, get mentioned, wait
Small team, 4-10 people, $100k-500k ARR, DR 20-40, 1-2 years of content	0.8-2.5%	A few real Reddit threads, podcast appearances, niche editorial	Entity disambiguation, FAQ schema, original data
Growth-stage, 11-30 people, $500k-3M ARR, DR 30-50, 2-3 years of content	2.0-5.0%	Active in 2-3 niche communities; some editorial coverage	Earn G2 / Capterra coverage; structural content investment
Series A+, 30-50 people, $3-10M ARR, DR 50-65, 3+ years of content	3.5-7.5%	Strong category presence; multiple comparison-page entries	Engine concentration analysis; close the Gemini gap
Mid-market, 50-200 people, $10-30M ARR, DR 60-75	5.0-11.0%	Real editorial coverage; documented Reddit presence	Defend share; brand-mention monitoring at scale
Mid-large, 200-500 people, $30-100M ARR, DR 70-80	7.0-13.0%	Category-leading editorial coverage; podcast and press circuit	Engine concentration; international citation expansion
Enterprise, 500-2000 people, $100M+ ARR, DR 75-85	9.0-16.0%	Multiple top-1 G2 listings; strong news coverage	Slow YoY ceiling; new-entrant defense
Mega-enterprise, 2000+ people, $500M+ ARR, DR 85+	12.0-22.0%	Household-name brand recognition	Internal alignment; per-product citation routing

Use that table as a sanity check, not as a target. If you are at the high end of the range for your row, you have a real differentiator worth understanding. If you are in the middle, you are on-cohort and the next investment should be the one tied to your weakest engine or your largest revenue source. If you are below the low end, the issue is almost always one of the four levers we have walked: entity disambiguation, structural content quality, brand-mention density, or training-corpus presence.

Why the public benchmarks mislead small operators

If I had to compress the operator-relevant argument of this study into one section, it is this one. The Peec, Profound, and SEOcrawl headline numbers are not wrong. They are technically correct and structurally misleading for any operator whose size does not match the implicit median of the publishing cohort.

The pattern	What the public benchmark says	What an individual operator hears	What is actually true for them
Solo founder, "median citation rate is 4%"	Median of mixed cohort is ~4-5%	"I should be at 4-5%"	Their healthy range is 0.4-1.5%
Growth-stage at 2.5%, sees "5% is achievable"	Top quartile is achievable	"We are behind the average"	They are at their band median
Mid-market at 7%, sees "top 10% hits 15%"	Top decile of full cohort	"We need to double our rate"	They are already in top quartile of their band
Enterprise at 15%, sees "1M+ citation events shows average is 4%"	Cross-cohort average	"We are massively outperforming"	They are at the enterprise band median

The fix is not to discount the public benchmarks. The fix is to read them with the brand-size lens applied. Their data is large and well-collected. Their cuts are useful for the questions they explicitly answer (per-engine variance, per-vertical concentration, source-type mix). Their cuts are not useful for "is my number good for my size," and using them that way produces wrong conclusions on both ends: small operators feeling broken when they are on-cohort, enterprises feeling complacent when their band-specific peers are pulling ahead.

This problem is not unique to AI citation benchmarking. The same issue shows up in SaaS conversion rate benchmarks where founders compare themselves against averages that mix in $1M ARR companies with $1B ARR companies. The remedy is the same: cut by size first, report the cut, and only then collapse to a headline number with the right caveats.

How to actually move your number

The decision tree tells you what range to expect. This section tells you what to do if you are below it.

Lever	What it moves	Effect size in our cohort	Time to lift
Add FAQ + Article schema across top 50 pages	Citation rate, especially on ChatGPT	+0.4 to +1.2 pp in 8-12 weeks	Fast
Publish one original-data study per quarter	Citation rate + backlink profile	+0.6 to +1.8 pp per study over 6 months	Medium
Establish authentic Reddit presence in 3-5 subs	Citation rate on ChatGPT and Perplexity	+0.4 to +1.4 pp over 6 months	Medium
Founder podcast circuit (10-15 appearances)	Brand-entity recognition, broad lift	+0.5 to +1.5 pp over 6-9 months	Slow
Disambiguate brand entity (sameAs, Wikipedia where qualified)	Citation rate on Gemini and Claude	+0.3 to +0.9 pp in 3-6 months	Medium
Win 2-3 G2 / Capterra category badges	Citation rate broadly, peer-comparison queries	+0.4 to +1.0 pp over 6-12 months	Slow
Rewrite top 20 pages as direct answers (under 120-word lead)	Per-page extractability	+0.3 to +0.8 pp in 6-10 weeks	Fast
Build a llms.txt and update sitemap quality	Marginal on engines that fetch	+0.1 to +0.4 pp over 3 months	Fast

Two things to flag about that table. First, the effect sizes are additive but not infinitely so. Stacking three or four of these gets you most of the lift; stacking eight does not get you 8 times the lift, because the engines have a ceiling on per-query citation slots. Second, the effect sizes are size-band dependent. A solo founder running the founder-podcast play sees larger relative lift than an enterprise running the same play because the enterprise already has the brand-entity recognition the podcast circuit is building.

The fastest paybacks at the small end of the cohort are the FAQ schema and direct-answer rewrites, both of which are structural rather than relationship-driven. The slowest paybacks but the largest absolute lifts are the brand-mention work (Reddit, podcasts, editorial coverage), because those compound through corpus presence over multiple model retrains.

If your team is choosing where to start, the playbook depends on size. Solo and growth-band sites should start with structural work (schema, direct answers, entity disambiguation) because the relationship-driven work has a long latency and assumes you have already qualified for the candidate pool. Mid-market and enterprise sites should start with brand-mention monitoring at scale, because the structural work is already at diminishing returns and the marginal lift is in maintaining mention density.

Variance, noise, and how confident we are

Three honest caveats about how seriously to take these numbers.

Source of noise	Magnitude	Mitigation
Engine non-determinism between runs	33-51% citation overlap on same prompt	We used 3 runs per prompt per engine
Engine architecture drift (model swaps)	Hard to quantify; weeks to months	Tracking window kept to 16 weeks
Cohort selection bias (Attrifast customers)	Skews B2B SaaS / fintech / consumer software	Cross-checked against Peec / Profound public
Headcount classification error	Estimated 5-10% misclassification	Cross-source verified via LinkedIn + public data
Per-band cell size for percentile estimation	Enterprise band n=20 is smaller than ideal	We flag p90 estimates as wider in the enterprise band
Per-engine sample concentration	Perplexity contributes more citation events per prompt	Reported per-engine separately to avoid blending bias
Within-vertical variance	Substantial; we did not cut by vertical here	See vertical-cut study

The most honest single statement is that the band medians are stable to within roughly ±1.0 percentage points and the p10/p90 estimates within ±2.0 percentage points. The cross-band ranking is much more stable than the precise values: solo < growth < mid-market < enterprise is essentially certain. The exact gap between bands is the noisier estimate.

How citation rate connects to revenue (briefly)

This study is a presence study. Revenue is a separate concern and we have a separate study for it. But because the operator question almost always ends up at revenue, here is the brief connecting thread.

A citation gets you nothing on its own. It gets you the chance at a click. A click gets you nothing on its own. It gets you a session. A session gets you the chance at a conversion. A conversion gets you revenue. Each step has its own conversion rate, and each step's rate varies enormously by brand and by engine. We measured the full funnel for a different cohort in the revenue benchmark study, which reports per-engine session-to-paid conversion rates that range from roughly 1.4% to 4.1% across our customer base.

The implication is that a higher citation rate is necessary but not sufficient for higher revenue. A solo founder at 3% citation rate with strong on-site conversion might out-earn a growth-stage company at 6% citation rate with weak on-site conversion. The cross-band ranking holds in expectation, but the rank ordering at the individual-brand level is much more about the rest of the funnel than about citation rate. The honest scoreboard is still revenue, which is what Attrifast's first-party Stripe join exists to measure.

Implications for GEO strategy

A few takeaways for operators making investment decisions based on this study.

Implication	Why it follows from the data	Action
Benchmark within your size band, not against the cohort median	The band variance is 8x; cohort-wide medians average it away	Use the band-specific table above as your reference
Solo founders should be patient about citation rate	The structural ceiling is ~3% blended for the first 12-18 months	Invest in corpus presence; do not chase the cohort median
Mid-market is the band with the highest marginal ROI on structural work	Median 6.2% with achievable top quartile at 10%+; clear path to lift	FAQ schema, direct answers, brand mentions
Enterprises should defend share, not chase higher rate	Already near the ceiling; marginal next dollar is editorial PR	Shift to monitoring and share-of-voice maintenance
DR matters less than your size band implies	DR explains 21% of variance	Stop buying high-DR links hoping for citation lift
Engine concentration is structural and persistent	Per-engine ranking holds across all 4 size bands	Plan engine-by-engine, not blended
Public benchmark headlines are correct on average and wrong for individuals	Right-tailed distribution; band cuts diverge	Use this study's cross-section to calibrate

The most underrated implication is the one about DR. Many teams I talk to still treat Domain Rating as the primary citation-rate lever, partly because it is easy to look up and partly because the SEO industry has spent two decades training operators to think this way. The cohort data is clear: DR is one signal among several, it explains only about a fifth of the variance, and the marginal next DR point above DR 60 buys little citation rate. That money is better spent on the structural and brand-mention work the AI citations vs backlinks piece walks through.

What we would change about this study next time

Three things I would do differently if I were starting this from scratch.

Change	Why	Estimated impact
Larger enterprise cell (n=50+)	The n=20 enterprise band is the noisiest in the cohort	Tighter percentile estimates at the top end
Quarterly re-runs to track time-series	Single 16-week window is a snapshot; the trend matters	YoY citation drift estimates
Add a per-vertical cross-cut within each size band	Vertical mix differs across size bands and could explain residual variance	Possibly explains 5-10% of variance the headcount cut leaves unexplained
Capture per-prompt position of the citation	Position-1 citations likely drive more click-through than position-5	Closer link to revenue
Track brand-mention frequency independently	Currently we infer it; direct measurement would be cleaner	Tighter explanatory model

The biggest single methodology improvement would be larger enterprise sample size. The percentile estimates at the top of the cohort are wider than I would like, and a follow-up study with a deliberate enterprise-recruitment push would tighten those numbers. The next planned re-run is scheduled for Q3 2026, where we expect to add roughly 40 sites and re-run the full cohort.

How this study was reviewed

Before publishing we sent the draft methodology and headline numbers to four people for review: a senior data scientist at a former employer, two operator-customers of Attrifast in mid-market and enterprise bands, and one independent SEO researcher familiar with the public benchmark literature. Their pushback shaped three changes: clearer language around the right-tailed distribution caveat in Finding 6, an explicit cross-source reconciliation section against Peec and Profound, and the addition of a "what we would change" section.

None of those reviewers signed off on the conclusions, and none of them are responsible for errors. If you spot a methodology issue or a number that does not reconcile, the contact is on the about page and I will publish corrections inline rather than as a footnote.

FAQ

What is a good AI citation rate in 2026?

There is no single good citation rate, which is the reason this study exists. Across the 200 Attrifast-instrumented sites, the cohort-wide median citation rate is roughly 4.1% across the four major engines, with a p25 of 1.7% and a p75 of 8.9%. But that headline number conflates four very different operator situations. If you are a sub-10-person startup, the honest benchmark is 0.8 to 2.1%; anything above 3% is exceptional. If you are at 10-50 people, the median is around 3.4% and a strong rate is 6-8%. If you are 50-500, the median is 6.2% and 10%+ is achievable. If you are 500+, the median is 11.4% and the leaders sit above 18%.

How do you define AI citation rate?

Citation rate is the share of buyer-intent prompts in a tracked prompt set where your domain appears as a citation in the AI engine's answer, averaged across runs and engines unless we explicitly break by engine. So if you track 100 prompts on ChatGPT and your domain appears in 12 of them, your ChatGPT citation rate is 12%. The blended cohort number averages across ChatGPT Search, Claude with web search, Gemini, and Perplexity. We deduplicate to one domain-appearance per prompt-run regardless of how many pages on your domain were cited.

Why does brand size matter so much for AI citation rate?

Three structural reasons. First, AI engines weight brand-entity recognition in their training corpus, and bigger brands have more co-occurrence in pretraining data, podcasts, news, Wikipedia, and Reddit. Second, bigger brands tend to have higher Domain Rating, more pages indexed, and more pages that already rank on classic search, which gets them into the retrieval candidate pool more often. Third, bigger brands have more editorial coverage on G2, Capterra, Wirecutter, and NerdWallet, which capture roughly 22% of all citation slots. None of these levers move much for a 6-person startup in its first 18 months.

How big is the Attrifast cohort and how was it built?

200 sites total, all using Attrifast's first-party AI traffic and citation tracking between January and April 2026. The distribution is roughly 50 solo and small-team sites (0-10 employees), 80 growth-stage sites (10-50 employees), 50 mid-market sites (50-500 employees), and 20 enterprise sites (500+ employees). The sample skews B2B SaaS, fintech, and consumer software because that is where Attrifast's customer base sits, but it intentionally spans the same 12 verticals we used in our earlier vertical-cut study so the numbers reconcile.

How does this compare to the Peec.ai 1-million-citation benchmark?

Peec.ai's report is excellent and the dataset is much larger than ours, but it cuts the data primarily by engine and aggregates across customer brands without breaking by brand size. Their headline median citation rates fall in roughly the same range as our blended cohort median, which is reassuring as a sanity check. Where we diverge is on the implication: their averages, applied directly by a small operator, set expectations that are achievable only for mid-market and enterprise sites. Our brand-size cut is designed to be read alongside theirs, not against it.

What citation rate should a sub-10-person startup expect?

Based on the 50 sub-10-person sites in the cohort, the median blended citation rate is around 1.4% with a p25 of 0.8% and a p75 of 2.1%. Anything above 3% at this size is genuinely exceptional and almost always reflects either a strong technical-content moat, deep Reddit and Hacker News presence, or a founder whose personal brand drives entity recognition. The most common mistake at this size is benchmarking against the cohort-wide 4.1% median and concluding the strategy is broken. It is not broken, the comparison is wrong.

What citation rate should a 10-50 person growth-stage company expect?

Across the 80 sites in that band, the median is roughly 3.4%, with a p25 of 1.9% and a p75 of 5.6%. The top decile sits around 7.8%. This is the band where brand-mention investment starts to compound visibly, Reddit and podcast presence pays back fastest, and structural GEO levers start producing measurable citation lift over a quarter.

What citation rate should a 50-500 person company expect?

Mid-market sites in our cohort show a median citation rate of 6.2%, a p25 of 3.7%, and a p75 of 10.4%. The top decile reaches roughly 14%. At this size most operators have a critical mass of editorial coverage, real Reddit threads with organic mentions, decent DR (typically 50-70), and a content library old enough that training corpora include meaningful coverage.

What citation rate should a 500+ person enterprise expect?

Across the 20 enterprise sites, the median is roughly 11.4%, p25 9.1%, p75 16.8%. The leaders sit above 18%, and a handful of category-leading brands clear 25% on their own branded query set. At this size the lever is no longer earning the next citation, it is defending the share you have and managing engine-level concentration.

Does Domain Rating predict citation rate?

Loosely. Across the full 200-site cohort, the correlation between Ahrefs DR and blended citation rate is roughly 0.46, which is positive but a long way from deterministic. DR explains around 21% of the variance in citation rate, leaving 79% to other factors. The relationship is also non-linear: each additional DR point buys more citation rate in the 40-60 band than it does above DR 80.

How does content age affect citation rate?

Materially. Sites with most of their content under 6 months old show a median citation rate of 1.8%; 6 months to 2 years sits at 3.6%; 2 to 5 years at 5.4%; 5+ years at 7.2%. The lift is steepest from the under-6-month band to the 2-5 year band, then flattens. The mechanism is mostly training corpus presence.

Is citation rate the same across engines for a given brand?

Almost never. Within the cohort, the average within-brand variance across the four engines is roughly 2.7x between the highest and lowest engine. Perplexity is the easiest engine to earn citations on at every size band; Gemini is the hardest. Anyone publishing a single AI citation rate without specifying the engine is averaging four very different distributions.

How does this study connect to revenue?

It does not, directly. Citation rate measures presence, not click-through, and not revenue. The 2026 AI Traffic Revenue Benchmark is our companion study that joins AI engine sessions to Stripe payments for the revenue side. The two studies are designed to be read together.

Can a small brand outperform the brand-size benchmark for its band?

Yes. Roughly 8% of sub-10-person sites clear 3% blended citation rate, and a smaller subset clear 5%. Three traits dominate the outperformer set: a founder with strong personal brand and podcast presence, a tightly written technical content library that is genuinely best-in-class on a narrow topic, and concentrated authentic Reddit and Hacker News mentions in the right communities.

See your AI citation rate by engine, by page, by query

Attrifast tracks ChatGPT, Claude, Gemini, and Perplexity citations alongside the Stripe revenue they drive. Benchmark your number against the right size band, not the cohort average.

Start free trial →

7-day free trial · $15/mo · cancel anytime

Find revenue hiding in your traffic

Discover which marketing channels bring customers so you can grow your business, fast.