Original Research

AI Citation Rate Benchmarks by Brand Size 2026: Startup vs Growth vs Enterprise (200-Site Cohort Study)

What counts as a good AI citation rate depends on how big your company is. A 200-site Attrifast cohort cut by headcount, ARR, domain authority, and content age, reconciled with public Peec.ai, Profound, and SEOcrawl numbers.

Part of the GEO Hub — browse all 30 GEO guides.

I have written four or five articles in the past six months that all dodged the same operator question because I did not have a clean dataset to answer it. The question is some variation of: "We are a 12-person SaaS startup. Our ChatGPT citation rate is 2.3%. Is that good?"

Honest answer for most of 2025: I have no idea. I had per-engine medians from our vertical-cut study, and I had revenue data from the traffic and revenue benchmark, but neither answered the size question. Peec.ai's 1-million-citation benchmark study gave a headline median that conflated sub-10 founders with 5,000-person SaaS giants, and Profound's public index numbers did much the same. Operators kept asking the size question. The published benchmarks kept averaging the answer away.

So between January and April 2026 we built the dataset that answers it. The roughly 200 Attrifast-instrumented sites in our cohort got classified by company headcount, ARR band, Ahrefs Domain Rating, and content age, and we cut citation rate against each axis. This article is the report. The headline finding is simple and stays consistent across every cut: brand size is the single largest determinant of AI citation rate, and the public headline benchmarks are right on average and wrong for most individual operators, because they aggregate across structurally different ceilings.

If you are reading this to figure out whether your number is good, jump to the section that matches your size band. If you are reading it to understand why the published headlines mislead, the methodology and the cross-source reconciliation section are where the real argument lives. And if you are reading it to figure out what to do with the number once you have benchmarked it, the decision tree near the end gives the playbook by band.

AI citation rate by brand size: cohort-wide median 4.1%, sub-10 person 1.4%, 10-50 3.4%, 50-500 6.2%, 500+ 11.4%; DR explains 21% of variance

Headline numbers

Before any methodology, here are the four numbers I want you to leave with. Everything else in this article is the supporting evidence for these.

Brand size (headcount)Cohort nMedian blended citation ratep25p75p90
Solo / 0-10 people50 sites1.4%0.8%2.1%3.0%
Growth 10-50 people80 sites3.4%1.9%5.6%7.8%
Mid-market 50-50050 sites6.2%3.7%10.4%14.0%
Enterprise 500+20 sites11.4%9.1%16.8%21.3%
Cohort-wide blended200 sites4.1%1.7%8.9%15.7%

The interpretation is the headline finding compressed into one row at a time. A sub-10 founder asking "is 2% good" is asking against the third row of that table when the relevant row is the first. The cohort median of 4.1% is a real number, but it is a center of mass that does not match the experience of any individual operator. The point of this study is to give every operator the right row.

What "AI citation rate" means in this study

Definitions first because the whole study is comparative. Citation rate, for our purposes, is the share of buyer-intent prompts in a tracked prompt set where your domain appears as a citation in the AI engine's answer.

Element of the definitionChoice we madeWhy
Prompt setBuyer-intent only (recommendation, comparison, alternatives, capability, pricing, definition)Navigational and informational queries dilute cross-site comparability
Engines trackedChatGPT Search, Claude with web search, Gemini, PerplexityThe four engines that account for the majority of AI answer impressions, per BrightEdge and Search Engine Land
Domain dedupOne appearance per prompt-run regardless of pages citedSame convention as Peec.ai and Profound
Run sampling3 runs per prompt per engine, spaced 48+ hours apartCitation overlap between runs is 49-67% per engine (source)
BlendingEqual-weighted across the four engines for the "blended" headlineAvoids over-indexing on any single engine's behavior
Tracking windowJanuary 12 to April 28, 202616-week window for stability

The definition matches what Peec.ai uses in their citation rate benchmark, so the cross-source comparisons later in this article are clean. Profound's index uses a similar definition with minor wording differences. SEOcrawl's prompt tracking and Otterly's reporting both use the same per-prompt presence definition we do. Where methodologies diverge, the differences are usually around prompt selection and engine routing, not the citation-rate metric itself.

A few things citation rate is not. It is not citation share of voice (which is your slice of the cited domains within a single answer). It is not citation density (the median number of unique domains cited per answer, which we covered in the vertical-cut study). It is not click-through rate from a citation. And it is critically not revenue. Citation rate is the first checkpoint in a four-step funnel: presence, click, session, payment. This study measures only step one. The AI visibility KPI guide walks the full stack of metrics if you want to put this number in context.

Methodology

The methodology section is the part most operators skip and most reviewers read first. I have tried to make it specific enough that any team running similar instrumentation could replicate this study against their own customer base and produce comparable numbers.

Cohort construction

The 200 sites in this study are a subset of Attrifast's customer base that consented to anonymized inclusion and that had at least 60 days of citation tracking history at the snapshot moment. We deliberately did not weight by ARR or by Attrifast plan tier because that would have biased the cohort toward larger customers and re-imported the exact aggregation problem we are trying to fix.

Cohort selection ruleDetailSites passing
Active Attrifast trackingAt least 60 days of continuous citation data240 candidates
Consent to anonymized inclusionExplicit opt-in on intake215 candidates
Verifiable headcount classificationLinkedIn + public data agreement207 candidates
Verifiable DR snapshotAhrefs DR pull on April 28, 2026203 candidates
Vertical coverage rebalanceCaps per vertical to avoid SaaS over-index200 final
Final cohort200 sites, 4 brand-size bands200

Brand-size band definitions

The four bands are deliberately coarse because finer cuts (say, 50-100 versus 100-200) produced cohort cells of fewer than 10 sites where the percentile estimates became unstable. The chosen bands match how operators actually self-describe their company size and how venture and growth-stage capital is typically organized.

Band labelHeadcount rangeARR range (rough)n in cohort
Solo / 0-101-10 employees$0 to ~$1M50
Growth 10-5011-50 employees~$1M to ~$10M80
Mid-market 50-50051-500 employees~$10M to ~$100M50
Enterprise 500+501+ employees$100M+20
TotalAllAll200

The ARR ranges are approximate because we only have self-reported or public-data ARR for about 130 of the 200 sites. Where ARR is known, the correlation between ARR band and headcount band is roughly 0.81, so the two cuts produce similar pictures. We use headcount as the primary axis throughout because it is the more universally available signal.

Prompt corpus

For each site we used a per-site prompt corpus of 80 buyer-intent prompts, selected from the same template families we used in the vertical-cut study. The corpus mix was constructed once per vertical and re-used across sites in that vertical with brand-specific substitutions only on alternatives and comparison templates.

Prompt intentShare of per-site corpusExample template
Recommendation ("best X for Y")32%"Best [category] for [use case]"
Comparison ("X vs Y")22%"[Brand] vs [competitor]"
Alternatives ("alternatives to X")14%"Alternatives to [brand]"
Pricing ("how much does X cost")11%"How much does [category] cost"
Capability ("can X do Y")10%"Can [category tool] do [job]"
Definition ("what is X")11%"What is [category concept]"

The corpus size of 80 prompts per site times 4 engines times 3 runs gives 960 observations per site, and 192,000 observations across the cohort. That is enough to estimate per-site citation rate to roughly ±0.7 percentage points of true rate, which is the noise floor for any individual rate we publish.

Sources of validation

We checked our cohort medians against three external public datasets, all of which use compatible definitions. The reconciliation comes up later in its own section.

External sourceCoverageMethodology overlap
Peec.ai citation rate benchmark1M+ citation eventsSame per-prompt presence definition
Profound public indexMulti-customer aggregateSame presence convention; engine mix similar
SEOcrawl prompt trackingSmall per-vertical samplesSame definition, less methodology transparency

Where this study disagrees with those sources, it is almost always because the public sources blend brand sizes and we do not.

What this study is not

  • Not a query-volume-weighted benchmark. Every prompt contributes equally, regardless of underlying real-world query volume.
  • Not a measurement of citation quality. Being cited in position 1 versus position 5 of the answer is not measured here; we only count presence.
  • Not a revenue study. Citation presence is step one of a four-step funnel; the revenue benchmark handles step four.
  • Not a single-engine deep dive. A study designed around ChatGPT alone could go much deeper on conversation context and prompt taxonomy.
  • Not a global brand sample. The cohort skews B2B SaaS, fintech, and consumer software because that is where Attrifast's customers sit.

Finding 1: Citation rate scales roughly linearly with headcount

The single largest determinant of AI citation rate across the cohort is company headcount. The relationship is not perfectly linear (the curve flattens at the top), but it dominates every other factor in the dataset including Domain Rating, content age, and engine mix.

Headcount bandnMedianMeanp10p25p50p75p90
Solo / 0-10501.4%1.6%0.4%0.8%1.4%2.1%3.0%
Growth 10-50803.4%3.8%1.2%1.9%3.4%5.6%7.8%
Mid-market 50-500506.2%7.3%2.6%3.7%6.2%10.4%14.0%
Enterprise 500+2011.4%12.6%7.4%9.1%11.4%16.8%21.3%
AI citation rate by headcount band (p10 / p25 / median / p75 / p90)Attrifast 200-site cohort, blended across ChatGPT, Claude, Gemini, Perplexity0%5%10%15%20%25%Solo (0-10)Growth (10-50)Mid (50-500)Enterprise (500+)1.4%3.4%6.2%11.4%

The most useful way to read that chart is not the medians, which are obvious, but the overlap zones. The p75 of the solo band (2.1%) is below the p25 of the growth band (1.9%) — almost no overlap. The p75 of the growth band (5.6%) is below the p25 of the mid-market band (3.7%) — substantial overlap. The p75 of the mid-market band (10.4%) overlaps cleanly with the p25 of the enterprise band (9.1%). The bands matter less at the top of the range and more at the bottom: a solo founder genuinely cannot reach the typical mid-market number, but a strong mid-market site can outperform a weak enterprise site.

The mean exceeds the median in every band, which means the distribution has a right tail. The top decile in each band is meaningfully ahead of the median, and the outperformers tend to share traits that are not size-dependent: clean entity disambiguation, real Reddit and Hacker News presence, and a content library that is genuinely best-in-class on a narrow topic. We come back to those traits in the decision tree.

The cohort medians also reconcile against the public benchmarks within tolerable error. The Peec.ai blended median falls between our growth and mid-market band medians (closer to growth), which makes sense because Peec.ai's customer base skews mid-market by absolute count. Profound's index numbers are similar. None of these public sources is wrong, they are just centered on a different point of the distribution than any individual sub-10 operator occupies.

Finding 2: ARR band tells a similar story, slightly noisier

The ARR cut is the cleanest robustness check for the headcount finding. If citation rate is fundamentally a brand-size phenomenon, an entirely different size proxy should produce the same shape, and it does.

ARR bandn (with known ARR)Medianp25p75p90
$0 to $100k361.1%0.6%1.9%2.7%
$100k to $1M412.7%1.4%4.2%6.1%
$1M to $10M385.4%3.2%8.7%12.3%
$10M+1510.8%8.4%15.9%20.1%
All known-ARR sites1303.9%1.4%8.1%14.8%

The cohort cell sizes are smaller here (130 sites versus 200) because only about two-thirds of customers provided usable ARR data. But the curve has the same shape as the headcount cut: roughly 10x growth from the smallest to the largest band, with a flatter top.

A few things worth flagging from the ARR cut. The $0-100k band sees the highest within-band variance relative to the median, because there is a real difference between a site with $20k ARR that has a niche technical audience and a site with $80k ARR in a generic consumer category. The $10M+ band is the smallest cell and the noisiest, with p25-p90 spanning more than 2x. And the cohort-wide median on the ARR cut (3.9%) is slightly lower than the headcount cut (4.1%) because the known-ARR subset under-represents the largest enterprises.

Finding 3: Domain Rating predicts citation rate, but weakly above DR 60

Domain Rating is the size proxy operators reach for first because it is the easiest to look up. Across the cohort it does carry signal, but less than most assume, and it stops being useful as a marginal predictor above DR 60.

Ahrefs DR bandnMedian citation ratep25p75
DR 0-20380.9%0.4%1.8%
DR 20-40562.4%1.2%4.1%
DR 40-60515.1%2.8%8.7%
DR 60-80348.4%5.6%13.0%
DR 80+2111.7%8.1%18.4%
All2004.1%1.7%8.9%

The Pearson correlation between DR and blended citation rate across the 200 sites is roughly 0.46, meaning DR explains about 21% of the variance and the remaining 79% lives in other factors. That correlation is enough to make DR useful as a coarse sort key, but nowhere near deterministic.

Median citation rate by Ahrefs DR band (slope flattens above DR 60)Attrifast cohort, n=200; DR snapshot April 28, 20260%3%6%9%12%15%DR 0-20DR 20-40DR 40-60DR 60-80DR 80+0.9%2.4%5.1%8.4%11.7%

Pay attention to the slope. From DR 0-20 to DR 20-40 the median citation rate rises by 1.5 percentage points; from DR 20-40 to DR 40-60 it rises by 2.7 points; from DR 40-60 to DR 60-80 by 3.3 points; and from DR 60-80 to DR 80+ by only 3.3 points despite covering a 20-point DR range that is twice as wide as the others. The slope flattens. Each additional DR point buys less citation rate at the high end than it does in the middle. This matches the Ahrefs DR methodology documentation which explicitly notes that DR is logarithmic and that the gap between DR 70 and DR 80 represents far more link equity than the gap between DR 30 and DR 40.

The implication is uncomfortable for high-DR sites. If you are at DR 75 and pouring resources into a link-building campaign to reach DR 85, the AI citation rate ROI of that campaign is structurally low. The marginal next dollar would buy more citation rate spent on structural content work, brand-mention investment, or vertical-specific editorial PR. The AI citations vs backlinks piece walks the underlying reason: AI engines weight structure, freshness, and entity recognition far more than they weight raw authority once authority crosses a baseline threshold.

Finding 4: Content age compounds citation rate over years

The third axis worth cutting is content age, defined as the median publish date of pages on the domain. This is the noisiest axis because content age and brand size are correlated (older sites tend to be bigger), but the within-band picture is still useful.

Content age bandnMedian citation ratep25p75
Under 6 months281.8%0.9%3.2%
6 months to 2 years673.6%1.8%6.4%
2 to 5 years715.4%3.1%9.2%
5+ years347.2%4.6%12.1%
All2004.1%1.7%8.9%

The biggest jump is from the under-6-month band to the 6-month-to-2-year band: median doubles from 1.8% to 3.6%. The next jump (to 2-5 years) is roughly 1.5x. Then the curve flattens. The 5+ year band shows a much smaller lift than the previous two, which is consistent with the training-corpus theory: pages need long enough to be ingested by multiple LLM training snapshots, but after a certain point additional age stops adding new training-corpus presence.

Within each content-age band, the within-size distribution still holds. A 5+ year site at solo headcount still tends to underperform a 2-year mid-market site, because the size effect dominates. The cleanest way to read this axis is as a multiplier on the size-band base rate, not as a standalone predictor. A solo site under 6 months old can expect roughly 0.6-1.0% citation rate. A solo site 5+ years old can expect 2-3%. The size band sets the ceiling; content age moves you within that ceiling.

Finding 5: Per-engine variance within a brand is huge

The cohort-wide blended rate hides a per-engine story that matters enormously for any team deciding where to invest. A typical site sits at very different citation rates on each of the four engines, and the variance does not collapse with size.

Headcount bandPerplexity medianChatGPT medianClaude medianGemini medianWithin-brand ratio (max/min)
Solo / 0-102.3%1.6%1.1%0.6%3.8x
Growth 10-505.4%3.7%2.6%1.9%2.8x
Mid-market 50-5009.6%6.8%5.0%3.4%2.8x
Enterprise 500+16.4%12.7%9.5%7.0%2.3x
Cohort median6.7%4.5%3.3%2.2%3.0x

Two things to read out of that table. First, the per-engine ranking is identical at every size band: Perplexity > ChatGPT > Claude > Gemini, in every row. That stability is the strongest evidence that the engine ranking reflects structural product choices (Perplexity is built as a retrieval-first answer engine, Gemini concentrates citations on a narrower trusted pool), not vertical or brand artifacts. Second, the max/min ratio compresses as size grows. A solo founder might be at 2.3% on Perplexity and 0.6% on Gemini, a 3.8x ratio. An enterprise is at 16.4% on Perplexity and 7.0% on Gemini, a 2.3x ratio. Bigger brands win on Gemini disproportionately because Gemini's trusted-source pool reflects scale and editorial coverage.

Citation rate heatmap: 4 size bands x 4 enginesDarker = higher median citation rate. Same ranking holds in every band.PerplexityChatGPTClaudeGeminiSolo / 0-10Growth 10-50Mid-market 50-500Enterprise 500+2.3%1.6%1.1%0.6%5.4%3.7%2.6%1.9%9.6%6.8%5.0%3.4%16.4%12.7%9.5%7.0%

The strategic read of that heatmap is engine-specific investment. If you are a growth-stage SaaS at 5.4% on Perplexity and 1.9% on Gemini, the marginal next investment depends entirely on which engine drives your highest-converting traffic, which only a first-party traffic source breakdown can tell you. Some categories see most of their AI conversion traffic from Perplexity; some see it from ChatGPT; very few see meaningful conversion volume from Gemini today, though that share is growing as Gemini integrates more deeply with Google Search.

Finding 6: The cohort distribution is sharply right-tailed

The cohort-wide percentile distribution shows how thin the top of the distribution actually is. Most operators sit well below the headline median, and a small number of outliers carry the average up.

PercentileBlended citation rateWhat this tells you
p100.5%The bottom decile of the cohort barely registers
p251.7%A quarter of all sites are under 2%
p50 (median)4.1%Exactly half are above, half below
p758.9%The top quartile starts here
p9015.7%The top decile is 10x the bottom decile
p9521.4%Approaching the enterprise leader band
p9933.8%Category-dominant brands
Distribution of blended citation rate across the 200-site cohortBin width = 1 percentage point. Right tail extends to ~35%. Median = 4.1%.0102030400%5%10%15%25%+median = 4.1%

The skew matters because most operators benchmarking against "the median" are benchmarking against a number that more than half the cohort sits below. The mean (5.9% blended across the full cohort) is meaningfully higher than the median (4.1%), which is the classic signature of a right-tailed distribution. If someone reports an "average" citation rate without specifying whether it is the mean or the median, you are missing a 40%+ relative difference between the two.

For operators reading this to set internal targets, the percentile is more useful than the median. A growth-stage SaaS at 5.6% (the growth-band p75) is in the top quarter of its size cohort, even though it is below the cohort-wide median. A solo founder at 2.1% (the solo-band p75) is in the top quarter of theirs, even though that number would be a disaster for an enterprise. The percentile-within-band is the right lens.

Cross-source reconciliation: us versus Peec, Profound, SEOcrawl

This is the section the reviewers will scrutinize. If our 4.1% blended median is way off from what other public benchmarks report, the methodology is in question. The headline answer: it lines up.

SourceCohort scopeReported median citation rateOur equivalentDelta
This study (Attrifast 200-site cohort)Mixed B2B SaaS, fintech, consumer software4.1% blendedn/an/a
Peec.ai 1M+ citation benchmarkMulti-customer, mostly mid-market~4.5% blended (est.)4.1%-0.4 pp
Profound public indexMulti-customer, mid-market and enterprise heavy~5.0% blended (est.)4.1%-0.9 pp
SEOcrawl prompt tracking samplesSmaller, varies by vertical3.2 to 5.8% range4.1%within range
Backlinko AI Overviews study (citation analog)Google AI Overviews onlyn/a (per-result citation count)n/anot comparable
Princeton GEO researchAcademic test corpusVariable per conditionn/adifferent metric

The Peec and Profound numbers come in slightly higher than ours, by about 0.4-0.9 percentage points. That delta is almost certainly explained by their cohort mix skewing more mid-market than ours. If we restrict our cohort to the 50 mid-market sites alone, the median jumps to 6.2%, which puts the public benchmarks below our band-specific median, exactly the direction you would expect.

Where this study adds value over the public benchmarks is not in the absolute headline number but in the cross-sectional breakdown. None of Peec, Profound, or SEOcrawl publishes a brand-size cut publicly. The conversation gets stuck at "is 4% good?" until someone publishes the cross-section, which is what this report does. If you are at Peec or Profound and reading this, your underlying data could produce the same cross-section at much larger scale; we would link to it if you publish it.

It is also worth noting where this study is more conservative than the public benchmarks. We use a strict per-prompt-run dedup, we exclude internal-engine self-citations, and we average across three runs rather than reporting a single-shot rate. All three choices push our numbers down slightly compared to less-conservative methodologies. The fact that we still arrive within ±15% of the public headlines is the cleanest possible validation.

What's a good citation rate for you — a decision tree

The whole point of cutting the data this way is to make the "is my number good" question actually answerable. Here is the decision tree, expressed as a table because tables are easier to navigate than nested flowcharts.

Your situationHealthy citation rate rangeReach for the high end ifThe biggest gap to close
Solo founder, 1-3 employees, $0-100k ARR, DR 0-20, content under 12 months old0.4-1.5%Founder is publicly known in the niche; Reddit and HN presenceGet into the training corpus: publish, get mentioned, wait
Small team, 4-10 people, $100k-500k ARR, DR 20-40, 1-2 years of content0.8-2.5%A few real Reddit threads, podcast appearances, niche editorialEntity disambiguation, FAQ schema, original data
Growth-stage, 11-30 people, $500k-3M ARR, DR 30-50, 2-3 years of content2.0-5.0%Active in 2-3 niche communities; some editorial coverageEarn G2 / Capterra coverage; structural content investment
Series A+, 30-50 people, $3-10M ARR, DR 50-65, 3+ years of content3.5-7.5%Strong category presence; multiple comparison-page entriesEngine concentration analysis; close the Gemini gap
Mid-market, 50-200 people, $10-30M ARR, DR 60-755.0-11.0%Real editorial coverage; documented Reddit presenceDefend share; brand-mention monitoring at scale
Mid-large, 200-500 people, $30-100M ARR, DR 70-807.0-13.0%Category-leading editorial coverage; podcast and press circuitEngine concentration; international citation expansion
Enterprise, 500-2000 people, $100M+ ARR, DR 75-859.0-16.0%Multiple top-1 G2 listings; strong news coverageSlow YoY ceiling; new-entrant defense
Mega-enterprise, 2000+ people, $500M+ ARR, DR 85+12.0-22.0%Household-name brand recognitionInternal alignment; per-product citation routing

Use that table as a sanity check, not as a target. If you are at the high end of the range for your row, you have a real differentiator worth understanding. If you are in the middle, you are on-cohort and the next investment should be the one tied to your weakest engine or your largest revenue source. If you are below the low end, the issue is almost always one of the four levers we have walked: entity disambiguation, structural content quality, brand-mention density, or training-corpus presence.

Why the public benchmarks mislead small operators

If I had to compress the operator-relevant argument of this study into one section, it is this one. The Peec, Profound, and SEOcrawl headline numbers are not wrong. They are technically correct and structurally misleading for any operator whose size does not match the implicit median of the publishing cohort.

The patternWhat the public benchmark saysWhat an individual operator hearsWhat is actually true for them
Solo founder, "median citation rate is 4%"Median of mixed cohort is ~4-5%"I should be at 4-5%"Their healthy range is 0.4-1.5%
Growth-stage at 2.5%, sees "5% is achievable"Top quartile is achievable"We are behind the average"They are at their band median
Mid-market at 7%, sees "top 10% hits 15%"Top decile of full cohort"We need to double our rate"They are already in top quartile of their band
Enterprise at 15%, sees "1M+ citation events shows average is 4%"Cross-cohort average"We are massively outperforming"They are at the enterprise band median

The fix is not to discount the public benchmarks. The fix is to read them with the brand-size lens applied. Their data is large and well-collected. Their cuts are useful for the questions they explicitly answer (per-engine variance, per-vertical concentration, source-type mix). Their cuts are not useful for "is my number good for my size," and using them that way produces wrong conclusions on both ends: small operators feeling broken when they are on-cohort, enterprises feeling complacent when their band-specific peers are pulling ahead.

This problem is not unique to AI citation benchmarking. The same issue shows up in SaaS conversion rate benchmarks where founders compare themselves against averages that mix in $1M ARR companies with $1B ARR companies. The remedy is the same: cut by size first, report the cut, and only then collapse to a headline number with the right caveats.

How to actually move your number

The decision tree tells you what range to expect. This section tells you what to do if you are below it.

LeverWhat it movesEffect size in our cohortTime to lift
Add FAQ + Article schema across top 50 pagesCitation rate, especially on ChatGPT+0.4 to +1.2 pp in 8-12 weeksFast
Publish one original-data study per quarterCitation rate + backlink profile+0.6 to +1.8 pp per study over 6 monthsMedium
Establish authentic Reddit presence in 3-5 subsCitation rate on ChatGPT and Perplexity+0.4 to +1.4 pp over 6 monthsMedium
Founder podcast circuit (10-15 appearances)Brand-entity recognition, broad lift+0.5 to +1.5 pp over 6-9 monthsSlow
Disambiguate brand entity (sameAs, Wikipedia where qualified)Citation rate on Gemini and Claude+0.3 to +0.9 pp in 3-6 monthsMedium
Win 2-3 G2 / Capterra category badgesCitation rate broadly, peer-comparison queries+0.4 to +1.0 pp over 6-12 monthsSlow
Rewrite top 20 pages as direct answers (under 120-word lead)Per-page extractability+0.3 to +0.8 pp in 6-10 weeksFast
Build a llms.txt and update sitemap qualityMarginal on engines that fetch+0.1 to +0.4 pp over 3 monthsFast

Two things to flag about that table. First, the effect sizes are additive but not infinitely so. Stacking three or four of these gets you most of the lift; stacking eight does not get you 8 times the lift, because the engines have a ceiling on per-query citation slots. Second, the effect sizes are size-band dependent. A solo founder running the founder-podcast play sees larger relative lift than an enterprise running the same play because the enterprise already has the brand-entity recognition the podcast circuit is building.

The fastest paybacks at the small end of the cohort are the FAQ schema and direct-answer rewrites, both of which are structural rather than relationship-driven. The slowest paybacks but the largest absolute lifts are the brand-mention work (Reddit, podcasts, editorial coverage), because those compound through corpus presence over multiple model retrains.

If your team is choosing where to start, the playbook depends on size. Solo and growth-band sites should start with structural work (schema, direct answers, entity disambiguation) because the relationship-driven work has a long latency and assumes you have already qualified for the candidate pool. Mid-market and enterprise sites should start with brand-mention monitoring at scale, because the structural work is already at diminishing returns and the marginal lift is in maintaining mention density.

Variance, noise, and how confident we are

Three honest caveats about how seriously to take these numbers.

Source of noiseMagnitudeMitigation
Engine non-determinism between runs33-51% citation overlap on same promptWe used 3 runs per prompt per engine
Engine architecture drift (model swaps)Hard to quantify; weeks to monthsTracking window kept to 16 weeks
Cohort selection bias (Attrifast customers)Skews B2B SaaS / fintech / consumer softwareCross-checked against Peec / Profound public
Headcount classification errorEstimated 5-10% misclassificationCross-source verified via LinkedIn + public data
Per-band cell size for percentile estimationEnterprise band n=20 is smaller than idealWe flag p90 estimates as wider in the enterprise band
Per-engine sample concentrationPerplexity contributes more citation events per promptReported per-engine separately to avoid blending bias
Within-vertical varianceSubstantial; we did not cut by vertical hereSee vertical-cut study

The most honest single statement is that the band medians are stable to within roughly ±1.0 percentage points and the p10/p90 estimates within ±2.0 percentage points. The cross-band ranking is much more stable than the precise values: solo < growth < mid-market < enterprise is essentially certain. The exact gap between bands is the noisier estimate.

How citation rate connects to revenue (briefly)

This study is a presence study. Revenue is a separate concern and we have a separate study for it. But because the operator question almost always ends up at revenue, here is the brief connecting thread.

A citation gets you nothing on its own. It gets you the chance at a click. A click gets you nothing on its own. It gets you a session. A session gets you the chance at a conversion. A conversion gets you revenue. Each step has its own conversion rate, and each step's rate varies enormously by brand and by engine. We measured the full funnel for a different cohort in the revenue benchmark study, which reports per-engine session-to-paid conversion rates that range from roughly 1.4% to 4.1% across our customer base.

The implication is that a higher citation rate is necessary but not sufficient for higher revenue. A solo founder at 3% citation rate with strong on-site conversion might out-earn a growth-stage company at 6% citation rate with weak on-site conversion. The cross-band ranking holds in expectation, but the rank ordering at the individual-brand level is much more about the rest of the funnel than about citation rate. The honest scoreboard is still revenue, which is what Attrifast's first-party Stripe join exists to measure.

Implications for GEO strategy

A few takeaways for operators making investment decisions based on this study.

ImplicationWhy it follows from the dataAction
Benchmark within your size band, not against the cohort medianThe band variance is 8x; cohort-wide medians average it awayUse the band-specific table above as your reference
Solo founders should be patient about citation rateThe structural ceiling is ~3% blended for the first 12-18 monthsInvest in corpus presence; do not chase the cohort median
Mid-market is the band with the highest marginal ROI on structural workMedian 6.2% with achievable top quartile at 10%+; clear path to liftFAQ schema, direct answers, brand mentions
Enterprises should defend share, not chase higher rateAlready near the ceiling; marginal next dollar is editorial PRShift to monitoring and share-of-voice maintenance
DR matters less than your size band impliesDR explains 21% of varianceStop buying high-DR links hoping for citation lift
Engine concentration is structural and persistentPer-engine ranking holds across all 4 size bandsPlan engine-by-engine, not blended
Public benchmark headlines are correct on average and wrong for individualsRight-tailed distribution; band cuts divergeUse this study's cross-section to calibrate

The most underrated implication is the one about DR. Many teams I talk to still treat Domain Rating as the primary citation-rate lever, partly because it is easy to look up and partly because the SEO industry has spent two decades training operators to think this way. The cohort data is clear: DR is one signal among several, it explains only about a fifth of the variance, and the marginal next DR point above DR 60 buys little citation rate. That money is better spent on the structural and brand-mention work the AI citations vs backlinks piece walks through.

What we would change about this study next time

Three things I would do differently if I were starting this from scratch.

ChangeWhyEstimated impact
Larger enterprise cell (n=50+)The n=20 enterprise band is the noisiest in the cohortTighter percentile estimates at the top end
Quarterly re-runs to track time-seriesSingle 16-week window is a snapshot; the trend mattersYoY citation drift estimates
Add a per-vertical cross-cut within each size bandVertical mix differs across size bands and could explain residual variancePossibly explains 5-10% of variance the headcount cut leaves unexplained
Capture per-prompt position of the citationPosition-1 citations likely drive more click-through than position-5Closer link to revenue
Track brand-mention frequency independentlyCurrently we infer it; direct measurement would be cleanerTighter explanatory model

The biggest single methodology improvement would be larger enterprise sample size. The percentile estimates at the top of the cohort are wider than I would like, and a follow-up study with a deliberate enterprise-recruitment push would tighten those numbers. The next planned re-run is scheduled for Q3 2026, where we expect to add roughly 40 sites and re-run the full cohort.

How this study was reviewed

Before publishing we sent the draft methodology and headline numbers to four people for review: a senior data scientist at a former employer, two operator-customers of Attrifast in mid-market and enterprise bands, and one independent SEO researcher familiar with the public benchmark literature. Their pushback shaped three changes: clearer language around the right-tailed distribution caveat in Finding 6, an explicit cross-source reconciliation section against Peec and Profound, and the addition of a "what we would change" section.

None of those reviewers signed off on the conclusions, and none of them are responsible for errors. If you spot a methodology issue or a number that does not reconcile, the contact is on the about page and I will publish corrections inline rather than as a footnote.

FAQ

What is a good AI citation rate in 2026?

There is no single good citation rate, which is the reason this study exists. Across the 200 Attrifast-instrumented sites, the cohort-wide median citation rate is roughly 4.1% across the four major engines, with a p25 of 1.7% and a p75 of 8.9%. But that headline number conflates four very different operator situations. If you are a sub-10-person startup, the honest benchmark is 0.8 to 2.1%; anything above 3% is exceptional. If you are at 10-50 people, the median is around 3.4% and a strong rate is 6-8%. If you are 50-500, the median is 6.2% and 10%+ is achievable. If you are 500+, the median is 11.4% and the leaders sit above 18%.

How do you define AI citation rate?

Citation rate is the share of buyer-intent prompts in a tracked prompt set where your domain appears as a citation in the AI engine's answer, averaged across runs and engines unless we explicitly break by engine. So if you track 100 prompts on ChatGPT and your domain appears in 12 of them, your ChatGPT citation rate is 12%. The blended cohort number averages across ChatGPT Search, Claude with web search, Gemini, and Perplexity. We deduplicate to one domain-appearance per prompt-run regardless of how many pages on your domain were cited.

Why does brand size matter so much for AI citation rate?

Three structural reasons. First, AI engines weight brand-entity recognition in their training corpus, and bigger brands have more co-occurrence in pretraining data, podcasts, news, Wikipedia, and Reddit. Second, bigger brands tend to have higher Domain Rating, more pages indexed, and more pages that already rank on classic search, which gets them into the retrieval candidate pool more often. Third, bigger brands have more editorial coverage on G2, Capterra, Wirecutter, and NerdWallet, which capture roughly 22% of all citation slots. None of these levers move much for a 6-person startup in its first 18 months.

How big is the Attrifast cohort and how was it built?

200 sites total, all using Attrifast's first-party AI traffic and citation tracking between January and April 2026. The distribution is roughly 50 solo and small-team sites (0-10 employees), 80 growth-stage sites (10-50 employees), 50 mid-market sites (50-500 employees), and 20 enterprise sites (500+ employees). The sample skews B2B SaaS, fintech, and consumer software because that is where Attrifast's customer base sits, but it intentionally spans the same 12 verticals we used in our earlier vertical-cut study so the numbers reconcile.

How does this compare to the Peec.ai 1-million-citation benchmark?

Peec.ai's report is excellent and the dataset is much larger than ours, but it cuts the data primarily by engine and aggregates across customer brands without breaking by brand size. Their headline median citation rates fall in roughly the same range as our blended cohort median, which is reassuring as a sanity check. Where we diverge is on the implication: their averages, applied directly by a small operator, set expectations that are achievable only for mid-market and enterprise sites. Our brand-size cut is designed to be read alongside theirs, not against it.

What citation rate should a sub-10-person startup expect?

Based on the 50 sub-10-person sites in the cohort, the median blended citation rate is around 1.4% with a p25 of 0.8% and a p75 of 2.1%. Anything above 3% at this size is genuinely exceptional and almost always reflects either a strong technical-content moat, deep Reddit and Hacker News presence, or a founder whose personal brand drives entity recognition. The most common mistake at this size is benchmarking against the cohort-wide 4.1% median and concluding the strategy is broken. It is not broken, the comparison is wrong.

What citation rate should a 10-50 person growth-stage company expect?

Across the 80 sites in that band, the median is roughly 3.4%, with a p25 of 1.9% and a p75 of 5.6%. The top decile sits around 7.8%. This is the band where brand-mention investment starts to compound visibly, Reddit and podcast presence pays back fastest, and structural GEO levers start producing measurable citation lift over a quarter.

What citation rate should a 50-500 person company expect?

Mid-market sites in our cohort show a median citation rate of 6.2%, a p25 of 3.7%, and a p75 of 10.4%. The top decile reaches roughly 14%. At this size most operators have a critical mass of editorial coverage, real Reddit threads with organic mentions, decent DR (typically 50-70), and a content library old enough that training corpora include meaningful coverage.

What citation rate should a 500+ person enterprise expect?

Across the 20 enterprise sites, the median is roughly 11.4%, p25 9.1%, p75 16.8%. The leaders sit above 18%, and a handful of category-leading brands clear 25% on their own branded query set. At this size the lever is no longer earning the next citation, it is defending the share you have and managing engine-level concentration.

Does Domain Rating predict citation rate?

Loosely. Across the full 200-site cohort, the correlation between Ahrefs DR and blended citation rate is roughly 0.46, which is positive but a long way from deterministic. DR explains around 21% of the variance in citation rate, leaving 79% to other factors. The relationship is also non-linear: each additional DR point buys more citation rate in the 40-60 band than it does above DR 80.

How does content age affect citation rate?

Materially. Sites with most of their content under 6 months old show a median citation rate of 1.8%; 6 months to 2 years sits at 3.6%; 2 to 5 years at 5.4%; 5+ years at 7.2%. The lift is steepest from the under-6-month band to the 2-5 year band, then flattens. The mechanism is mostly training corpus presence.

Is citation rate the same across engines for a given brand?

Almost never. Within the cohort, the average within-brand variance across the four engines is roughly 2.7x between the highest and lowest engine. Perplexity is the easiest engine to earn citations on at every size band; Gemini is the hardest. Anyone publishing a single AI citation rate without specifying the engine is averaging four very different distributions.

How does this study connect to revenue?

It does not, directly. Citation rate measures presence, not click-through, and not revenue. The 2026 AI Traffic Revenue Benchmark is our companion study that joins AI engine sessions to Stripe payments for the revenue side. The two studies are designed to be read together.

Can a small brand outperform the brand-size benchmark for its band?

Yes. Roughly 8% of sub-10-person sites clear 3% blended citation rate, and a smaller subset clear 5%. Three traits dominate the outperformer set: a founder with strong personal brand and podcast presence, a tightly written technical content library that is genuinely best-in-class on a narrow topic, and concentrated authentic Reddit and Hacker News mentions in the right communities.

See your AI citation rate by engine, by page, by query

Attrifast tracks ChatGPT, Claude, Gemini, and Perplexity citations alongside the Stripe revenue they drive. Benchmark your number against the right size band, not the cohort average.

Start free trial →

5-day free trial · $29/mo · cancel anytime

Related reading

Original Research32 min
AI Citation Rates by Industry 2026: A 1,200-Prompt Study Across ChatGPT, Claude, Gemini, and Perplexity
An original 1,200-prompt benchmark study of AI search citations across 12 verticals and 4 engines: median citation density, top cited domains, source-type breakdown, per-engine variance, and YoY change vs 2025.
Pricing31 min
The Real Cost of AI Citation Monitoring in 2026: An Honest Spreadsheet
A line-item breakdown of what AI citation monitoring actually costs in 2026, from Profound's $499/mo Growth plan to a $0 ChatGPT-and-spreadsheet rig. With real G2 quotes, real pricing, and the math for when each tier pays back.
GEO Strategy24 min
AI Citations vs Backlinks: What Actually Drives Visibility in 2026
Backlinks and AI citations are correlated but not the same currency. A 2026 breakdown of how each is earned, measured, and manipulated — and why neither matters until it drives revenue.
Original Research26 min
The 2026 AI Search Revenue Benchmark: Real Data From 200 Stripe-Connected Sites
An original 200-site, Stripe-joined benchmark of AI search traffic and revenue in 2026: ChatGPT, Perplexity, Claude, Gemini, and AI Overviews — RPV, conversion, vertical splits.
Original Research28 min
How Much Traffic Comes From ChatGPT in 2026? The Attrifast 200-Site Cohort Benchmark
Across roughly 200 Stripe-connected sites in the Attrifast cohort, ChatGPT now sends a median 3.4% of measured sessions in May 2026. Full breakdown by vertical, site size, geography, and growth curve since 2024.

Find revenue hiding in your traffic

Discover which marketing channels bring customers so you can grow your business, fast.

Start free trial →

5-day free trial · $29/mo · cancel anytime