Performance Analysis

Track content performance across generative AI engines over time using industry-standard KPIs to locate volatility and guide the next iteration.

Machine translation

This page was machine-translated from the Chinese original. Please report inaccuracies to mail@geo.fan.

TL;DR: The core of GEO performance monitoring is answering "How many times has my content been cited in ChatGPT / Claude / Gemini / Perplexity / Google AI Overviews, and what share of mainstream answers do I hold?" The industry has converged on five standard KPIs in 2026 (Mention Rate / Citation Rate / Share of Voice / Share of Answer / Position). Using unified metrics is what makes cross-platform and cross-period comparison meaningful.

The five standard KPIs

The industry (Search Engine Land, GenOptima, Averi, LLM Pulse, etc.) has largely standardized on these names:

1. Mention Rate

The proportion of AI answers in your test prompt set that "mention" your brand / content (including plain-text mentions without a link).

Mention Rate = answers mentioning you ÷ total queries × 100%

Example: out of 30 test prompts, "GEO.Fan" appears in 6 AI answers → Mention Rate = 20%.

2. Citation Rate

The proportion of AI answers in your test prompt set that include a clickable link to your domain. Higher value than Mention Rate because it drives traffic directly.

Citation Rate = answers containing your domain link ÷ total queries × 100%

In a given query category, your brand mentions as a share of all brand mentions. The core metric for competitor comparison.

SoV = your brand mentions ÷ all brand mentions in category × 100%

When cited, the proportion of the AI-generated answer text (word / token count) attributed to your content. Measures whether "AI just mentions you in passing or quotes you at length."

SoA = words from your content ÷ total answer word count × 100%

5. Position

When cited, your ranking in the AI answer's citation list. Similar to search ranking but with smaller samples and more volatility.

Designing the test prompt set

KPIs are only stable for cross-comparison when the prompt set is fixed. Recommendations:

Coverage: 20–30 prompts covering the questions your target readers are most likely to ask
Grouping: informational ("What is X"), comparative ("X vs Y"), operational ("How to do X"), recommendation ("X recommendations") — each as a quarter of the set
Stability: once defined, keep the prompt set unchanged for at least 3 months to avoid baseline drift
Update: review the set quarterly — retire stale questions, add new hot topics

Platform coverage

You should cover at least five traffic surfaces (sample each engine independently):

Platform	Type	Priority
ChatGPT (with Search)	Conversational AI + AI search	P0
Claude	Conversational AI	P0
Perplexity	AI search	P0
Google AI Overviews / AI Mode	Search AI summary	P0
Gemini	Conversational AI	P1
Copilot (Bing)	Conversational AI + search	P1
Qwen / Doubao / ERNIE Bot / Kimi	Chinese conversational AI	P0 for Chinese sites

Each engine has different citation preferences:

ChatGPT values authoritative sources and structured argumentation
Claude prefers depth, long context, clear citations, original insight
Gemini / Google AI Overviews is most tightly coupled with Google search results — traditional SEO signals still matter
Perplexity has the most transparent citation mechanism — strongly structured, high citation density content benefits most
Copilot mirrors Bing's index
Qwen / Chinese AI is more sensitive to Chinese sources and localized scenarios

Sampling methodology and cadence

Role	Frequency	Operation
Individual / small team	Monthly	Manually run prompt set, log in a spreadsheet
Mid-sized team	Weekly	Semi-automated scripts + monthly review
Enterprise	Daily / real-time	Commercial tools (Profound, LLMrefs, Superlines, Pulse) or self-hosted

Minimum viable workflow (monthly):

In each target engine, ask all prompts in the set
Record: brand mentioned? link included? citation position? word count of cited paragraph?
Calculate the five KPIs
Compare to last month, find the prompts with the largest swings
Compile a "top 3–5 pages to optimize this month" list

Industry benchmarks (for reference)

Looking at your own numbers isn't enough — you need to know what "normal" looks like. Some public benchmarks for comparison:

Benchmark	Value	Source
Zhihu citation rate in Chinese AI answers	29.9%	IT Home 2026
Reddit citation rate in English AI answers	40.1%	SparkToro 2025
Citation probability when content appears on 4+ platforms	×2.8	KDD 2024
Citation overlap between different GPT versions	only 7%	Writesonic 2025 LLM Citation Study
AI-cited visitor conversion vs. ordinary search	4.4–23×	BrightEdge 2025
Google AI Overviews one-year CTR change	-30% (search volume +49%)	BrightEdge 2025

Implications:

Multi-platform distribution is an amplifier — the same content appearing on 4+ authoritative platforms (Zhihu answer + WeChat public account + personal site + industry media) is ~3× more likely to be cited
Don't equate "cited in ChatGPT" with "cited in GPT-4 / GPT-5 alike" — citation preferences vary wildly across versions
AI traffic is small but precious — each AI-cited visitor converts like 4–23 ordinary search visitors

Full data citations in Resources.

Typical anomaly patterns

Pattern	Possible cause	Recommended action
Mention Rate drops across all platforms	AI engine model upgrade / training data switch	Wait 1–2 weeks; if no recovery, sample-recheck the site
Single page Citation Rate drops sharply	Content changed, external link broken, or reported	Roll back or rewrite the page
Single engine SoV drops	That engine's policy changed	Targeted optimization for that engine's characteristics
Position drops	A competitor published better content	Run Competitor Analysis and shore up the gap
Slow climb	Healthy state	Maintain current strategy
High Mention, low Citation	AI mentions you but doesn't link	Strengthen structured citation sources (schema.org, explicit canonical)

Reporting cadence

Weekly: glance at top-line Mention Rate / Citation Rate — only investigate anomalies
Monthly: full review of all five KPIs, set next month's 3–5 priority pages
Quarterly: strategy review, adjust the prompt set and content topic direction

Performance Analysis

On this page