No black-box score. Here is exactly how 42 factors across seven categories build the GEO Score - with weightings, data sources, and honest limitations.
Seven categories. 42 weighted core factors, plus ~18 additional sub-checks (60 detail checks in total). Every factor corresponds to a real backend check in the scoring engine.
Not all factors carry equal weight. Structured data and technical fundamentals contribute most to the GEO Score.
Weightings are based on empirical observations across 9 AI engines (ChatGPT, Claude, Perplexity, Gemini, Copilot, DeepSeek, Grok, Z.AI, Kimi) over several months. Structured data (25%) and technical SEO (20%) dominate because AI systems primarily process machine-readable data - not natural-language text. The business data layer (5%) is deliberately low-weighted: missing feeds cost points, but perfect feeds barely lift the score.
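The weighting scheme above can be sketched as a simple weighted sum. Only the structured-data (25%), technical-SEO (20%), and business-data (5%) weights are stated in the text; the remaining four category names and weights below are illustrative placeholders chosen so the total reaches 100%, not the actual scoring configuration.

```python
# Hypothetical sketch of the GEO Score as a weighted category average.
# Weights for structured_data, technical_seo, and business_data come from
# the text; the other four are placeholders summing the total to 1.0.
CATEGORY_WEIGHTS = {
    "structured_data": 0.25,
    "technical_seo": 0.20,
    "content_quality": 0.15,   # placeholder
    "authority": 0.15,         # placeholder
    "freshness": 0.10,         # placeholder
    "ai_visibility": 0.10,     # placeholder
    "business_data": 0.05,
}

def geo_score(category_scores: dict[str, float]) -> float:
    """Combine per-category scores (0-100) into one weighted GEO Score."""
    assert abs(sum(CATEGORY_WEIGHTS.values()) - 1.0) < 1e-9
    # A missing category scores 0, so absent data always costs points.
    return sum(w * category_scores.get(cat, 0.0)
               for cat, w in CATEGORY_WEIGHTS.items())

print(geo_score({cat: 80.0 for cat in CATEGORY_WEIGHTS}))  # ≈ 80.0
```

Note how the structure mirrors the stated design intent: a perfect business-data layer can move the score by at most 5 points, while structured data alone accounts for a quarter of it.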
Honesty is part of the methodology. Here are the actual limits of our system.
We have no privileged access to the internal ranking algorithms of OpenAI, Anthropic, or Google. Our measurement is based on observable output behavior of the models.
A high GEO Score statistically increases the probability of citation - it does not guarantee it. AI answers are non-deterministic.
We measure domain-level visibility. No user profiles, no end-customer tracking - GDPR-oriented by design.
AI engine queries run on a typical 24-hour polling cycle. Real-time visibility is not technically measurable - no platform offers an API for it.
From technical data quality to measurable business impact - four levels that together provide a complete picture.
Foundation: Is the data machine-readable, complete, and fresh?
Is the company mentioned, cited, or recommended in AI answers?
How accurate and positive are AI statements about the company?
Does AI visibility translate to measurable traffic, leads, and revenue?
The GEO Score is not based on gut feeling. These publications form the empirical basis.
Aggarwal et al.
First systematic study on optimization for generative search engines. Defines citations, impressions, and share of voice as primary GEO metrics - basis for our factor categorization.
Stanford Human-Centered AI Institute
Hallucination rates by model and domain. Basis for our hallucination score and the calibration of our 9-engine test matrix.
Patronus AI / IBM Research
Detection benchmarks for factual errors in LLM outputs. Methodology adopted for our 8-layer hallucination detection (Layer 9 = optional AI-semantic extension).
Google LLC
Official specification for structured data. Defines which schema types Google (and thus Gemini) uses for answer generation.
Beconova is not an academic research institution. The cited sources underpin the theoretical foundations - the practical weightings derive from our own empirical measurements across 6+ months of production data.
If anything about our methodology is unclear, contact us directly. No buzzwords, no deflection.