Question 1

The gate — who enters the segment

Accepted Answer

An agency is listed only if all four hold. The gate cuts by tier, not by strength: a strong player that genuinely serves small business ranks where it earns, above us when it is better. SMB-primary: serves private clients / small business as its primary segment, not enterprise-only with SMB as a token line. Self-serve offer: has a transparent self-serve offer a buyer can purchase themselves — a published price or fixed package, no mandatory 'contact sales'. GEO substance: names GEO / AI-visibility as an actual service (names answer engines, citation work, structured data, measures AI answers) — relabeled generic SEO is listed but flagged. Live business: is a live, reachable business — real site, real contact.

Question 2

The seven axes

Accepted Answer

Every axis must pass the neutral-buyer test: would a small-business buyer agree it matters, not knowing who scores well on it? An agency's own raw AI-visibility is mostly domain age plus anchoring — a confound — so it is weighted lightly. The heavy weight goes to age-independent, checkable signals. M1 Own AI-visibility (9%): the agency is itself findable, and described correctly, in AI answers. M2 Method transparency (16%): publishes how it works and what it measures, not a black box. M3 Evidence verifiability (23%): named, specific, checkable proof over anonymous hype. M4 Pricing openness (12%): published price / range, not 'request a quote'. M5 AI reachability (10%): the agency's own site is reachable to AI crawlers — not blocked in robots.txt, server-rendered, ideally with an llms.txt. E1 Segment fit (12%): genuinely built for the private / small-business buyer. E2 Promise cleanliness (18%): no snake-oil — no guaranteed rankings, no fakery

Question 3

How are the axes weighted?

Accepted Answer

Measured axes (M1–M5) carry 70%; editorial axes (E1–E2) carry 30% — the reproducible core dominates by design, and judgment stays a labelled minority. M1 is the lightest (9%) because raw visibility is confounded by domain age; M3 is the heaviest (23%) as the most buyer-predictive, most anti-hype axis. M5 (10%) checks the agency's own site is reachable to AI crawlers — an AI-visibility shop that blocks them fails its own craft. Weights are v2 priors and move under calibration.

Question 4

The red-flag cap

Accepted Answer

An egregious claim — guaranteed rankings or placements, fake reviews offered, 'we manipulate AI' — caps the composite regardless of other scores. Snake-oil cannot be out-weighted by a slick site; it ceilings the agency.

Question 5

The focus penalty — a specialist who does everything isn't one

Accepted Answer

Some agencies that clear the gate are not GEO specialists but generalist combines — an SEO shop welded to a website factory, selling the whole stack at once with AI-visibility as one more item on the menu. We read that breadth as a trust signal pointing the wrong way: a 'specialist' who takes on everything is rarely deep in any one part, and GEO is the part most often bolted on as an afterthought. So the composite carries a fixed focus penalty, applied after the red-flag cap and shown on every entry.

Question 6

Reproducible by design

Accepted Answer

Measured axes carry the evidence snippet and source they came from; a missing finding is recorded as an explicit 'not found', never a guessed number. Every score is stamped 'data as of' a date, and the probe set (ChatGPT + Claude, search-grounded, clean session) is fixed so any external auditor can re-run it. Editorial axes and the top of the list go through human review before publish.

Question 7

What this ruler does NOT measure

Accepted Answer

Track-record depth and the volume of named cases favour older incumbents we cannot fully see; NDA-bound outcomes are unscored. We weight reproducible, buyer-checkable signal on purpose — and we list ourselves on the same ruler, shown exactly where we stand.

Axis	What it rewards	Weight
M1 · Own AI-visibility	the agency is itself findable, and described correctly, in AI answers	9%
M2 · Method transparency	publishes how it works and what it measures, not a black box	16%
M3 · Evidence verifiability	named, specific, checkable proof over anonymous hype	23%
M4 · Pricing openness	published price / range, not 'request a quote'	12%
M5 · AI reachability	the agency's own site is reachable to AI crawlers — not blocked in robots.txt, server-rendered, ideally with an llms.txt	10%
E1 · Segment fit	genuinely built for the private / small-business buyer	12%
E2 · Promise cleanliness	no snake-oil — no guaranteed rankings, no fakery	18%

How we measure

The gate — who enters the segment

The seven axes

The red-flag cap

The focus penalty — a specialist who does everything isn't one

Reproducible by design