Interactive Leaderboard

Top 100 Websites AI Readiness Leaderboard — April 2026

The only sortable, filterable ranking of how the world’s biggest websites handle AI crawlers. Real data from our multi-signal scanner: llms.txt, GPTBot / ClaudeBot / PerplexityBot rules, LLM knowability, Wayback training-data likelihood, and bot cloaking.

Based on the State of AI Crawlers 2026 research report. Data window: April 10, 2026. Sample: 100 sites.

100 sites ranked
30 average readiness score
31% have an llms.txt file
60% show AI bot cloaking

Check where YOUR site ranks

Run the same scanner on your domain and see how you compare to the top 100. Free, no signup, 10-30 seconds.

# | Domain | Score | Grade | llms.txt | Cloaking | Knowability | Training Data | Action

Methodology (short version)

We sampled the top 100 content-producing domains from the Tranco list (April 2026), excluding CDN and infrastructure domains. Each site was scanned with the ZeroKit AI Readiness Checker v2, which runs a multi-page crawl plus three extended signals: Wayback Machine history, a knowability proxy (Wikipedia + Common Crawl + DuckDuckGo), and four-user-agent cloaking detection. The scoring weights are: AI bot rules 30%, llms.txt 20%, structured data 25%, content citability 15%, AI meta directives 10%. The full methodology and limitations are in the research report.
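
To make the weighting concrete, here is a minimal sketch of how five per-category subscores could be combined into a single readiness score. Only the weights come from the methodology above. The function name, the dictionary keys, the assumption that each subscore is on a 0-100 scale, and the sample values are all hypothetical; the scanner's actual implementation may differ. The three extended signals are not among the listed weights, so the sketch leaves them out.

```python
# Weights in percent, as listed in the methodology (they sum to 100).
# The key names are illustrative, not the scanner's own identifiers.
WEIGHTS = {
    "ai_bot_rules": 30,
    "llms_txt": 20,
    "structured_data": 25,
    "content_citability": 15,
    "ai_meta_directives": 10,
}


def readiness_score(subscores: dict[str, float]) -> float:
    """Weighted average of per-category subscores.

    Assumes every subscore is on a 0-100 scale (an assumption of this
    sketch, not something the methodology states).
    """
    missing = WEIGHTS.keys() - subscores.keys()
    if missing:
        raise ValueError(f"missing subscores: {sorted(missing)}")
    for name in WEIGHTS:
        if not 0 <= subscores[name] <= 100:
            raise ValueError(f"{name} must be between 0 and 100")
    # Each category contributes weight * subscore; dividing by 100
    # converts the percent weights back to fractions.
    return sum(WEIGHTS[name] * subscores[name] for name in WEIGHTS) / 100


# Hypothetical site: partial bot rules, no llms.txt, some structured data.
example = {
    "ai_bot_rules": 50,
    "llms_txt": 0,
    "structured_data": 40,
    "content_citability": 20,
    "ai_meta_directives": 0,
}
print(readiness_score(example))  # 28.0
```

Under these assumptions, a site with no llms.txt file loses up to 20 points, and bot rules and structured data together account for more than half of the total.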

Related