Free tool

Can AI crawlers reach your site?

Enter your domain. See whether the crawlers behind ChatGPT, Perplexity, Claude, Gemini, and Copilot are allowed in, plus your sitemap and robots.txt discoverability.

ScopeWhole domain
Crawlers8 checked
CostFree

Reads your public robots.txt, sitemap, and homepage. Domain-level check, no data stored.

What we check

Access is the first gate.

If a crawler is blocked, that engine cannot cite you no matter how good your content is. This checker reads the signals that decide whether crawlers get in.

01 · AI answer engines

Whether the retrieval crawlers behind ChatGPT (OAI-SearchBot, ChatGPT-User), Perplexity, Claude, Google AI, and Copilot are allowed to reach your site in robots.txt.

02 · Training crawler policy

Whether GPTBot and Google-Extended are allowed. Blocking these limits model training, not retrieval, so it can be a deliberate choice. We flag it without penalizing your score.

03 · Discoverability

Homepage reachability, whether robots.txt exists, whether a sitemap is declared and reachable, and whether you publish an llms.txt.

FAQ

Crawler access, answered.

Can this tell me if my site is in ChatGPT's index?

No, and any tool that claims to is guessing. There is no public API for ChatGPT or Bing index membership. This checker verifies something concrete and actionable instead: whether the crawlers that feed those engines are allowed to reach your site, which is a precondition for being cited.

Which crawlers does it check?

OpenAI (OAI-SearchBot, ChatGPT-User, GPTBot), PerplexityBot, ClaudeBot, Googlebot and Google-Extended, and Bingbot. ChatGPT search and Copilot lean on the OpenAI and Bing crawlers respectively, and AI Overviews uses Googlebot.

Should I block GPTBot?

It depends on your goals. GPTBot is used for model training, not for ChatGPT search retrieval. Blocking it keeps your content out of training data while still allowing OAI-SearchBot to cite you. The tool flags GPTBot's status as information, not a penalty.

Why does crawler access matter for AI visibility?

If a crawler is blocked in robots.txt, that engine cannot fetch your pages, so it cannot cite them no matter how good the content is. Crawl access is the first gate. Everything else, structure, schema, authority, only matters once the crawler can get in.

Access is step one

See your real AI visibility.

Letting crawlers in makes you eligible. Zumi shows you what actually happens across every engine: where you are cited, where competitors beat you, and what to fix first.