Free Tool

AI Bot
Checker

Is your host quietly turning away ChatGPT, Claude and Perplexity? Test 14 top AI crawlers against your site in 30 seconds, and get a ready-to-send message for your hosting company if anything is blocked.

No sign-up required

Tests 14 top AI crawlers

Fix message written for you

Enter URL

Check robots.txt

Test Live Access

Get Fix Message

Connecting to site...

Bot-by-Bot Results

Send This to Your Hosting Company

Blocked in Your robots.txt

What Does the AI Bot Checker Test?

The checker tests your site against 14 of the most important AI crawlers: GPTBot, OAI-SearchBot and ChatGPT-User from OpenAI, ClaudeBot, Claude-SearchBot and Claude-User from Anthropic, PerplexityBot and Perplexity-User, Meta-ExternalAgent, Applebot, Bytespider, CCBot, plus the Google-Extended and Applebot-Extended robots.txt controls.

Each bot is checked two ways. First, your robots.txt file is parsed to see whether the bot is blocked by a Disallow rule, either by name or by a wildcard. Second, a live request is sent to your homepage identifying as that bot, and the response is compared against a normal browser request. If the browser gets HTTP 200 but the bot gets a 403, a 429, or a dropped connection, something between the visitor and your page is rejecting that bot. That second test matters because some crawlers, Bytespider being the best-known example, simply ignore robots.txt. Server-level rules are the only thing that actually stops them, and the only thing that can silently block the bots you DO want.

If anything is blocked, the tool writes a ready-to-send message for your hosting company asking them to confirm whether the genuine crawlers are allowed, and to unblock them if not.

Why It Matters

AI search is where a growing share of buying decisions start. When someone asks ChatGPT, Claude or Perplexity to recommend a business like yours, the AI can only cite sites its crawlers can reach. Some managed hosts now block AI crawlers by default, silently, and the first you hear of it is when your business never appears in AI answers. We wrote up a real 19-day crawler log showing exactly what AI bots do on a site that lets them in.

This tool is for site owners who want a straight answer: can the AI crawlers that matter actually reach my site? And if not, who is blocking them, my robots.txt or my host?

The 14 AI Crawlers We Test

Every crawler on this list is documented by its operator and answers to a named robots.txt token. Twelve get the full two-way check: a robots.txt parse plus a live request to your homepage. The two marked robots.txt control never crawl under their own name, so only the robots.txt check applies.

Crawler	Operator	What it feeds	How it's checked
GPTBot AI search indexing	OpenAI	Builds the index behind ChatGPT search and model training	robots.txt live test
OAI-SearchBot AI search indexing	OpenAI	Powers ChatGPT search results and link citations	robots.txt live test
ChatGPT-User Live user visits	OpenAI	Visits your pages live when a ChatGPT user asks about you	robots.txt live test
ClaudeBot AI search indexing	Anthropic	Crawls for Claude, including search and model improvement	robots.txt live test
Claude-SearchBot AI search indexing	Anthropic	Indexes pages to improve Claude search results and citations	robots.txt live test
Claude-User Live user visits	Anthropic	Visits your pages live when a Claude user asks about you	robots.txt live test
PerplexityBot AI search indexing	Perplexity	Builds the Perplexity answer-engine index and citations	robots.txt live test
Perplexity-User Live user visits	Perplexity	Visits your pages live when a Perplexity user asks about you	robots.txt live test
Meta-ExternalAgent Model training	Meta	Crawls for Meta AI training and product indexing	robots.txt live test
Applebot AI search indexing	Apple	Powers Siri, Spotlight and Apple Intelligence answers	robots.txt live test
Bytespider Model training	ByteDance (TikTok)	Collects training data for ByteDance AI models	robots.txt live test
CCBot Model training	Common Crawl	Open web dataset used to train many AI models	robots.txt live test
Google-Extended robots.txt control	Google	robots.txt switch for Gemini training and grounding	robots.txt
Applebot-Extended robots.txt control	Apple	robots.txt switch for Apple AI model training	robots.txt

Why isn't Grok on the list? xAI publishes no crawler documentation, and Grok's observed fetches identify as an ordinary browser rather than a named bot. With no token to match in robots.txt and no user agent to test, there is nothing a checker (or a firewall) can reliably verify, so we leave it out rather than report a result we can't stand behind.

01 What does the AI Bot Checker actually test?

Two things, for each of 14 top AI crawlers. First, it parses your robots.txt file and checks whether the bot is blocked by a Disallow rule, either named directly or caught by a wildcard. Second, it sends a live request to your homepage identifying as that bot and compares the response with a normal browser request. If the browser gets HTTP 200 and the bot gets a 403, a 429, or a dropped connection, something at the server, WAF or CDN level is rejecting that bot.

02 A bot shows as "rejected at server level". What does that mean?

Requests presenting that bot's user agent are being turned away before they reach your page, while normal browser traffic gets through. That is a server, firewall or CDN rule rather than anything in your robots.txt, which means you usually cannot fix it yourself. The tool writes a message you can send to your hosting company asking them to confirm whether the real crawler is allowed and to unblock it if not.

03 Could a "rejected" result be a false alarm?

It can be, and the tool is upfront about this. Our test requests come from our own servers, not from OpenAI's or Anthropic's networks. Some security systems verify crawlers by IP address and only reject impostors while letting the real, verified bot through. From the outside those two cases look identical. That is why the generated message asks your host a precise question: is the genuine crawler, coming from the provider's published IP ranges, allowed to crawl the site? Their answer settles it either way.

04 Should I allow AI bots or block them?

Allow the search and assistant crawlers, decide for yourself on the training-only ones. GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Claude-SearchBot and PerplexityBot decide whether your business can appear in ChatGPT, Claude and Perplexity answers, and AI assistants are now a real source of customers. Pure training crawlers like CCBot and Bytespider bring no direct traffic, so blocking those is a legitimate choice. Blanket-blocking everything makes you invisible to AI search.

05 My robots.txt blocks bots I never added rules for. Where did they come from?

Three usual suspects. Some SEO and security plugins add AI bot blocks as a default or one-click setting. Some hosts ship a managed robots.txt template with AI crawlers pre-blocked. And some site owners copied a "block all AI bots" snippet from a blog post in 2023 and forgot about it. Check your SEO plugin settings first, then ask your host whether they manage your robots.txt.

06 Do AI bots respect robots.txt?

The big ones do. OpenAI, Google, Anthropic and Apple all document and honour their robots.txt tokens. Some training crawlers have a worse reputation, with Bytespider the best-known example of a bot widely reported to ignore robots.txt. That is exactly why this tool also runs the live server test: robots.txt tells you what you have asked bots to do, the live test tells you what your server actually enforces.

07 What does the AI Visibility Grade mean?

A means every AI search and assistant crawler can reach your site. C means only training-focused bots are blocked, which is a reasonable policy choice. D means one or two important search or assistant crawlers are blocked. F means several of the crawlers that power AI search answers cannot reach your site, and your business is unlikely to be cited by ChatGPT, Claude or Perplexity.

08 I just changed robots.txt or my host unblocked a bot. Why are the results the same?

Results for each domain are cached for 15 minutes. That keeps repeat checks instant and stops the tool being used to bombard a site with requests, but it also means a change you made seconds ago will not show up straight away. The notice above your results tells you when the check actually ran and how long until a fresh one is available. Wait that out, run the check again, and you will see the effect of your change.

09 Is the check safe to run on my site?

Yes. It makes around 15 ordinary GET requests to your homepage and robots.txt, deliberately staggered so no more than three arrive at once, with a pause between each batch. That is gentler than a single visitor browsing your site. It does not log in, post data, or probe for vulnerabilities, and it has no effect on your search rankings. Results are also cached per domain for 15 minutes, so repeat checks do not touch your site at all.

Hosting That Lets AI Search Find You

Every 365i hosting plan welcomes the AI search crawlers that send you customers, with WAF protection against the traffic that does not. Blocked bots can be reviewed and unblocked on request.

Get Started Contact Us

AI Bot
Checker

Check AI Bot Access

Bot-by-Bot Results

Send This to Your Hosting Company

Blocked in Your robots.txt

What Does the AI Bot Checker Test?

Why It Matters

The 14 AI Crawlers We Test

Frequently Asked Questions

Hosting That Lets AI Search Find You

AI BotChecker

Check AI Bot Access

Bot-by-Bot Results

Send This to Your Hosting Company

Blocked in Your robots.txt

What Does the AI Bot Checker Test?

Why It Matters

The 14 AI Crawlers We Test

Frequently Asked Questions

Hosting That Lets AI Search Find You

AI Bot
Checker