Skip to main content
Free Tool

AI Bot
Checker

Is your host quietly turning away ChatGPT, Claude and Perplexity? Test 14 top AI crawlers against your site in 30 seconds, and get a ready-to-send message for your hosting company if anything is blocked.

No sign-up required
Tests 14 top AI crawlers
Fix message written for you
Enter URL
Check robots.txt
Test Live Access
Get Fix Message

Check AI Bot Access

Enter your website URL to test whether the top AI crawlers can reach it

Connecting to site...

What Does the AI Bot Checker Test?

The checker tests your site against 14 of the most important AI crawlers: GPTBot, OAI-SearchBot and ChatGPT-User from OpenAI, ClaudeBot, Claude-SearchBot and Claude-User from Anthropic, PerplexityBot and Perplexity-User, Meta-ExternalAgent, Applebot, Bytespider, CCBot, plus the Google-Extended and Applebot-Extended robots.txt controls.

Each bot is checked two ways. First, your robots.txt file is parsed to see whether the bot is blocked by a Disallow rule, either by name or by a wildcard. Second, a live request is sent to your homepage identifying as that bot, and the response is compared against a normal browser request. If the browser gets HTTP 200 but the bot gets a 403, a 429, or a dropped connection, something between the visitor and your page is rejecting that bot. That second test matters because some crawlers, Bytespider being the best-known example, simply ignore robots.txt. Server-level rules are the only thing that actually stops them, and the only thing that can silently block the bots you DO want.

If anything is blocked, the tool writes a ready-to-send message for your hosting company asking them to confirm whether the genuine crawlers are allowed, and to unblock them if not.

Why It Matters

AI search is where a growing share of buying decisions start. When someone asks ChatGPT, Claude or Perplexity to recommend a business like yours, the AI can only cite sites its crawlers can reach. Some managed hosts now block AI crawlers by default, silently, and the first you hear of it is when your business never appears in AI answers. We wrote up a real 19-day crawler log showing exactly what AI bots do on a site that lets them in.

This tool is for site owners who want a straight answer: can the AI crawlers that matter actually reach my site? And if not, who is blocking them, my robots.txt or my host?

The 14 AI Crawlers We Test

Every crawler on this list is documented by its operator and answers to a named robots.txt token. Twelve get the full two-way check: a robots.txt parse plus a live request to your homepage. The two marked robots.txt control never crawl under their own name, so only the robots.txt check applies.

Crawler Operator What it feeds How it's checked
GPTBot AI search indexing OpenAI Builds the index behind ChatGPT search and model training robots.txt live test
OAI-SearchBot AI search indexing OpenAI Powers ChatGPT search results and link citations robots.txt live test
ChatGPT-User Live user visits OpenAI Visits your pages live when a ChatGPT user asks about you robots.txt live test
ClaudeBot AI search indexing Anthropic Crawls for Claude, including search and model improvement robots.txt live test
Claude-SearchBot AI search indexing Anthropic Indexes pages to improve Claude search results and citations robots.txt live test
Claude-User Live user visits Anthropic Visits your pages live when a Claude user asks about you robots.txt live test
PerplexityBot AI search indexing Perplexity Builds the Perplexity answer-engine index and citations robots.txt live test
Perplexity-User Live user visits Perplexity Visits your pages live when a Perplexity user asks about you robots.txt live test
Meta-ExternalAgent Model training Meta Crawls for Meta AI training and product indexing robots.txt live test
Applebot AI search indexing Apple Powers Siri, Spotlight and Apple Intelligence answers robots.txt live test
Bytespider Model training ByteDance (TikTok) Collects training data for ByteDance AI models robots.txt live test
CCBot Model training Common Crawl Open web dataset used to train many AI models robots.txt live test
Google-Extended robots.txt control Google robots.txt switch for Gemini training and grounding robots.txt
Applebot-Extended robots.txt control Apple robots.txt switch for Apple AI model training robots.txt

Why isn't Grok on the list? xAI publishes no crawler documentation, and Grok's observed fetches identify as an ordinary browser rather than a named bot. With no token to match in robots.txt and no user agent to test, there is nothing a checker (or a firewall) can reliably verify, so we leave it out rather than report a result we can't stand behind.

FAQ

Frequently Asked Questions

Two things, for each of 14 top AI crawlers. First, it parses your robots.txt file and checks whether the bot is blocked by a Disallow rule, either named directly or caught by a wildcard. Second, it sends a live request to your homepage identifying as that bot and compares the response with a normal browser request. If the browser gets HTTP 200 and the bot gets a 403, a 429, or a dropped connection, something at the server, WAF or CDN level is rejecting that bot.

Requests presenting that bot's user agent are being turned away before they reach your page, while normal browser traffic gets through. That is a server, firewall or CDN rule rather than anything in your robots.txt, which means you usually cannot fix it yourself. The tool writes a message you can send to your hosting company asking them to confirm whether the real crawler is allowed and to unblock it if not.

It can be, and the tool is upfront about this. Our test requests come from our own servers, not from OpenAI's or Anthropic's networks. Some security systems verify crawlers by IP address and only reject impostors while letting the real, verified bot through. From the outside those two cases look identical. That is why the generated message asks your host a precise question: is the genuine crawler, coming from the provider's published IP ranges, allowed to crawl the site? Their answer settles it either way.

Allow the search and assistant crawlers, decide for yourself on the training-only ones. GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Claude-SearchBot and PerplexityBot decide whether your business can appear in ChatGPT, Claude and Perplexity answers, and AI assistants are now a real source of customers. Pure training crawlers like CCBot and Bytespider bring no direct traffic, so blocking those is a legitimate choice. Blanket-blocking everything makes you invisible to AI search.

Three usual suspects. Some SEO and security plugins add AI bot blocks as a default or one-click setting. Some hosts ship a managed robots.txt template with AI crawlers pre-blocked. And some site owners copied a "block all AI bots" snippet from a blog post in 2023 and forgot about it. Check your SEO plugin settings first, then ask your host whether they manage your robots.txt.

The big ones do. OpenAI, Google, Anthropic and Apple all document and honour their robots.txt tokens. Some training crawlers have a worse reputation, with Bytespider the best-known example of a bot widely reported to ignore robots.txt. That is exactly why this tool also runs the live server test: robots.txt tells you what you have asked bots to do, the live test tells you what your server actually enforces.

A means every AI search and assistant crawler can reach your site. C means only training-focused bots are blocked, which is a reasonable policy choice. D means one or two important search or assistant crawlers are blocked. F means several of the crawlers that power AI search answers cannot reach your site, and your business is unlikely to be cited by ChatGPT, Claude or Perplexity.

Results for each domain are cached for 15 minutes. That keeps repeat checks instant and stops the tool being used to bombard a site with requests, but it also means a change you made seconds ago will not show up straight away. The notice above your results tells you when the check actually ran and how long until a fresh one is available. Wait that out, run the check again, and you will see the effect of your change.

Yes. It makes around 15 ordinary GET requests to your homepage and robots.txt, deliberately staggered so no more than three arrive at once, with a pause between each batch. That is gentler than a single visitor browsing your site. It does not log in, post data, or probe for vulnerabilities, and it has no effect on your search rankings. Results are also cached per domain for 15 minutes, so repeat checks do not touch your site at all.

Hosting That Lets AI Search Find You

Every 365i hosting plan welcomes the AI search crawlers that send you customers, with WAF protection against the traffic that does not. Blocked bots can be reviewed and unblocked on request.