Alpha: We're still building this tool. Results may be incomplete or inaccurate, and features may change. It's publicly accessible so others can try it and share feedback.

IP Ranges & User Agents

Complete reference of AI bot identifiers used by AI Search Index for detection and verification.

AI Training Crawlers

Crawlers used to collect data for training AI models:

Bot | Provider | User Agent Pattern
GPTBot | OpenAI | GPTBot/1.0
ClaudeBot | Anthropic | ClaudeBot/1.0
Google-Extended | Google | Google-Extended
DeepSeekBot | DeepSeek | DeepSeekBot
xAI-Grok | xAI | xAI-Grok
CCBot | Common Crawl | CCBot/2.0
Bytespider | ByteDance | Bytespider
cohere-ai | Cohere | cohere-ai
Applebot-Extended | Apple | Applebot-Extended
QwenBot | Alibaba | QwenBot
HuggingFaceBot | Hugging Face | HuggingFaceBot
AI2Bot | Allen AI | AI2Bot

AI Search & Chat Bots

Crawlers that fetch content in real time to answer user queries:

Bot | Provider | User Agent Pattern
ChatGPT-User | OpenAI | ChatGPT-User/1.0
ChatGPT-Agent | OpenAI | ChatGPT-Agent (+ Signature-Agent header)
OAI-SearchBot | OpenAI | OAI-SearchBot/1.0
OpenAI-Operator | OpenAI | OpenAI-Operator
Claude-Web | Anthropic | Claude-Web/1.0
PerplexityBot | Perplexity | PerplexityBot
MistralAI-User | Mistral | MistralAI-User
Gemini-Deep-Research | Google | Gemini-Deep-Research
Meta-ExternalAgent | Meta | meta-externalagent
Copilot | Microsoft | Copilot
Genspark-Webagent | Genspark | Genspark-Webagent/1.0
ArcSearch | The Browser Company | ArcSearch/1.0
YouBot | You.com | YouBot
KagiBot | Kagi | KagiBot
Brave-Leo | Brave | Brave-Leo
PhindBot | Phind | PhindBot
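
As an illustration of how the patterns in the two tables above might be applied, here is a minimal sketch that classifies a User-Agent header by case-insensitive substring match. The function name, the category labels, and the choice of substring matching are assumptions for illustration, not a description of how AI Search Index matches internally. Only a handful of tokens from the tables are included.

```python
from typing import Optional

# A small subset of the tokens listed in the tables; a real list would be longer.
TRAINING_TOKENS = ("GPTBot", "ClaudeBot", "CCBot", "Bytespider")
SEARCH_TOKENS = ("ChatGPT-User", "OAI-SearchBot", "PerplexityBot", "meta-externalagent")


def classify_user_agent(user_agent: str) -> Optional[str]:
    """Return "training", "search", or None for an unrecognized user agent."""
    ua = user_agent.lower()
    # Substring match, because real UA strings usually embed the bot token
    # inside a longer product string.
    if any(token.lower() in ua for token in SEARCH_TOKENS):
        return "search"
    if any(token.lower() in ua for token in TRAINING_TOKENS):
        return "training"
    return None
```

A user agent string is trivially spoofed, so a match here is only a starting point; the IP and signature checks described in the following sections are what raise confidence.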

Official IP Range Sources

We automatically fetch and validate IP ranges from these official sources:

OpenAI

Official JSON endpoints for GPTBot, OAI-SearchBot, and ChatGPT-User:

Perplexity

Official JSON endpoints for PerplexityBot and Perplexity-User:

Mistral

Official JSON endpoint for MistralAI-User:

mistral.ai/mistralai-user-ips.json
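
The response format of these endpoints is not shown on this page, so the following is only a sketch of the "fetch and validate" step under an assumed JSON shape: a top-level `prefixes` array whose entries carry either an `ipv4Prefix` or an `ipv6Prefix` key. That shape, the function name, and the sample payload (which uses documentation-reserved addresses) are all hypothetical; check the actual endpoint before relying on it.

```python
import ipaddress
import json
from typing import List, Union

Network = Union[ipaddress.IPv4Network, ipaddress.IPv6Network]


def parse_ip_ranges(payload: str) -> List[Network]:
    """Parse a JSON document into validated networks, skipping malformed entries."""
    data = json.loads(payload)
    networks: List[Network] = []
    for entry in data.get("prefixes", []):
        # Assumed key names; a real endpoint may use different ones.
        cidr = entry.get("ipv4Prefix") or entry.get("ipv6Prefix")
        if not cidr:
            continue
        try:
            networks.append(ipaddress.ip_network(cidr, strict=False))
        except ValueError:
            continue  # validation: drop anything that is not a CIDR block
    return networks


# Hypothetical payload using reserved documentation ranges.
SAMPLE = (
    '{"prefixes": [{"ipv4Prefix": "192.0.2.0/24"},'
    ' {"ipv6Prefix": "2001:db8::/32"},'
    ' {"ipv4Prefix": "not-a-cidr"}]}'
)
```

Parsing from a string keeps the network fetch separate, so the validation logic can be tested without touching the live endpoint.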

Anthropic (Claude)

Documented IP ranges from Anthropic's official documentation:

docs.anthropic.com/en/api/ip-addresses

IPv4: 160.79.104.0/23, 160.79.104.0/21
IPv6: 2607:6bc0::/48
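
A range match against the blocks listed above can be sketched with Python's standard `ipaddress` module. The constant and function names are illustrative. Note that 160.79.104.0/21 already contains 160.79.104.0/23, so the IPv4 check effectively covers 160.79.104.0 through 160.79.111.255.

```python
import ipaddress

# CIDR blocks copied from the list above.
ANTHROPIC_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in ("160.79.104.0/23", "160.79.104.0/21", "2607:6bc0::/48")
]


def ip_in_ranges(ip: str, ranges=ANTHROPIC_RANGES) -> bool:
    """Return True if the address falls inside any of the given networks."""
    addr = ipaddress.ip_address(ip)
    # Compare only against networks of the same IP version.
    return any(addr.version == net.version and addr in net for net in ranges)
```

For example, `ip_in_ranges("160.79.105.20")` is true, while `ip_in_ranges("160.79.112.1")` is false because it falls just past the end of the /21.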

Google

Google crawler verification documentation:

developers.google.com/search/docs/crawling-indexing/verifying-googlebot
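
The DNS-based verification that this documentation describes is usually done as a forward-confirmed reverse lookup: resolve the IP to a hostname, check the hostname's domain, then resolve the hostname back and confirm it yields the same IP. The sketch below shows that flow. The list of allowed suffixes is an assumption and should be confirmed against the linked page, and the function names are illustrative. The resolver functions are injectable so the logic can be exercised without real DNS.

```python
import socket
from typing import Callable, List, Sequence

# Assumed suffixes; confirm against the provider's documentation before use.
ALLOWED_SUFFIXES = (".googlebot.com", ".google.com", ".googleusercontent.com")


def _reverse(ip: str) -> str:
    # PTR lookup: returns the primary hostname for the address.
    return socket.gethostbyaddr(ip)[0]


def _forward(host: str) -> List[str]:
    # A/AAAA lookup: returns every address the hostname resolves to.
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def verify_by_dns(
    ip: str,
    allowed_suffixes: Sequence[str] = ALLOWED_SUFFIXES,
    reverse: Callable[[str], str] = _reverse,
    forward: Callable[[str], List[str]] = _forward,
) -> bool:
    """Forward-confirmed reverse DNS check for a claimed crawler IP."""
    try:
        host = reverse(ip).rstrip(".").lower()
        if not host.endswith(tuple(allowed_suffixes)):
            return False
        # The hostname must resolve back to the original address.
        return ip in forward(host)
    except OSError:  # socket.herror and socket.gaierror are OSError subclasses
        return False
```

The comparison is a plain string match, which is adequate for IPv4; IPv6 addresses may need normalizing through `ipaddress.ip_address` before comparing.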

HTTP Message Signatures

Some AI providers use cryptographic signatures (RFC 9421) for high-assurance bot verification:

ChatGPT-Agent (OpenAI)

ChatGPT's agentic features send signed requests with these headers:

Signature-Agent: "https://chatgpt.com"

Signature-Input: sig1=("@method" "@path"...);keyid="..."

Signature: sig1=:base64-encoded-signature:

Public key for verification: platform.openai.com/.well-known/agent-signing-key
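
To make the signed-request flow more concrete, the sketch below builds the RFC 9421 "signature base", the string that the signature is computed over, from a `Signature-Input` value. It is deliberately simplified: it handles only the `@method` and `@path` derived components plus plain header fields, and uses a regular expression instead of a full structured-field parser. The function name is illustrative, and the header values in the usage note are invented, since the real values are elided above. The cryptographic check itself (verifying the decoded `Signature` bytes against this base with the published key) needs a crypto library and is not shown.

```python
import re
from typing import Dict


def build_signature_base(
    signature_input: str, method: str, path: str, headers: Dict[str, str]
) -> str:
    """Build a simplified RFC 9421 signature base for one signature label."""
    # Split 'sig1=(...);params' into the label and the serialized parameters.
    _label, _, params = signature_input.partition("=")
    params = params.strip()
    match = re.match(r"\(([^)]*)\)", params)
    if match is None:
        raise ValueError("Signature-Input has no covered-component list")
    components = re.findall(r'"([^"]+)"', match.group(1))

    lowered = {name.lower(): value.strip() for name, value in headers.items()}
    derived = {"@method": method, "@path": path}

    lines = []
    for name in components:
        # KeyError here means a component this sketch does not support
        # or a covered header that is missing from the request.
        value = derived[name] if name.startswith("@") else lowered[name]
        lines.append(f'"{name}": {value}')
    # The final line repeats the covered components and parameters verbatim.
    lines.append(f'"@signature-params": {params}')
    return "\n".join(lines)
```

With a made-up input such as `sig1=("@method" "@path");keyid="example-key"`, a `GET` request for `/page` yields three lines: `"@method": GET`, `"@path": /page`, and the `"@signature-params"` line carrying the component list and `keyid`.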

Detection Confidence Levels

We assign confidence levels based on how bots are identified:

Method | Confidence | Description
HTTP Signature | Highest | Cryptographically verified (RFC 9421)
IP Range Match | High | IP matches official published ranges
Reverse DNS | High | DNS resolves to provider domain
User Agent Only | Medium | UA matches pattern, IP unverified
Client Fingerprint | Medium | Browser automation indicators detected
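
One way to read the table above is as a priority order in which the strongest available signal decides the level. The sketch below encodes that reading. How the signals are actually combined is not stated here, so the "strongest signal wins" rule, the `Evidence` type, and its field names are assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Evidence:
    """Hypothetical container for the detection signals in the table."""
    signature_verified: bool = False
    ip_in_official_range: bool = False
    reverse_dns_matches: bool = False
    user_agent_matches: bool = False
    automation_fingerprint: bool = False


def confidence(e: Evidence) -> Optional[str]:
    """Map evidence to a confidence level; the strongest signal wins."""
    if e.signature_verified:
        return "highest"
    if e.ip_in_official_range or e.reverse_dns_matches:
        return "high"
    if e.user_agent_matches or e.automation_fingerprint:
        return "medium"
    return None  # no signal at all: not identified as a bot
```

Under this reading, a request whose user agent matches and whose IP falls in a published range is rated "high", while the same user agent from an unverified IP stays at "medium".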

Automatic Updates

Our detection database is continuously updated:

  • IP ranges refreshed hourly from official JSON endpoints
  • New bot user agents added as they emerge
  • Detection patterns improved based on real traffic analysis
  • Agentic AI detection updated for new browser-based agents
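
The hourly refresh in the first bullet could be approximated with a small time-based cache, sketched below. The class name, the one-hour default, and the policy of serving stale data when a refresh fails are assumptions, not a description of the actual update pipeline. The fetch function and clock are injectable so the behavior can be tested without waiting or making network calls.

```python
import time
from typing import Callable, List, Optional


class RangeCache:
    """Cache a list of CIDR strings and refetch once the TTL has elapsed."""

    def __init__(
        self,
        fetch: Callable[[], List[str]],
        ttl_seconds: float = 3600.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._ranges: List[str] = []
        self._fetched_at: Optional[float] = None

    def get(self) -> List[str]:
        now = self._clock()
        if self._fetched_at is None or now - self._fetched_at >= self._ttl:
            try:
                self._ranges = self._fetch()
                self._fetched_at = now
            except Exception:
                if self._fetched_at is None:
                    raise  # nothing cached yet, so surface the failure
                # Otherwise keep serving the stale copy and retry next call.
        return self._ranges
```

Keeping the last good copy on failure avoids a situation where a temporary outage at a provider's endpoint causes every request from that provider to lose its IP match.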

Found a bot we're missing?

If you notice an AI bot that we're not detecting, please let us know. Contact us at support with the user agent string and any other details you have, such as source IP addresses or request headers.