OpenAI·training crawler
GPTBot
Every GPTBot request to findloc.ai is logged via Next.js middleware. This page is a live view of that data — refreshed hourly, no manual updates.
36.0k
lifetime hits
36.0k
hits · 30d
197
hits · 24h
Daily hits · last 30 days
2026-05-0630 days2026-06-04
Most-crawled URLs
About this crawler
GPTBot is OpenAI’s training crawler. It re-fetches sites repeatedly to refresh the corpus used to train the next ChatGPT model. By far the highest-volume AI crawler on the open web in 2026 — and on findloc.ai specifically, it accounts for ~74% of all AI traffic.
How this page works
- Every request to findloc.ai passes through a Next.js middleware.
- If the User-Agent matches a known AI bot pattern, one row is inserted into the
ai_crawler_visitsPostgres table. Fire-and-forget so the bot’s response isn’t slowed. - This page queries that table via three SECURITY DEFINER RPCs (
ai_bot_hit_counts,ai_bot_daily_hits,ai_bot_top_paths) on every render. - Page response is cached for 1 hour via Next.js ISR — so DB load is bounded even under load, and the freshest number is always at most 60 minutes old.
live Supabase aggregates · refreshed hourly · CC-BY 4.0