Common Crawl·shared-dataset crawler
CCBot
Every CCBot request to findloc.ai is logged via Next.js middleware. This page is a live view of that data — refreshed hourly, no manual updates.
1
lifetime hits
1
hits · 30d
0
hits · 24h
Daily hits · last 30 days
2026-05-0630 days2026-06-04
Most-crawled URLs
About this crawler
CCBot is Common Crawl’s shared-dataset crawler. Common Crawl's crawler. Common Crawl is the public corpus most large language models train on. Being in Common Crawl means being seen by every major LLM lab indirectly.
How this page works
- Every request to findloc.ai passes through a Next.js middleware.
- If the User-Agent matches a known AI bot pattern, one row is inserted into the
ai_crawler_visitsPostgres table. Fire-and-forget so the bot’s response isn’t slowed. - This page queries that table via three SECURITY DEFINER RPCs (
ai_bot_hit_counts,ai_bot_daily_hits,ai_bot_top_paths) on every render. - Page response is cached for 1 hour via Next.js ISR — so DB load is bounded even under load, and the freshest number is always at most 60 minutes old.
live Supabase aggregates · refreshed hourly · CC-BY 4.0