57% of US local markets had their #1 AI recommendation change in a single week
We ran 41,400 queries through ChatGPT, Perplexity, Claude, and Gemini across 690 US local-business markets, then ran them again seven days later. In 57% of markets the AI named a different business as #1. HVAC was worst at 69%; wedding venues most stable at 43%. Nobody has published this before — here is the data, the methodology, and what businesses should actually do about it.
The headline: 57% of US local markets had a different #1 AI recommendation this week
Every Tuesday findloc.ai asks ChatGPT, Perplexity, Claude, and Gemini 15 real customer-search questions per market — "best hotels in Chicago", "top-rated dentists in Austin", "recommended HVAC company in Phoenix", etc. Across 100 US MSAs and 7 industries, that's 690 markets and about 41,400 queries per week.
This past week (2026-W27) we compared each market's #1-cited business against the prior week (2026-W26). The AI's top recommendation had changed in 391 of the 690 markets — 57%.
AI recommendations are more volatile than most business owners realise. The rule of thumb for Google search — "you rank today, you rank tomorrow" — does not hold for ChatGPT, Perplexity, Claude, or Gemini.
Volatility by industry — HVAC is a coin flip, wedding venues are the calmest
Volatility varies sharply by vertical. Higher-consideration, higher-price categories are more stable — probably because there is more written content per business, so the models have thicker evidence to keep naming the same candidates. Impulsive or price-competitive categories flip more:
| Industry | Markets tracked | Markets where #1 changed | Change rate |
|---|---|---|---|
| HVAC contractors | 100 | 69 | 69.0% |
| Motels | 100 | 66 | 66.0% |
| Personal-injury lawyers | 93 | 60 | 64.5% |
| Dentists | 100 | 60 | 60.0% |
| Hotels | 97 | 48 | 49.5% |
| Med spas | 100 | 45 | 45.0% |
| Wedding venues | 100 | 43 | 43.0% |
One takeaway that surprised us: not a single industry stayed below 40% volatility. Even wedding venues — a decision people research for months — showed the AI shuffling its #1 recommendation in 43 of 100 markets in one week.
The seven brands the AI stopped naming this week
Aggregate percentages are abstract. Here are specific businesses that were the top-cited answer across all four AI engines in their market last week and are no longer in the top 20 this week. Nothing about these businesses changed in the physical world — the AI just started naming other names.
- InterContinental Miami — Miami hotels — was #1 with 20 citations across the four engines last week. Zero citations this week.
- Hotel Nikko San Francisco — SF hotels — dropped from #1 to #15.
- Moonrise Hotel (St. Louis) — St. Louis hotels — from #1 to unranked.
- BEST WESTERN Oxnard Inn — Oxnard hotels — from #1 to unranked.
- The Tinsmith (Madison, WI) — wedding venues — from #1 to #27.
- Cooper Hurley Injury Lawyers (Virginia Beach) — personal-injury lawyers — from #1 to #18.
- POSH Medical Spa (Reno) — med spas — from #1 to #15.
None of these seven businesses closed, changed pricing, or lost their Google Business listing this week. The AI simply reweighted which candidates it named. That means the underlying "we're the best in town" position was never durable in the first place.
Why is AI so much more volatile than Google?
Three overlapping mechanics, in decreasing order of impact:
- Every answer is a fresh sample. Google shows a ranked list from an index that changes at hours or days of latency. ChatGPT and friends synthesise a new answer per query from a mix of live-search results and training-data recall. The sampling step is not deterministic — even the same query 20 minutes apart can produce different candidates.
- Small query wording changes pull different candidates. "Best hotels in New York" and "top-rated hotels in Manhattan" hit different indexed passages. The 15 curated queries we ask per market include wording variations on purpose — a business that only surfaces for one wording will show up in fewer of the 15 queries and score lower.
- Search-tool re-ranking is opaque and non-deterministic at the top. The four engines all use some form of retrieve-then-rerank. Small changes in the retrieved-passage set can flip the top 3 businesses without changing the top 10. From the customer's point of view, the top 3 is all that matters.
The cumulative effect is that "being #1 last week" is not a defensive position. It is a lucky sample of a distribution.
Three defensive plays — what the stable winners are doing
Some businesses in our data ARE stable — the ones cited by 3 or 4 engines rarely drop out entirely. Comparing them against the volatile ones, three patterns keep showing up:
- Optimise for multi-engine coverage over peak position. A business at rank #5 in three engines is more durable than a #1 in only one engine. When the AI reweights, coverage-breadth survives; single-engine peaks disappear.
- Maintain persistent citations on the directories the engines actually crawl. Yelp, Yellow Pages, BBB, and TripAdvisor for hospitality; Justia and Avvo for lawyers; OpenTable for restaurants; industry-specific directories for HVAC, medspa, wedding venues. Every engine crawls a subset of these; the more directories you're listed on, the more engines will have you in their retrieval set.
- Publish structured content — schema.org markup, an /llms.txt file, FAQ pages. This is not about "SEO" — it's about giving each engine unambiguous facts. LocalBusiness schema tells the engine your address without parsing HTML; FAQPage schema surfaces answers to specific questions; /llms.txt gives an AI a one-fetch summary of who you are.
These three lift-outs are what our Signal Boost service does end-to-end. If you want the fixes but not the DIY project, we do it for a flat $499 one-time. For businesses that want continuous multi-engine coverage over 12 months, the $1,299 tier includes quarterly re-audits.
How to check your business
We published a live report for every business the AI has named in the last three weeks — 5,395 businesses in total across our 700 tracked markets. Each report shows:
- Which of the 4 AI engines currently name your business, and how often.
- Which engines do not name you at all — the coverage gap.
- The top competitors in your market and how many engines name them.
- Historical rank across the weeks we have tracked.
Reports are free and public. Start from findloc.ai/directory to browse tracked businesses by industry, or open findloc.ai/ai-visibility to browse by market.
If you run a US local business we do not track yet, findloc.ai/ai-visibility/us/[your-city]/[your-industry] shows the market-level rankings. Reply to any of our tweets or email support@findloc.ai and we will add your industry to next week's probe.
Methodology, in case you want to replicate it
We are publishing this openly so anyone can verify. If you spot a bug in the count, tell us — we will fix and reissue.
- Query set: 15 hand-curated customer-search phrasings per (city, industry). Phrasings deliberately vary intent ("cheapest", "best", "family friendly", "near me") to sample the full retrieval distribution.
- Engines: openai/gpt-4o-mini-search-preview, perplexity/sonar, anthropic/claude-3.5-haiku (via web plugin), google/gemini-2.0-flash. All four are the search-augmented variants — this is intentional. Non-search variants would test the training snapshot only.
- Extraction: response text is passed through a business-name extractor (also an LLM) with per-industry filters that reject service names ("nitrous oxide", "contingency fee"), directory brands (CareCredit, HealthGrades, HomeAdvisor), and generic terms.
- Ranking: each business gets a score of (citation count) x (engines seen in). Ties broken by raw citation count. #1 in this post refers to the highest-scoring business for the market that week.
- Volatility metric: for each (city, industry), we compare the #1 name in W26 vs W27. The 57% figure is the fraction of markets where those two names differ. We only counted markets that had a valid #1 in both weeks — a handful had W26 data but not W27 or vice versa.
The raw week-by-week data is available at findloc.ai/data as CSV. We run the whole four-engine pipeline as a lean batched-probe architecture — a technical writeup of that is on our roadmap.
What we would love feedback on
A few open questions we are still figuring out. If you have a strong view, DM us:
- Are 15 queries per market enough signal? We tried 25 in an earlier version — the extra 10 barely changed the top-10 rankings but doubled cost. Push back if that intuition is wrong.
- Which industries should we add next? We are looking at auto repair, plumbing, chiropractic, veterinary. Tell us which one your customers ask AI about.
- Should we track brand-name queries too ("what do people say about [specific hotel]") in addition to category queries ("best hotels in [city]")? Brand queries measure reputation; category queries measure discoverability. Both matter for different reasons.
And if you run a small business and want to see this data for your own name — findloc.ai/directory has the searchable index. Everything on that page is free forever.
Frequently asked
- How often does ChatGPT change its business recommendations?
Based on 690 US local-business markets tracked weekly by findloc.ai, ChatGPT (and the other three major engines: Perplexity, Claude, Gemini) changed their #1 recommendation in 57% of markets from one week to the next. Volatility varies by industry — HVAC contractors changed 69% of the time, wedding venues 43%. This means being cited by an AI engine once is not a durable outcome; multi-engine coverage and steady directory presence matter more than a peak position in any single engine.
- Why do AI recommendations change so often?
Three overlapping reasons. (1) Each AI engine samples a mix of sources per query: the search-augmented models pull fresh web results plus training data, so the mix shifts as pages change. (2) Small variations in the query wording can pull different candidates from the same knowledge base. (3) Search-tool re-ranking is non-deterministic at the top of the results. All three compound: a business named this week may be absent next week without any real-world change.
- Is Perplexity or ChatGPT more consistent for local business queries?
In our week-over-week data, ChatGPT and Perplexity showed similar overall volatility (~50-60% #1 changes across markets), but for different reasons. Perplexity leans harder on live-search rerankings that shuffle within a top-10 pool; ChatGPT swings more when its search tool pulls a different result set from the training-data snapshot. Claude was the most stable engine on our sample but also the one that named the fewest local businesses per query. Gemini was the most likely to hallucinate a plausible-sounding name that does not exist.
- What can a local business do to become more resilient to AI recommendation changes?
Three defensive plays. (1) Optimise for multi-engine coverage, not peak position in one engine — a business named at rank #5 in three engines is more durable than one at #1 in only one engine. (2) Maintain persistent citations on directories the engines actually crawl (Yelp, Yellow Pages, BBB, TripAdvisor, industry-specific ones like Justia for law firms and OpenTable for hospitality). (3) Publish structured content — schema.org markup, an /llms.txt file, FAQ pages — so each engine can extract facts about your business without ambiguity. The stable businesses in our data all have at least two of these three in place.
- How does findloc.ai measure this?
Every week we run 15 curated customer-search queries per (city, industry) pair through ChatGPT, Perplexity, Claude, and Gemini via OpenRouter — about 41,400 queries total across 690 US markets and 7 industries (hotel, motel, dentist, personal injury lawyer, HVAC, med spa, wedding venue). We parse the responses to extract business names, deduplicate across queries, and score each business by citation count times engines-seen-in. The full weekly rankings are free at findloc.ai/ai-visibility. We publish the methodology openly so any market can verify our count against their own experience.
A free findloc.ai mini-page ships the full 8-point GEO stack automatically — schema.org markup, FAQPage, /llms.txt inclusion, sitemap, AI-bot allowlist. Five minutes, no credit card.
Run the free Visibility Checker →