How many reruns/prompts are you using for GEO / AI visibility tracking?
Curious how people here are handling reruns / sampling methodology for GEO / AI visibility tracking.
When you measure brand visibility in ChatGPT, Perplexity, Gemini, Claude etc.:
- how many prompt variants are you using per intent cluster?
- how many reruns per prompt?
- how often do you refresh measurements (daily, weekly, realtime)?
Right now I’m testing:
- ~15 prompt variants
- 3 reruns
- daily/weekly tracking
- limited reruns
But I’m noticing pretty high variance between sessions and models, especially in ChatGPT and Perplexity.
Trying to figure out what people consider statistically “good enough” before drawing conclusions for clients.
Feels like a lot of GEO tooling is still using methodologies that are way too small to be reliable.