IsItBullshit: Comparing multiple AI models actually improves reliability?
I’ve been experimenting more with AI tools lately for research and writing, and one thing I keep noticing is how differently models answer the exact same prompt.
Sometimes they mostly agree, but other times the reasoning is completely different even when all the answers sound confident.
Because of that, I started comparing multiple outputs more often instead of trusting one response. Recently I’ve been testing askNestr for this since it lets me view multiple model responses together.
What surprised me is that disagreements between models sometimes reveal weak assumptions or uncertainty way faster than fact-checking one answer alone.
But I honestly can’t tell if this is actually a smarter workflow or if it just creates the illusion of better reliability because multiple AIs are involved.
People who use AI heavily is this genuinely useful, or mostly placebo?