
Most Multi-Agent Failures Aren’t Hallucinations — They’re Inherited Assumptions
After working with long-context and multi-agent workflows for a while, I’ve started noticing that many “LLM failures” aren’t really hallucinations in the usual sense.
They’re inherited assumptions.
Agent A makes a weak assumption.
Agent B inherits it as contextual truth.
Agent C optimizes around it for coherence.
At that point the system can look highly intelligent while reasoning around a premise nobody ever re-validated.
What surprised me is how consistently this appears in:
- agent chains
- long-context workflows
- memory-heavy systems
- retrieval pipelines
- orchestration frameworks
The common pattern seems less related to prompting quality and more related to uncontrolled reasoning state propagation.
A few mitigation patterns that helped significantly:
- forcing assumption enumeration before major decisions
- inserting verification boundaries between agents
- segmented execution contexts
- explicit uncertainty injection
- passing validated summaries instead of raw conversational history
Ironically, many advanced users seem to independently converge toward similar workflows:
smaller scoped tasks, isolated reasoning states, controlled memory propagation.
I documented some of these patterns and mitigation protocols in a free technical guide while experimenting with long-context stability and reasoning reliability.
Curious whether others building multi-agent systems have observed similar “assumption propagation” failures.