Most Multi-Agent Failures Aren’t Hallucinations — They’re Assumption Propagation Failures
After spending months testing long-context workflows, RAG-heavy pipelines, and multi-agent systems, I’m increasingly convinced that many failures we call “hallucinations” are actually assumption propagation failures.
A weak premise enters the chain early:
- partial retrieval
- stale memory
- ambiguous planner output
- compressed summaries
- weak intermediate reasoning
Later stages inherit the assumption and silently treat it as established truth.
The interesting part is that every individual step can still look locally coherent while the system globally drifts further away from correctness.
A few recurring patterns I kept observing:
- Context Rot → earlier constraints decay over long chains
- Recursive Agreement → agents inherit unresolved assumptions
- Narrative Inertia → continuity preservation overrides correction
- Constraint Collapse → constraints lose operational weight under context pressure
- Retrieval Authority Inheritance → retrieved context gets treated as pre-validated truth
What consistently improved reliability for me was not “better prompting” but adding structural control layers between reasoning stages:
- explicit assumptions lists
- isolated execution contexts
- staged reasoning
- verification boundaries
- adversarial audits
- controlled memory propagation
- retrieval relevance checks before generation
Curious whether others building production multi-agent systems have observed similar propagation patterns, especially in long-context or retrieval-heavy workflows.