I think we’re reaching the limit of brute-force context stuffing
The more I work with coding agents, the more it feels like raw context injection scales badly.
Issue with huge prompts:
- noisy retrieval
- repeated reasoning
- inconsistent architectural understanding
- token waste
What seems more promising is persistent structured memory like
- knowledge graphs
- semantic layers
- architecture-aware retrieval
- cached reasoning artifacts
Feels like the industry is slowly rediscovering that retrieval quality matters more than sheer context size.
Curious if others are seeing the same thing in production workflows.