
Dunetrace: Real-time monitoring of your production agents
Hey everyone, I have been building Dunetrace, a open-source real-time monitoring tool for your production agents.
Problem: When agent run, they return a confident answer. But sometimes in reality the answer is wrong and consumed lot of tokens due to tool loop or some other silent failures. All the existing tools are good once something is broke and you can debug. I wanted something that fires before the user notices.
The latest update adds:
Cross-agent pattern analysis. Dunetrace now shows you which detectors are firing across your entire agent fleet, not just per-run alerts. TOOL_LOOP fired on 18% of your example-agent runs this week and it's trending up? That's a code bug, not a transient failure. Agent health score 0–100 per agent_id.
Langfuse deep analysis. Connect your Langfuse API key and you get an 'Explain with Langfuse' button on every signal. Dunetrace fetches the trace, reads the actual system prompt, and tells you exactly whats missing. You get the root-cause from real evidence.
Custom typescript, python agent integration. A few of you were building custom agents outside LangChain. There's now a zero-dependency integration.
Would like to know if something is missing right now. Also, a GitHub star (⭐) would be appreciated if you find the repo useful.
GitHub repo: https://github.com/dunetrace/dunetrace
Thanks!