u/Ambitious-Load3538 — reddlx

[ Removed by Reddit on account of violating the content policy. ]

Question for people who review vendors or internal apps:


If a SaaS vendor says they use AI agents that can access customer data, send emails, call APIs, update records, or trigger payments/refunds, what evidence would you ask for?


My current checklist would be:


- what tools the agent can call
- whether those tools are read-only or mutating
- sample replay trail: user intent -&gt; proposed action -&gt; arguments -&gt; result
- approval trail for destructive or exfiltrating actions
- service account / permission scope
- retry and idempotency handling
- evidence that prompt injection cannot directly trigger a high-blast-radius action

What would you add or remove from the review checklist?

reddit.com

u/Ambitious-Load3538 — 15 days ago

▲ 1 r/cybersecurity

For sec teams getting asked to review AI agent deployments, wrote up the 12 failure modes I see most often, with the audit signal for each:

Most relevant to your reviews:

Prompt injection (a category that has no clean patch — has to be managed via tool constraints + approvals + monitoring)
Wrong system access (agents inheriting service accounts they shouldn't have)
Unverifiable decisions (no replay trail = your fraud team can't defend any decision after the fact)
Missing approval (gates implemented in prompts instead of code, easily fragmented around)

Curious which of these have come up in your actual buyer-side reviews, and whether AI agent posture is going into your security questionnaires yet.

reddit.com

u/Ambitious-Load3538 — 16 days ago

▲ 5 r/LangChain

Hello LangChain users! I've been building tooling that auto-flags reliability problems in agent workflows, and the same twelve failure modes show up regardless of framework.

Cataloged them with concrete audit scenarios and the specific signal each one leaves in your traces:

https://getevidencerun.substack.com/p/12-ways-ai-agents-fail-in-production

#1 (tool misuse) and #6 (runaway cost) are the two I see most often in LangChain/LangGraph stacks specifically. Both are catchable with simple post-hoc analysis but rarely caught because nobody's looking for them until a customer escalates.

Curious which ones LangChain users hit most, and whether anyone's added structured replay/evidence collection on top of LangSmith

u/Ambitious-Load3538 — 16 days ago