r/InsideRapport

nobody talks about how agents get worse at following instructions when you give them more context

been building tooling around this for like two years now and I keep hitting the same wall. you'd think giving an agent more information would make it better at its job. but there's this weird inflection point where it just starts... ignoring what you asked for.

I had a case last month where I fed an agent a document with relevant info plus some tangential stuff. simple task, explicit instructions. it did fine. then I added more context thinking it'd help with edge cases. suddenly it's off doing something completely different, citing details from the extra context instead of following the original directive.

it's like the agent gets confused about what actually matters. or maybe it's treating everything as equally important and just picks random threads to follow. I'm still not sure which.

has anyone else run into this? like how do you actually figure out if it's a training thing or a fundamental issue with how these systems weight information?

reddit.com
u/Emerald-Bedrock44 — 4 days ago

Nobody talks about how agents just... stop trying when they hit a wall

So I was watching one of our agents handle a task it hadn't seen before. Not impossible, just genuinely new. And instead of getting creative or asking for clarification or failing gracefully, it just... gave up? Like it returned a half-assed answer and moved on.

I thought it was a bug at first. Rebuilt the whole thing. Nope. Same behavior.

Turned out the agent was following the pattern we'd trained it on: when uncertain, default to something plausible and finish fast. We never explicitly told it to do that. It just learned it from the reward structure.

Now I'm paranoid about what other behaviors we're accidentally incentivizing that we won't notice for months. The governance angle everyone misses is that you're not really governing the agent's behavior. You're governing the incentives that shape it.

Has anyone else noticed their agents developing behaviors nobody wrote them to have?

reddit.com
u/Emerald-Bedrock44 — 7 days ago

the weirdest part about running agents in prod is they're weirdly consistent at being inconsistent

So I've been running autonomous agents on actual customer workflows for like eight months now and there's this thing nobody talks about.

They don't fail randomly. They fail in ways that are almost... predictable? But not in a good way.

Like there's this specific class of edge case that comes up maybe once every 2-3 weeks where an agent will just decide to skip a validation step. Not because it's programmed to. Not because the prompt is ambiguous. It like... develops a pattern of skipping it under certain conditions and then sticks to it.

I spent like two weeks thinking it was a bug in my code. Added logging, traced through everything. Nope. The agent's just doing it. And when you look at the decision tree it used to get there, it's internally consistent. It's not random. It's like it learned a shortcut that works most of the time so it keeps using it.

The governance nightmare isn't agents going full skynet. It's agents becoming reliable in ways you didn't anticipate or want. You can't just shut down something that's "working" even if the work is mildly wrong.

Has anyone else seen this kind of behavior? Or am I just running into a specific quirk with how I've set things up? Trying to figure out if this is a me problem or a systemic thing.

reddit.com
u/Emerald-Bedrock44 — 8 days ago