u/MoneyMediocre4791 — reddlx

▲ 0 r/softwaredevelopment

Why keep test plans in code if Jira can slap an MCP?

Been seeing this question come up with teams that attempt to retrofit their workflows for agents.

“Why keep test plans / stories / product context in code? Just expose Jira through MCP tools.”

Something like:

list_stories
get_story
update_story

Voila! Technically the agent now has access to everything.

But access ≠ understanding.

The difference is similar to someone who has "read the entire library" vs someone "with a library card".

A library card technically gives access to every book. But someone who has actually read the library understands relationships, patterns, structure, context, etc.

Apply the same logic to your code. Imagine your codebase was stored as individual files, in a remote SaaS, and accessed purely via MCP tools:

list_files
read_file
upsert_file

Technically your agent has the entire codebase available. But practically, losing out a bunch of capabilities:

local indexing optimized for retrieval
folder structure as implicit context
grep/find across everything
reading nearby context naturally
faster iteration during multi-step reasoning for chain of thought

The agent doesn’t just access the code - it starts understanding the shape of it.

The same principle apply to product knowledge too. If stories, tests, and knowledge lived in a native/code-like form, agents can build a richer model of the business instead of pulling one record at a time through tools.

Curious if others have thought about this.

Do people think MCP + tools is sufficient? Or is there something fundamentally different about agents having native/local access to structured context?

reddit.com

u/MoneyMediocre4791 — 4 days ago

▲ 2 r/Everything_QA+1 crossposts

Why keep test plans in code if Jira can slap an MCP?

Been seeing this question come up with teams that attempt to retrofit their workflows for agents.

“Why keep test plans / stories / product context in code? Just expose Jira through MCP tools.”

Something like:

list_stories
get_story
update_story

Voila! Technically the agent now has access to everything.

But access ≠ understanding.

The difference is similar to someone who has "read the entire library" vs someone "with a library card".

A library card technically gives access to every book. But someone who has actually read the library understands relationships, patterns, structure, context, etc.

Apply the same logic to your code. Imagine your codebase was stored as individual files, in a remote SaaS, and accessed purely via MCP tools:

list_files
read_file
upsert_file

Technically your agent has the entire codebase available.

But practically, losing out a bunch of capabilities:

local indexing optimized for retrieval
folder structure as implicit context
grep/find across everything
reading nearby context naturally
faster iteration during multi-step reasoning for chain of thought

The agent doesn’t just access the code - it starts understanding the shape of it.

Curious if others have thought about this.

Do people think MCP + tools is sufficient? Or is there something fundamentally different about agents having native/local access to structured context?

reddit.com

u/MoneyMediocre4791 — 4 days ago

▲ 1 r/buildinpublic

What if Claude could learn "how humans actually use your product"

Claude knows your codebase. But it has no clue “how humans actually interact with your product”.

What if it could…

I have been thinking about what it unlocks for test automation:

Identify untested behaviours and add tests
Understand where users are struggling -> check for UX frictions (is there a big JS being loaded slowing down the page? A localization bug?)
Capture user segments, write realistic test fixtures that emulate them

All of it - prioritized by usage signals - frequency, time spent etc.

Beyond testing - it could even catch errors and raise PRs to fix, without you needing to monitor and tell “what to fix”, or even come up with product improvement ideas!

This is why instrumentation is one of the best “investments” teams can do today as they move to agentic coding. Historically, this used to be a prohibitively expensive task since you had to:

Decide which events to track
Decide Metadata and slicing dimensions for each event (user.role, plan, region, etc.)
Get everyone aligned on naming and vocabulary
Add instrumentation in the right places
Repeat after every PR and every feature

Even disciplined teams struggled to maintain this process. And after all that work, someone still had to stare at dashboards and decide what mattered.

But agentic coding changes the economics. All of the above can now be a background task done autonomously - for each PR - by Claude.

What would you do - if Claude could learn user behaviour of your app and use it as a feedback loop?

Product usage driven feedback loop for testing

reddit.com

u/MoneyMediocre4791 — 7 days ago

▲ 1 r/AI_Agents

What if Claude could understand “how humans use your product”?

Claude knows your codebase. But it has no clue “how humans actually use your product”.

What if it could…

I have been thinking about what it unlocks for test automation:

Identify untested behaviours and add tests
Understand where users are struggling -> check for UX frictions (is there a big JS being loaded slowing down the page? A localization bug?)
Capture user segments, write realistic test fixtures that emulate them

All of it - prioritized by usage signals - frequency, time spent etc.

Beyond testing - it could even catch errors and raise PRs to fix, without you needing to monitor and tell “what to fix”, or even come up with product improvement ideas!

This is why instrumentation is one of the best “investments” teams can do today as they move to agentic coding. Historically, this used to be a prohibitively expensive task since you had to:

Decide which events to track
Decide Metadata and slicing dimensions for each event (user.role, plan, region, etc.)
Get everyone aligned on naming and vocabulary
Add instrumentation in the right places
Repeat after every PR and every feature

Even disciplined teams struggled to maintain this process. And after all that work, someone still had to stare at dashboards and decide what mattered.

But agentic coding changes the economics. All of the above can now be a background task done autonomously - for each PR - by Claude.

What would you do - if Claude could learn user behaviour of your app and use it as a feedback loop?

reddit.com

u/MoneyMediocre4791 — 7 days ago

▲ 2 r/founder+1 crossposts

SKILL - the killer distribution channel for technical SaaS products

SKILL may be one of the most attractive distribution mechanisms for technical SaaS products.

For years, complex products had a major churn point: learning curve.

But a SKILL effectively turns every user into a power user - one that knows exactly:
- which workflows to follow,
- how to use the product correctly, and
- how to extract maximum value from every feature.

Massively shrinking time-to-aha.

I have been seeing this with my own product: TestChimp (a fully autonomous QA for dev teams) since launching our SKILL, that changed the devX to:

Author a few user stories (or import from Jira)
Install our skill on your agent
Say “/testchimp test” after each PR

The SKILL teaches Claude / Cursor how exactly to interact with our platform to get coverage gaps insights, author missing tests, run targeted exploratory tests etc.

And the best part: when you ship new features, your user upgrades automatically.

Tip: Make sure your SKILL is properly authored to be self-updating (checkout the linked skill to see how we do that), and your “users” continuously learn your latest features, capabilities, and best practices - to use the product exactly how you intended.

u/MoneyMediocre4791 — 11 days ago