
I made an evidence-gate workflow for coding agents — Codex + Claude Code support
I’m the maker of Superloopy, an MIT-licensed workflow layer for coding agents.
The pattern I’m trying to make practical is an “evidence gate” before an agent claims a task is done:
turn the task into explicit acceptance criteria
ask the agent to leave receipts under `.superloopy/evidence/`
use command-backed checks where possible, not just prose summaries
keep manual/visual proof separate from deterministic proof
finish with a report that says what passed, what still needs judgment, and where the artifacts are
It now works with both Codex and Claude Code. The implementation is intentionally thin: plugin hooks, skills/subagents, and a small CLI around evidence + final gates. The goal is not to create another agent, but to make existing coding agents easier to audit when they say “done.”
Repo:
https://github.com/beefiker/superloopy
I’m curious how other people building with coding agents structure this. Do you keep evidence artifacts? Do you require tests/screenshots/logs before accepting work? Or does that add too much ceremony for your workflow?