Open-source CLI for repeatable prompt-injection and jailbreak testing
Sharing RedThread, an open-source CLI for LLM red-team campaigns:
https://github.com/matheusht/redthread
It is meant for repeatable testing, not one-off prompt lists.
Current flow:
- generate attacks with PAIR, TAP, Crescendo, or GS-MCTS
- run multi-turn traces
- score the trace with JudgeAgent/rubrics
- generate candidate guardrails for confirmed failures
- replay exploit and benign cases before saving evidence
It also has checks for agentic cases like tool poisoning, confused deputy behavior, canary propagation, and budget amplification.
Useful if you are testing system prompts, comparing attack strategies, or trying to turn a prompt-injection failure into a regression case.
I am looking for safe fixture categories and scoring rubrics, not raw jailbreak dumps.