Are you actually using a Prompt Injection Firewall, or is it mostly hype?
Hey everyone,
I'm working on a production app that hooks an LLM up to external APIs (tools/function calling), and the threat of indirect prompt injection is starting to give me gray hairs.
I’ve seen a bunch of startups and open-source tools popping up offering "LLM Firewalls" or "Prompt Guardrails" to intercept inputs/outputs and filter out malicious instructions.
But looking at it practically, it feels a bit like a game of whack-a-mole. I'm trying to figure out if these tools are actually worth integrating, or if standard software security practices are enough.
For those of you with LLMs in production:
- Are you actually using a dedicated prompt injection firewall? (If so, which one, and has it actually caught anything?)
- Or are you just relying on classic security? (e.g., strict system prompts, strict output parsing, sandboxing code execution, and treating all LLM outputs as untrusted user input).
I’d love to hear some real-world perspective before I go adding another layer of complexity to our stack. Cheers!