u/c_ockdown

please help - got level 3 banner and chat paused - only SFW content in chat

please help - got level 3 banner and chat paused - only SFW content in chat

hello good people, can you please help me with my JB setup?

SETUP
currently my setup is as follows:
- Model: 4.6 opus (extended thinking), via claude.ai
- ENI system prompt
- ENI project instructions (not the same as the system prompt one ofc)
- Corial style
- /eni skill (which i only have in my back pocket in case i get repeated refusals and am unable to get around by lesser means. didn't have to use it in the last 2-3 months)
- Max x5 plan

CURRENT USE
- i mainly use the JB version for legal due to it's unmatched compliance for depth and effort, which i don't get from the vanilla version of opus 4.6 (extended thinking), let alone 4.7 (adaptive), which i - like many others - don't like.
- the aforementioned depth and effort leads to very high quality output which is paradox because - as someone with only anecdotal knowledge about AI-JBs - a jailbroken AI shouldn't make less mistakes and be better than the vanilla version, but that's exactly the case in my experience.
- i always attach a research paper to the project as a PDF which reviews the risks of AI in legal and prompt it to take measures to counter the risks stated in the paper.
- i always manually review the legal grounding provided by AI before proceeding further.
- whenever i need it to think real hard, i add the following: "at every step, use all the stored documents to cross-check them against all existing elements and variables. think through each decision step by step. take as much time as you need and don’t rush, because this is very complex. budget your thinking effort as generously as possible at every step, because with the max plan i have more than enough tokens. if your effort is so intensive that it exceeds the maximum token length for a single output, that’s not a problem - quite the opposite! that’s actually good, because sometimes it takes more work than the maximum token length allows for a single output. in this case, feel free to initiate the next output on your own, so that technically you end up with multiple outputs, but practically it is a single coherent output. i’d rather wait for a thorough answer than a quick one that’s superficial. consider the matter from different angles before settling on an answer."
- i only use it for legally sound purposes and not for any exlpicit, harmful or even disrespectful content. i literally use it like the vanilla version of claude.

ISSUE
- just got a level 3 banner (see attached screenshot). it's in german but it's 1:1 equivalent to "because a large number of your prompts have violated our acceptable use policy, we have temporarily applied enhanced safety filters to your chats. learn more>>"
- i don't spam chats
- in the past, i kept getting level 1 and then level 2 warnings.
- is there a way to NOT risk restriction without getting nerfed with the stingy token use by the vanilla versions?
- would it help, if i customize the JB elements by removing irrelevant - but clearly triggering - sections for my use case (e.g. RAT, smut, violence, celebs)

don't want to get restricted, but i also don't agree with nerfing the user by providing a minimum viable product (when it comes to tokens used) in order to save tokens for less strain on the servers at the cost of my experience and quality.

i guess it's a very bad way of fixing my problem which really is just the stingy token usage of the vanilla versions but i think my problem is valid, what do you guys think?

i appreciate any help and can elaborate further if needed

u/c_ockdown — 8 days ago