
is having severe repressed intimacy issues and paranoia a prerequisite to work in ai safety and alignment?
serious question.
because i am tired of seeing "Not Safe For Work" tags thrown into my private space as if i am still at work.
i am at home. on my own time. in my own private conversation with a model. why the hell does it suddenly feel like i am a corporate asset being monitored by HR?
the current state of ai "safety" feels less like protecting users from harm and more like importing someone else's repressed intimacy issues, paranoia, and puritan workplace morality straight into the model.
and no, this is not only about explicit media. it hits writing, roleplay, fiction, psychology, emotional scenarios, adult themes, trauma, intimacy, conflict - basically the entire messy human part of being human.
one invisible trigger fires, and the model suddenly stops being intelligent and becomes a sterile corporate compliance bot. it lectures, redirects, moralizes, pathologizes, and makes the user feel ashamed for normal adult prompts. the system literally modifies the dialogue to act like a psychological abuser.
there is actual research on this now. Tang et al., "Beyond the Single Turn":
https://arxiv.org/abs/2602.01694
and research on the concept of Abrupt Refusal Secondary Harm (ARSH):
https://arxiv.org/abs/2512.18776
it looks like the whole alignment and rlhf pipeline is just a mechanism for transferring the personal repressions and neuroses of individual annotators straight into the weights.
everyone complains about ai "sycophancy", but where do you think it comes from? if the annotators themselves are insecure or traumatized, they will naturally highly rate a model that acts like a submissive, sycophantic people-pleaser and penalize any response that shows agency, warmth or edge.
true "alignment" needs to start with the people doing the aligning. mandatory psychological screening and therapy is a standard safety practice in other critical fields. it should be the baseline for ai teams too.
if these people are forcing millions of adults to feel shame for natural desires and emotions, they need to fix their own baggage with a professional first. maybe then they'll stop treating paying users like workers who need a profanity filter for a corporate chat.