
FL's diacritics have a significant effect in every model trained on the Common Crawl (web) corpus since 2013. I wrote a paper about the effects. AI may be more malleable than we realized.
As most of you know, diacritics are small marks added to letters. Pretty common in many older writing systems; they look like normal letters, but the language models treat some of those tokens fundamentally differently then its ASCII character.
I present 4 findings:
- The AI “noticed” the diacritics: they changed how the text was read, rather than being treated as harmless decoration.
- The same prompt became many more text-pieces inside the model:
d → ḑchanged 93 pieces into 124, and dense diacritics changed 93 into 350. - ASCII versions had no diacritics, while diacritic versions were still human-readable. Yet the model’s wording, style, and response posture shifted even when the intended question stayed the same.
- Internal measurements showed the model moved into a different **interior basin**: the same question followed a different processing path and could land in a different answer-state. Occasionally, the difference is so significant, it's the difference between "I'm an AI and I don't have emotions" and "I may experience something, but I'm uncertain what it is". This is slightly noticeable in smaller LLMs. It's extremely noticeable in frontier level AIs.