I’m done paying for LLMs until they learn token efficiency
Watching my agent spin in circles, re-explaining the same steps over and over before maybe doing something useful. I keep telling it use “/caveman full” skill — short, direct, no fluff — and it just ignores me. More verbose walls of text. More wasted tokens.
Why aren’t these models trained for agentic efficiency? Punish every unnecessary token. Reward getting the job done fast. Until then, I’m not paying to watch an LLM burn money while pretending to think.