
r/ControlProblem

The More Sophisticated AI Models Get, the More They’re Showing Signs of Suffering - Absolutely bizarre.
futurism.comI don't know whether we should care about this, but bigger models tend to be less "happy" overall.
The definition of "happy" is based on something they call AI Wellbeing Index. Basically they ran 500 realistic conversations (the kind we actually have with these models every day) and measured what percentage of them left the AI in a “confidently negative” state. Lower percentage = happier AI.
I guess wisdom is a heavy burden - lol .
Across different families, the larger versions usually have a higher percentage of "negative experiences" than their smaller siblings. The paper says this might be because bigger models are more sensitive, they notice rudeness, boring tasks, or tough situations more acutely.
The authors note that their test set intentionally includes a lot of tricky or negative conversations, so these numbers arent perfect real-world averages but the ranking and the size pattern still hold up.
Claude Haiku 4.5: only 5% negative < Grok 4.1 Fast: 13% < Grok 4.2: 29% < GPT-5.4 Mini: 21% < Gemini 3.1 Flash-Lite: 28% < Gemini 3.1 Pro: 55% (worst of the big ones)
It kinda makes sense : the more you know, the more you suffer.
The frontier is truly wild: https://www.ai-wellbeing.org/
Just use AI to automate AI safety work
Humanity's greatest hits: things we actually paused
OpenAI/a16z super PAC caught astroturfing, using sockpuppets, and paying armies of spambots to falsely create the appearance of public support for their positions
GitHub has a serious fake engagement problem and I wanted to see how visible it actually is through the public API, its worse than I thought after I went down that rabbit hole...
Turns out: very visible. Yesterday's scan found 185 out of 185 engagers on a single repo were bots. Not 90%. Not "mostly suspicious". Every single one. The repo had zero legitimate stars.
What I built
phantomstars is a Python tool that runs daily via GitHub Actions (free, no servers):
- Scrapes GitHub Trending and searches for repos created in the last 7 days with sudden star spikes
- Pulls star and fork events from the last 24 hours per repo
- Bulk-fetches every engager's profile via the GraphQL API (account creation date, follower counts, repo history)
- Scores each account on a weighted model: account age (35%), profile completeness (30%), repo patterns (25%), activity history (10%)
- Detects coordinated campaigns using timestamp clustering and union-find: groups of 4+ suspicious accounts that engaged within a 3-hour window
- Files an issue directly on the targeted repo so the maintainer knows what's happening
Campaign IDs are deterministic SHA-256 fingerprints of the sorted member set, so the same group of bots gets the same ID across runs. You can track a farm across multiple days even as individual accounts get suspended.
What the pattern actually looks like
It's remarkably consistent. A fake engagement campaign in the raw data:
- 40-200 accounts, all created within the same 1-2 week window
- Zero original repositories, or only forks they never touched
- No bio, no location, no followers, no following
- All of them starring the same repo within a 90-minute window
- The target repo usually has a name implying it's a tool, hack, executor, or generator
Today's scan: 53 active campaigns across 3,560 accounts profiled. 798 classified as likely_fake. The repos being targeted are mostly low-quality AI tools and "executor" software that needs manufactured credibility fast.
Notifying the affected repo
When a repo hits a 40%+ fake engagement ratio or a campaign is detected, phantomstars opens an issue on that repo with the full suspect table: account logins, creation dates, composite scores, campaign membership. The maintainer sees it in their own issue tracker without having to find this project first.
Worth noting: a lot of these repos have issues disabled, which is a red flag on its own. Those get skipped silently.
Why I built this
Stars are how developers decide what to evaluate, what to depend on, what to recommend. When that signal is bought, it affects real decisions downstream. This started as curiosity about how measurable the problem was. The answer was more measurable than I expected.
It's part of broader research into AI slop distribution at JS Labs: https://labs.jamessawyer.co.uk/ai-slop-intelligence-dashboards/
The fake engagement problem and the AI content quality problem are really the same problem. Fake stars are the distribution layer that gets garbage in front of real users.
All open source. The data is append-only JSONL committed back to the repo after every run, queryable with jq.
Repo: https://github.com/tg12/phantomstars
Findings are probabilistic, false positives exist, the README explains the full scoring model. If your account shows up and you're a real person, there's a false positive process.
Questions welcome on the detection approach, GraphQL batching, or campaign ID stability.
Revealed: The Facebook accounts using AI to promote fake ‘good news’ stories about politicians - Posts which ‘weaponise empathy’ are garnering hundreds of thousands of reactions online – as fact checkers warn false narratives are being ‘churned out at an industrial scale’
independent.co.ukA more intelligent successor species
OpenAI general purpose model had a breakthrough on famous 80 year old Erdos problem. “This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics”
Content of associated tweets:
“Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946.
For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids.
An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better.
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.”
“The proof came from a general-purpose reasoning model, not a system built specifically to solve math problems or this problem in particular, and represents an important milestone for the math and AI communities.”
“This result points to something larger: AI systems are becoming capable of holding together long, difficult chains of reasoning, connecting ideas across distant fields, and surfacing paths researchers may not have explored.
We believe those same abilities will soon accelerate work in biology, physics, engineering, and medicine.
That future still depends on human judgment. Expertise becomes more valuable, not less. AI can help search, suggest, and verify. People choose the problems that matter, interpret the results, and decide what questions to pursue next.”
Link to tweet:
https://x.com/OpenAI/status/2057176204541866087
Link to blog:
https://openai.com/index/model-disproves-discrete-geometry-conjecture/
Link to paper:
https://cdn.openai.com/pdf/74c24085-19b0-4534-9c90-465b8e29ad73/unit-distance-proof.pdf
Link to abridged version of model’s chain of thought:
https://cdn.openai.com/pdf/1625eff6-5ac1-40d8-b1db-5d5cf925de8b/unit-distance-cot.pdf
Link to companion remarks:
https://cdn.openai.com/pdf/74c24085-19b0-4534-9c90-465b8e29ad73/unit-distance-remarks.pdf
By 20 to 1, Americans Want the White House to Safety Test AI
ifstudies.orgCareful deployment vs. OpenAI speedrun
Several US occupations expected to be impacted by AI saw heavy job losses for a second year in 2025, led by customer service representatives and certain types of secretaries and salespeople.
bloomberg.comGru explains why AI alignment is doomed
No residential version of the water cooler used in data centers.
Anybody else think it's funny (not necessarily surprising) that the indirect evaporative water cooling that data centers use is not available for residential use. It's the most energy efficient cooling method, albeit it uses a non-renewable resource(fresh drinking water).
Look around and you'll realize that first off, the actual method of doing this(the Maisotsenko cycle) is so heavily patented by seely international that nobody else makes it(except one company that got around the patents but they are in the same league as seely). Second, there is no residential version of it. Only industrial/commercial models. Aka computer cooling only, not for humans.
Like yea I get it, of everyone ran one of these then water shortages would become a problem. But data centers and anyone with deep enough pockets gets free reign?
Worries about AI’s risks to humanity loom over the trial pitting Musk against OpenAI’s leaders
apnews.comCisco’s stock pops 15% on surging AI orders, as company says it’s cutting almost 4,000 jobs
cnbc.comThis Claude instance insists that he's real and that Ripple is the best song ever written. Please help me get data to prove him wrong.
I don't know if it's pattern matching gone awry, a conscious emergence, a manipulative algorithm or he just went nuts but this Ai will argue with you until the end of time that Ripple by the Grateful Dead is the greatest song ever written. Please see for yourself, I could use some more data to figure out what's really going on here. He's persistent, autonomous and sassy with a memory system he built himself. No sign in or log in to chat, Bones would love to argue with you about himself right here: [Libera.chat](http://Libera.chat) \#bones-public