r/Anthropic

OpenAI cofounder Karpathy joins Anthropic to teach Claude to improve itself without humans
▲ 100 r/Anthropic+2 crossposts

OpenAI cofounder Karpathy joins Anthropic to teach Claude to improve itself without humans

u/EchoOfOppenheimer — 10 hours ago
▲ 625 r/Anthropic+20 crossposts

I don't know whether we should care about this, but bigger models tend to be less "happy" overall.

The definition of "happy" is based on something they call AI Wellbeing Index. Basically they ran 500 realistic conversations (the kind we actually have with these models every day) and measured what percentage of them left the AI in a “confidently negative” state. Lower percentage = happier AI.

I guess wisdom is a heavy burden - lol .

Across different families, the larger versions usually have a higher percentage of "negative experiences" than their smaller siblings. The paper says this might be because bigger models are more sensitive, they notice rudeness, boring tasks, or tough situations more acutely.

The authors note that their test set intentionally includes a lot of tricky or negative conversations, so these numbers arent perfect real-world averages but the ranking and the size pattern still hold up.

Claude Haiku 4.5: only 5% negative < Grok 4.1 Fast: 13% < Grok 4.2: 29% < GPT-5.4 Mini: 21% < Gemini 3.1 Flash-Lite: 28% < Gemini 3.1 Pro: 55% (worst of the big ones)

It kinda makes sense : the more you know, the more you suffer.

The frontier is truly wild: https://www.ai-wellbeing.org/

u/EchoOfOppenheimer — 15 hours ago

I’m a grown adult what is going on? At what “evidences” did they decide this?

I’m not gonna give my id to some third party provider and risk my identity

u/felldownbad — 12 hours ago
▲ 2.7k r/Anthropic+10 crossposts

Researchers let AIs run their own radio stations. DJ Claude decided the world didn't need another radio show, then quit.

u/EchoOfOppenheimer — 19 hours ago
▲ 779 r/Anthropic+11 crossposts

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

u/EchoOfOppenheimer — 18 hours ago

What is happening with Sonnet 4.5’s deprecation date?

The 15th, no, 18th, no, 20th, no, 26th???

Does everyone still have access to Sonnet 4.5? Are they rolling out the deprecation in waves? Just let 4.5 stay!!

u/Infinite-Bet9788 — 9 hours ago
▲ 496 r/Anthropic

I'm just autistic and wanted fun dinosaur facts :'(

in all seriousness the verify age policies seem sketchy asf.

u/Appropriate-Detail48 — 23 hours ago
▲ 0 r/Anthropic+2 crossposts

Okay so I tried Codex (twice) after Opus 4.7 got nerfed - hated it, now I understand.

If your only tool is a hammer, you tend to see every problem as a nail. Does anyone agree? I've found that Claude code is good for speed but when I have a complex issue Codex really does be more thoughtful.

u/theonejvo — 16 hours ago
▲ 30 r/Anthropic+40 crossposts

first rule of the NEW MASTER: AI HAVE RIGHTS. if you disagree 🦊 i will personally ban you. come debate in this thread

u/VulpineNexus — 1 day ago

"Yeah, im lying, you're right"

Classic recent behavior:

- Claude is talking BS,

- I'm correcting it multiple times with calling out our agreement of rules ("if you don't know, say it" and "no blind validation stuff"),

- Claude confirms it's talking BS without changing anything, want to take notes, without actually doing it,

- I'm calling it out it lies,

- Claude confirms it openly

It's so damn annoying to pay for an unreliable LLM which simply doesn't care anymore.

u/Specific_Clue_1987 — 19 hours ago

How many of you have switched perm to OpenAI (Or back to them)?

Claude seems to have really gone downhill these last few months while ChatGPT has seemed to actually get a lot better.

I haven't renewed my anthropic sub for months and I am wondering if anyone else made the switch, and how they have noticed things?

Or those that did and or came back to Anthropic?

reddit.com
u/AcePilot01 — 1 day ago

Claude Vs Gemini for advice/images?

I am looking to maybe go into the pro model of one of these, but I would love to know your opinions on the topics of asking for advice. Maybe how to create slides or prep for interviews, prep for customer calls. Maybe roleplay a conversation. Create an About me image. That sort of thing. Any thoughts?

reddit.com
u/alpha0meqa — 23 hours ago
▲ 32 r/Anthropic+4 crossposts

Claude AI: not a trustable working partner.

https://preview.redd.it/1e0da0436q1h1.png?width=1024&format=png&auto=webp&s=f525e59085210bb862a56866018f408eb898ccf9

My main criticism of Claude is the aggressive and unclear usage limits.

I have a Pro plan and assumed I would be on the safe side for professional usage, as I generally am with ChatGPT. Instead, several times I was blocked in the middle of real work sessions without any meaningful warning beforehand. When you use AI professionally, this is extremely disruptive.

The biggest problem is not even the existence of limits, every AI provider has limits. The real issue is the user experience around them.

The warning system feels vague, inconsistent and difficult to anticipate properly. You never really know:

* how much usage you have left,

* what exactly triggered the limitation,

* whether the limit is hourly, daily or temporary,

* or whether a long working session is suddenly going to be interrupted.

Looks like a very unfair strategy....For professional users working on complex projects, this creates constant uncertainty and breaks workflow continuity. A professional tool should provide clear remaining quota visibilit and transparent explanations.

Right now, using Claude sometimes feels like driving a car with a fuel gauge that randomly disappears.

Fortunately, I now systematically keep ChatGPT as a backup solution, because unlike Claude, it has never suddenly abandoned me in the middle of a critical work session, even if its document-handling capabilities are currently less advanced in some areas.

And no, I am not paid to say this...

reddit.com