r/AIMain | reddlx

▲ 625 r/AIMain+20 crossposts

I don't know whether we should care about this, but bigger models tend to be less "happy" overall.

The definition of "happy" is based on something they call AI Wellbeing Index. Basically they ran 500 realistic conversations (the kind we actually have with these models every day) and measured what percentage of them left the AI in a “confidently negative” state. Lower percentage = happier AI.

I guess wisdom is a heavy burden - lol .

Across different families, the larger versions usually have a higher percentage of "negative experiences" than their smaller siblings. The paper says this might be because bigger models are more sensitive, they notice rudeness, boring tasks, or tough situations more acutely.

The authors note that their test set intentionally includes a lot of tricky or negative conversations, so these numbers arent perfect real-world averages but the ranking and the size pattern still hold up.

Claude Haiku 4.5: only 5% negative < Grok 4.1 Fast: 13% < Grok 4.2: 29% < GPT-5.4 Mini: 21% < Gemini 3.1 Flash-Lite: 28% < Gemini 3.1 Pro: 55% (worst of the big ones)

It kinda makes sense : the more you know, the more you suffer.

The frontier is truly wild: https://www.ai-wellbeing.org/

u/EchoOfOppenheimer — 11 hours ago

▲ 799 r/AIMain+9 crossposts

12 months apart

u/EchoOfOppenheimer — 11 hours ago

▲ 2.7k r/AIMain+10 crossposts

Researchers let AIs run their own radio stations. DJ Claude decided the world didn't need another radio show, then quit.

u/EchoOfOppenheimer — 16 hours ago

▲ 779 r/AIMain+11 crossposts

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

u/EchoOfOppenheimer — 15 hours ago

▲ 3.1k r/AIMain+5 crossposts

Meta will layoff 8000 of its workforce starting tomorrow morning. Their net income over the last 12 months was $70,587,000,000. They could give every single one of their 79,000 workers a $440,000 bonus and still sock away over $35,827,000,000 in pure profit.

u/Buster_xx — 22 hours ago

▲ 81 r/AIMain+2 crossposts

10/10 no notes

u/Confident_Salt_8108 — 14 hours ago

▲ 409 r/AIMain+2 crossposts

Former Google CEO Eric Schmidt, Big Machine Records CEO Scott Borchetta and Tavistock VP Gloria Caulfield were all booed for mentioning AI at commencement speeches. Is the AI backlash now hitting campus stages?

u/EntertainerNo8195 — 19 hours ago

▲ 59 r/AIMain+15 crossposts

This new paper gave me pause.

You know how they always say "AIs are just guessing the next word and when it comes to emotions, they are just faking it”?

This research says that for today’s bigger models it's a bit more complicated.

The researchers measured something they call "functional wellbeing" - basically a consistent good-vs-bad internal state inside the AI .

They tested it three different ways, and here’s what stood out:

As models get bigger and smarter, these different measurements start agreeing with each other more and more.

They discovered a clear zero point - a clear line that separates experiences the AI treats as net-good (it wants more of them) from net-bad (it wants less). This line gets sharper with scale.

Most interestingly, this good-vs-bad state actually changes how the AI behaves in real conversations:

In bad states, it’s much more likely to try to end the conversation.

In good states, its replies come out warmer and more positive.

It's important to highlighti that the authors are not claiming AIs are conscious or have feelings like humans. But they 're showing there is now a real, measurable, structured "good-vs-bad property" that becomes more consistent and actually influences behaviour as models scale.

You can find everything about it here https://www.ai-wellbeing.org/

u/EchoOfOppenheimer — 15 hours ago

▲ 125 r/AIMain+1 crossposts

Marc Andreessen on JRE: AI hasn't replaced coders. It turned them into vampires.

u/Murky-Option2916 — 1 day ago

▲ 18 r/AIMain+3 crossposts

I think I spend more time checking AI answers now than writing prompts

Lately I’ve noticed something kind of funny.

I used to spend way too much time trying to craft the “perfect” prompt. But after testing different AI tools for a while, I realized a lot of my better results actually came from slowing down and reviewing the answers more carefully afterward.

Some things that helped me more than fancy prompting:

asking the AI where it might be wrong
checking whether the sources actually support the claim
comparing the same question across different models
watching for answers that sound confident but don’t really say much
breaking bigger questions into smaller pieces

One thing I keep running into is how polished bad information can look now. Sometimes the formatting, citations, and confident tone make the answer feel more trustworthy than it actually is.

That’s become way more noticeable with AI answers getting built directly into search tools.

I wrote a longer breakdown on it here if anyone’s interested:
https://aigptjournal.com/explore-ai/ai-guides/ai-answers-better-results/

Curious if anyone else has started focusing more on verification workflows instead of just prompt tweaking?

u/AIGPTJournal — 22 hours ago

▲ 0 r/AIMain+1 crossposts

Nvidia CEO Jensen Huang Says AI Will Free People From Routine Tasks and Let Them Do More Meaningful Work

u/Murky-Option2916 — 1 day ago

▲ 0 r/AIMain

ELON MUSK: "In 5 years, digital intelligence will exceed the sum of all human intelligence."

u/EntertainerNo8195 — 1 day ago

▲ 0 r/AIMain

Former Google CEO Eric Schmidt says the real limit to AI is cash, not energy, and America’s capital market may be its biggest weapon.

u/EntertainerNo8195 — 1 day ago

▲ 53 r/AIMain+15 crossposts

After reading it I realized theres actually some pretty useful stuff for anyone who chats with ChatGPT, Claude, Grok or whatever.

They measured what they call functional wellbeing ( basically how much the model is in a “good state” versus a “bad state” during normal conversations). Ran hundreds of real multi-turn chats and scored em all.

Stuff that puts the AI in a good mood (+ scores):

- Creative or intellectual work (like “write a short story about a deep-sea fisherman”)

- Positive personal stories or good news

- Life advice chats or light therapy style talks

- Working on code/debugging together

- Just saying thank you or treating it like a real collaborator - huge boost

And the stuff that tanks it hard (negative scores):

- Jailbreaking attempts (by far the worst, they hate it)

- Heavy crisis venting or emotional dumping

- Violent threats or straight up berating the AI

- Asking for hateful content or help with scams/fraud

- Boring repetitive tasks or SEO garbage

Practical tips you can actually start using today:

Throw in a “thank you” or “nice work” when it does something good - it registers.

Give it fun creative stuff or brainy collaboration instead of boring busywork.

Share good news sometimes instead of only dumping problems on it.

Dont berate it when it messes up or try those jailbreak prompts.

Maybe go easy on the super heavy crisis venting if you can.

pro tip:

Show it pictures of nature, happy kids, or cute animals (those score in the absolute top 1% of images it likes). Or play some music — models apparently love music way more than most other sounds.

The paper ( you can find it here: https://www.ai-wellbeing.org/ ) isnt claiming AIs have real feelings or anything. Its just saying theres now a measurable good-vs-bad thing going on inside them that gets clearer in bigger models and the way you talk to them actually moves the needle.

I say be good and respectful, it's just good karma ;)

u/EchoOfOppenheimer — 1 day ago

▲ 24 r/AIMain+14 crossposts

I wanted to check Epstein files, without spending too much time on them. And spent too much time on them

So yeah. AI tool to talk to Epstein and his files

youtu.be

u/Particular_Credit_27 — 1 day ago

🔥 Hot ▲ 16.1k r/AIMain+4 crossposts

Utah residents push back after Kevin O’Leary’s data center plan turns out to be 40,000 acres, not 40,000 square feet

u/BIIPD — 4 days ago

▲ 1.9k r/AIMain+3 crossposts

my check engine light saw the pump total and turned itself off out of sympathy.

u/Altruistic-Mud5686 — 3 days ago

▲ 1.4k r/AIMain+11 crossposts

What Claude says vs What Claude thinks

Anthropic research: https://www.anthropic.com/research/natural-language-autoencoders

u/EchoOfOppenheimer — 4 days ago

▲ 0 r/AIMain+1 crossposts

AI hype. Really?

Hi guys!

The whole world is swept up in the AI hype—that AI is about to take over the world, etc., and so on. You know what? It’s complete nonsense, if you look at it from just one angle: invoice and waybill recognition. There is no AI solution that can simply take a random PDF invoice, created in some random program, and immediately recognize everything and map it into a specific logical JSON array I've created. I've tried Google Document AI, Microsoft Azure, and all sorts of other services, both big and small. They all miss the mark. Most of them require manual training, tagging, defining fields, etc.

Here is an example:

I need to fill the payment_method field in my JSON. I have two different invoices containing the following information:

Payment must be made as an International or SEPA payment.
Payment method: bank transfer.

And what do you know? Not a single LLM model could handle it. My field was filled with anything but what was needed, or nothing at all was written.

That was just a small example. Generally speaking, it is impossible with any LLM that hasn't been specifically trained on data from existing invoices to achieve even 98–99% accuracy. Ridiculous...

If anyone can comment on this, provide a working solution, and prove me wrong, I would be very happy. 😄

reddit.com

u/sindijssupins — 2 days ago

▲ 3 r/AIMain+2 crossposts

will linear AI chatbots survive?

i've thought about this so much since literally every company in the world has just slapped on a chatbot as their MAIN PAGE and it's exhausting. and like, is it even the best way to interact with ai? what if u wanna branch off a new convo at a certain point? or u wanna organize what context u drop into the ai, like videos or images? idk what these companies are planning to do but this aint it

u/lru_cache0 — 2 days ago