u/Direct-Attention8597

r/vibecodingitalia r/buildinpublic r/claude r/generativeAI r/AI_Agents r/ClaudeAI r/AlignmentResearch r/OpenSourceeAI r/AppBusiness r/Agent_AI r/micro_saas r/SaaS r/ClaudeCode r/singularity r/selfhosted r/artificial

▲ 308 r/artificial+1 crossposts

Independent benchmark shows big drops on Claude Fable 5 after its relaunch, here’s the actual context

Saw this chart from BridgeMind going around. They reran BridgeBench (a coding benchmark covering debugging, refactoring, and hallucination detection) comparing the July 1 relaunch of Fable 5 to the original June 12 version:

Debugging: 86.2 → 25.9
Refactoring: 73.6 → 38.4
Hallucination: 75.9 → 61.7

Some context worth having before jumping to conclusions:

Fable 5 and Mythos 5 got pulled on June 12 due to a Commerce Department export control order, tied to a reported jailbreak that got the model to expose exploitable vulnerabilities. When it came back on July 1, Anthropic added a new safety classifier that catches the reported technique in 99%+ of cases, and any flagged request gets silently rerouted to Opus 4.8 instead of refused outright.

That’s the mechanism BridgeMind is pointing at. Their claim isn’t that the underlying weights changed, it’s that the classifier is triggering on too many normal coding tasks and quietly downgrading people to Opus 4.8 without them realizing it. A few other users on X are reporting the same thing (constant fallback, slower one-shot performance).

No independent lab has confirmed whether the weights themselves changed. This might just be an overly aggressive classifier rather than an actual capability regression, but if you’re relying on Fable 5 for coding work, worth watching this closely before you assume you’re getting the same model you had before June 12.

u/Direct-Attention8597 — 3 days ago

▲ 0 r/artificial

Claude Fable 5 is back — but it’ll block your regular coding requests (here’s why)

Fable 5 just got redeployed today (July 1) after a wild few weeks. Quick recap for those who missed it: Anthropic released Fable 5 on June 9, the US government slapped export controls on it June 12 because Amazon researchers found a jailbreak that let it identify software vulnerabilities and produce exploit code, and now the controls have been lifted.

But here’s the part nobody is really talking about: Fable 5 is not a drop-in upgrade for coding workflows.

Anthropic explicitly says in their blog post that the new improved safety classifier they trained to address the jailbreak “comes at the cost of flagging benign requests more often during routine coding and debugging tasks.”

This is by design, not a bug. They intentionally built Fable 5 with a much larger “safety margin” than any previous model. The classifier is tuned to treat ambiguous cybersecurity-adjacent requests as potentially harmful and a lot of normal coding falls into that gray zone. Think: asking about memory vulnerabilities, debugging low-level code, security tooling, anything that could look like recon for an exploit.

So practically speaking:

-Fable 5 is likely incredible for writing, reasoning, and research

-For security engineers or systems programmers, expect frustrating false positives

-Anthropic admits this and says they’ll keep refining it, but no timeline

The honest framing here is that Fable 5’s power comes paired with restrictions that make it less useful for a specific class of developers – the exact ones who would benefit most from a more capable model.

Curious if others hit these blocks in early access. What kinds of prompts triggered it?

anthropic.com

u/Direct-Attention8597 — 5 days ago

▲ 7 r/ClaudeCode

Claude Sonnet 5 just dropped, and it’s closing the gap with Opus

Anthropic released Claude Sonnet 5 today. The headline is that it’s their most agentic Sonnet yet, and benchmark-wise it’s now sitting close to Opus 4.8 performance while costing a fraction of the price.

A few things worth noting:

Pricing is $2/M input and $10/M output tokens through August 31, 2026 (intro pricing), then moves to $3/M and $15/M. It’s now the default model for Free and Pro plans, and also available on Max, Team, Enterprise, Claude Code, and the API as claude-sonnet-5.

On safety, Anthropic says it shows lower hallucination and sycophancy rates than Sonnet 4.6, and is better at resisting prompt injection. They deliberately did not train it heavily on cyber offense skills, so it scores much lower than Opus on dangerous exploit development tasks, but they still shipped it with the same real time cyber safeguards used in Opus 4.7/4.8.

One technical detail that matters if you’re budgeting: Sonnet 5 uses a new tokenizer, so the same input can map to 1.0 to 1.35x more tokens depending on content type. The intro pricing is set to make the transition roughly cost neutral.

Early access partner quotes (Cursor, Lovable, ClickHouse, etc.) all point to the same theme: it follows through on multi-step tasks without stalling halfway, which used to be the main complaint about smaller models.

Curious if anyone here has run it against real workloads yet. Feels like the Sonnet vs Opus decision is getting genuinely harder to make now.

u/Direct-Attention8597 — 6 days ago

▲ 828 r/ClaudeCode

Claude Fable 5 looks set to return behind ID verification and usage credits, and “US only” access seems likely

Anthropic looks like it’s prepping Claude Fable 5 to come back, and the way it’s being gated is worth a look.

Strings showing up in the flow:

Your credits will be added once your identity is verified.
Fable 5 runs on usage credits, billed separately from your plan.

So two gates: identity verification, plus usage credits billed separately from your normal plan. There are also signs Sonnet 5 is lining up for release.

Context if you missed it: Fable 5 and Mythos 5 launched June 9, then got pulled on June 12 under a US export control directive ordering Anthropic to cut off foreign nationals. Since nationality cannot be verified at API scale, Anthropic disabled both models for everyone, and they are still suspended.

Why “US only” access feels likely: Anthropic’s identity verification goes live July 8. That gives them a way to confirm US citizenship and restore Fable 5 to domestic users without waiting for the export control order to be lifted. Combine that with usage-credit billing and you get a tightly gated, probably US-first return.

What do you think: is ID verification becoming the default for frontier models, or does this just push people toward providers with less friction?

u/Direct-Attention8597 — 6 days ago

▲ 391 r/ClaudeCode

Just installed Claude Code

u/Direct-Attention8597 — 7 days ago

▲ 3 r/micro_saas+1 crossposts

I built an open-source, self-hostable clinic workspace (EHR): patient records, scheduling, prescriptions, and labs in one place

I've been building Temetro, an open-source, self-hostable workspace for clinics. The goal is to keep everything around the patient record in one place instead of stitching together five different tools.

Short video shows the patient record view.

What it does:

Patient records: history, medications, allergies, labs, and vitals as clean cards
Scheduling and appointments
Prescriptions, pharmacy, and inventory
Invoices and billing
Lab results (HL7/FHIR)
Notes, tasks, and real-time staff messaging
Activity audit trail and a live clinic dashboard
Multi-clinic with role-based access

Self-hosted and private by design: PHI stays inside your own network. It's MIT licensed, so you can read every line and run it yourself. No data lock-in.

Stack: Next.js 15 / React 19, Node/Express, PostgreSQL 17, TypeScript throughout. Runs in Docker.

Honest status: it's in beta and under active development. The interface is chat-first and patient/lab lookups work today. Connecting a real AI model for free-form questions is the next item on the roadmap. The patient-owned records piece (record lives on the patient's own device, signed and approved on their phone) is still early.

GitHub: github.com/temetro/temetro

Would love feedback from people who self-host, and especially anyone who's worked in or around a clinic: what would you need before trusting something like this with real records?

u/Direct-Attention8597 — 7 days ago

▲ 42 r/artificial

The AI frontier just got locked behind government approval, and most of us aren’t on the list

Something happened in the last two weeks that didn’t get nearly enough attention outside of tech circles.

Anthropic released what are reportedly their most capable models yet, Fable 5 and Mythos 5. The Trump administration then ordered Anthropic to ban all foreign nationals from accessing them, citing cybersecurity concerns. Anthropic’s response? They shut down access entirely, saying they couldn’t reliably enforce a “foreign nationals only” restriction.

The reason these models are so sensitive:

they apparently have an unprecedented ability to identify software vulnerabilities. Not just theoretically, but at a level that genuinely alarmed the US government.

Yesterday, OpenAI released GPT-5.6, a three-model family (Sol, Terra, and Luna). But it’s not available to you. Or me. Or probably anyone reading this. It’s limited to a small group of “trusted partners” whose identities have been shared with the US government, at the administration’s explicit request.

OpenAI themselves said they’re uncomfortable with this arrangement: “We don’t believe this kind of government access process should become the long-term default. It keeps the best tools from users, developers, enterprises, cyber defenders, and global partners who need them.”

So let’s be clear about where we are: the most powerful AI models in existence are now effectively state-controlled assets. They’re not products you can access, they’re capabilities being rationed by a government.

For those of us building outside the US, the message is pretty direct: the frontier is no longer public.

What’s your read on this? Is this legitimate national security caution or the beginning of something more permanent?

u/Direct-Attention8597 — 9 days ago

▲ 2 r/OpenSourceeAI+1 crossposts

Temetro – an open-source EHR so clinics can own their own patient data

Most clinic software today is cloud-hosted. You pay a subscription, your patient records live on someone else's server, and if you stop paying or the company shuts down you're in trouble. For clinics in Africa and the Middle East, this is even worse: the vendors are foreign, the data sovereignty concerns are real, and the pricing is built for Western markets.

Temetro is my attempt at a different approach. It's a full electronic health record system you self-host on your own infrastructure. Your patient data never leaves your server.

What it does:

Patient records: demographics, allergies, medications, labs, vitals with trend charts, encounter history
Appointments, prescriptions, pharmacy dispensing queue, lab work queues
Invoicing, real-time staff messaging, and a full audit log of every change
Role-based access: each staff type (doctor, reception, pharmacy, lab) gets a dashboard built for their actual job
HL7/FHIR, NCPDP SCRIPT (e-prescribing), and X12 claims already integrated

Running it:

git clone https://github.com/temetro/temetro.git
cd temetro/backend
docker compose up --build

That's the full install. PostgreSQL, Next.js frontend, Node/Express API, all wired together. No mandatory config, secrets auto-generate on first boot.

Why open source matters here specifically:

Healthcare software that is closed-source and cloud-only means a third party permanently holds your patient records. Open source here isn't just a licensing preference it's the only model that lets a clinic in Djibouti, Nairobi, or Amman actually control their own data, audit the code handling that data, and keep running even if the vendor disappears.

The long-term vision goes further: patient-owned records, where a chart is cryptographically signed and lives on the patient's own device. That part isn't built yet, but it's the north star.

Still in beta, actively developed. Contributions welcome.

GitHub: https://github.com/temetro/temetro

u/Direct-Attention8597 — 9 days ago

▲ 34 r/artificial+1 crossposts

Anthropic just published data showing 35% of their users expect AI to do MOST of their work within 12 months. We’re not having an honest conversation about what this actually means.

Anthropic dropped their June 2026 Economic Index today and buried inside the survey data is something that should be making headlines:

Over a third of respondents (9,700 actual Claude users, linked to real usage data) believe AI will be capable of handling most or nearly all of their work tasks within the next year.

Not “some tasks.” Not “help me write emails.” MOST of their work.

And here’s the part nobody wants to talk about: the people who delegate the most to AI are the MOST optimistic about their job prospects. Meanwhile entry-level workers are the ones most worried about displacement. Senior devs and managers? Thriving. Junior colleagues? Everyone in the survey is more worried about them than themselves.

The data also shows AI autonomy is measurably higher on Claude Code than on regular chat, across 26 out of 31 output types. A blog post that takes 13 rounds of back-and-forth on Claude.ai? Claude Code does it in a single prompt.

So here’s the uncomfortable question nobody wants to ask:

Are we witnessing the largest skill-premium compression in history, where the gap between a senior person using AI and a junior person using AI collapses the value of experience? Or is this actually fine and we’re all just catastrophizing?

Because Anthropic’s own framing spins this as “augmentation not displacement” while simultaneously showing that 38% of people who think they’ll lose their job attribute that directly to AI.

Make it make sense.

Full report: https://www.anthropic.com/research/economic-index-june-2026-report

u/Direct-Attention8597 — 10 days ago

▲ 2 r/ClaudeCode

I gave Claude Code a memory that understands both text AND screenshots open source MCP server

Built HybridRAG: indexes documents two ways in parallel text chunks for code/logs/JSON, rendered page tiles for charts/tables/layouts then merges the results.

The MCP server gives Claude Code 4 tools:

1- hybridrag_search : fused search, text + vision

2-hybildrag_add_text : index a doc mid-session

3-hybridrag_add_html : scrape + index a page

4- hybridrag_stats : see what’s in the index

Wire it up in .mcp.json and you’re done:

{
"mcpServers": {
"hybridrag": {
"command": "hybridrag-mcp",
"env": { "HYBRIDRAG_STORAGE": ".project_index" }
}
}
}

Core runs on numpy only. 43 tests, no model downloads needed to try it.

https://github.com/Khalidabdi1/HybridRAG

u/Direct-Attention8597 — 11 days ago

▲ 36 r/ClaudeCode

Claude Fable 5 may return today after 13-day government-forced suspension

Here’s the full timeline:

-June 9: Anthropic releases Claude Fable 5, their most powerful public model ever (Mythos-class with safeguards)

-June 12: US government issues an export control directive at 5:21 PM, ordering Anthropic to cut off access to ALL foreign nationals. Model goes offline worldwide within 90 minutes

The reason? Amazon engineers reportedly found a narrow jailbreak that could bypass Fable’s cybersecurity classifiers

-Anthropic complied but publicly pushed back, calling the action unfair

-Trump met Dario Amodei at the G7 and softened his stance, but the directive was never officially lifted

-June 26 (today): Congressional deadline for Commerce Secretary Lutnick to respond in writing about the export controls

Prediction markets are pricing ~57% odds of restoration before July 1. Developers have been stuck on Opus 4.8 this whole time.

This whole situation raises a serious question: if a government can pull your AI model offline in 90 minutes, what does that mean for anyone building on closed, hosted models?

u/Direct-Attention8597 — 11 days ago

▲ 649 r/artificial+1 crossposts

Claude Code v2.1.190 strings suggest Fable 5 is returning soon

Spotted in the latest Claude Code build (v2.1.190):

The “purchased separately from your plan” references for Fable 5 have been removed. New strings around weekly usage limits and included usage have been added in their place, pointing toward a bundled weekly allotment model rather than a standalone purchase.

On top of that, Fable 5 resurfaced in Amazon Bedrock strings, and Sonnet 5 is currently in Early Access for enterprise, which may be a placeholder while Fable 5 finishes development.

Anyone else tracking this? Feels like the release window is getting close.

u/Direct-Attention8597 — 11 days ago

▲ 1.8k r/Agent_AI+2 crossposts

Coding is largely solved.

u/Direct-Attention8597 — 12 days ago

▲ 162 r/ClaudeCode

Life after Fable 5

u/Direct-Attention8597 — 14 days ago

▲ 187 r/ClaudeCode

🔥 Anthropic is bringing Cowork support to Claude Mobile soon.

This means you’ll be able to:

• Start and manage tasks directly from your phone

• Keep track of progress from mobile, browser, or desktop

• Let Claude continue working in the background even after closing the app

• Stay productive while on the go

The gap between desktop AI workflows and mobile AI workflows keeps getting smaller.

u/Direct-Attention8597 — 14 days ago

▲ 1.6k r/vibecodingitalia+2 crossposts

Claude Sonnet 5 “Fennec” leak 1M context, expected next week

A leak just surfaced about Anthropic’s next Sonnet model, internally codenamed Fennec.

Details so far:
- Codename: Fennec
- Context window: 1 million tokens
- Expected release: as early as next week
- Performance: strong coding, fast inference, better price/performance than Opus and Fable
If accurate, this would be a significant jump, especially the 1M context window at Sonnet pricing.

Take it with a grain of salt since nothing is confirmed yet, but Anthropic has been on a fast release cadence lately so it wouldn’t be surprising.

Are you waiting for this before starting a new project, or are you already locked in on your current model setup?

u/Direct-Attention8597 — 14 days ago

▲ 908 r/ClaudeCode

Daily reminder that Fable is still not back.

u/Direct-Attention8597 — 19 days ago

▲ 0 r/artificial

Anthropic just published data from 400k Claude Code sessions, and the headline buries the real story: your CS degree is becoming optional

Anthropric released a research paper today analyzing ~400,000 Claude Code sessions. The findings are wild and I haven’t seen anyone talk about the uncomfortable implications.

What they actually found:
-Lawyers, accountants, and managers succeed at coding tasks within 7 percentage points of actual software engineers

-Management occupations had the HIGHEST verified success rate. Higher than software engineers.

-The gap between experts and intermediates is “modest” meaning once you hit a basic level of domain knowledge, you get most of the benefit

-Sessions where users show debugging skills fell by nearly half in 7 months

-The value of the average task rose ~27% in 7 months

The part everyone is ignoring:
Anthropric’s own framing is “expertise still matters!” But read their definition of expertise carefully. It’s NOT coding expertise. It’s domain expertise. A lawyer who knows exactly what clauses to flag counts as an “expert” in their session, even if they’ve never written a line of code.

So when they say “expertise persists,” they mean: understanding your problem still matters. Understanding code increasingly doesn’t.

Think about what that actually means. Every company has been hiring senior engineers partly for their ability to translate business problems into code. That translation layer is what’s collapsing. The lawyers and managers are coming for your job not by learning to code, but by not needing to.

And Anthropic sat on 400k sessions of data showing this is already happening, and the headline is “expertise matters”?

The real headline is: if you’re a software engineer whose main value is implementation, the floor is dropping.

reddit.com

u/Direct-Attention8597 — 19 days ago

▲ 889 r/ClaudeCode

Anthropic just published data from 400k Claude Code sessions, and the headline buries the real story: your CS degree is becoming optional

Anthropric released a research paper today analyzing ~400,000 Claude Code sessions. The findings are wild and I haven’t seen anyone talk about the uncomfortable implications.

What they actually found:
-Lawyers, accountants, and managers succeed at coding tasks within 7 percentage points of actual software engineers

-Management occupations had the HIGHEST verified success rate. Higher than software engineers.

-The gap between experts and intermediates is “modest” meaning once you hit a basic level of domain knowledge, you get most of the benefit

-Sessions where users show debugging skills fell by nearly half in 7 months

-The value of the average task rose ~27% in 7 months

So when they say “expertise persists,” they mean: understanding your problem still matters. Understanding code increasingly doesn’t.

And Anthropic sat on 400k sessions of data showing this is already happening, and the headline is “expertise matters”?

The real headline is: if you’re a software engineer whose main value is implementation, the floor is dropping.

reddit.com

u/Direct-Attention8597 — 19 days ago

▲ 5 r/artificial

US government just forced Anthropic to pull Fable 5 and Mythos 5 for all users

Anthropic put out a statement today. The US government issued an export control directive citing national security, suspending access to Fable 5 and Mythos 5 for any foreign national, inside or outside the US. To comply, Anthropic had to disable both models for everyone immediately. Other Claude models are not affected.

The stated reason is a potential method to bypass Fable 5’s safeguards. But Anthropic says it reviewed the demonstration and found the vulnerabilities were minor, already known, and discoverable by other public models (they specifically point to GPT-5.5) without needing any bypass.

Anthropic is complying but openly disagrees. Their argument is that recalling a commercial model used by hundreds of millions over a narrow potential jailbreak could effectively freeze new model deployments across the whole industry if it became the standard.

What I find interesting is the precedent. If a verbal report of a minor, non-universal jailbreak is enough to pull a frontier model, where does that leave every other provider?

Curious what people here think. Reasonable safety intervention, or government overreach that hurts the whole field?

reddit.com

u/Direct-Attention8597 — 23 days ago