Meta Cuts 8,000 And Drafts 10,000 to Build Their AI Replacement. The agentic AI tide is rising rapidly, and no one knows how to handle it.

On May 20, 2026, Meta eliminated approximately 8,000 positions, close to 10 percent of its workforce, and reassigned a comparable number of employees into new teams tasked with building AI agents. Internally, the reassignment process is referred to as being "drafted." The action is the most recent in a sequence of large reductions across the sector.

Oracle began terminating employees on March 31, 2026, in a reduction that independent estimates place between 20,000 and 30,000 people, near 18 percent of its staff. Amazon eliminated roughly 16,000 corporate roles in January 2026, following approximately 14,000 in October 2025. LinkedIn cut about 5 percent of its 17,000-person workforce in May. Microsoft, in April, introduced the first voluntary early retirement program in its 51-year history, offering buyouts to an estimated 8,750 United States employees whose combined age and tenure reach 70 or more. Aggregated data from layoffs.fyi recorded more than 100,000 technology-sector job eliminations in the first five months of 2026, approaching the full-year 2025 total.

The reductions share a stated rationale. Each company has linked the cuts to capital expenditure on AI infrastructure. Oracle's reduction accompanies a data-center buildout with capital spending near 50 billion dollars. Meta has guided toward 115 billion to 135 billion dollars in AI infrastructure spending for 2026. The structural change is not limited to headcount. Meta has redirected roughly 7,000 employees into teams designated Applied AI Engineering, the Agent Transformation Accelerator, and Central Analytics, with a mandate to develop agents that perform tasks currently assigned to people and to measure the resulting output.

The reorganization corresponds to a measurable shift in system architecture. Through 2025, AI coding tools operated primarily as prompt-driven assistants. The dominant design has since moved toward agents that operate on a codebase over extended sessions, retrieve repository context, execute tests, and complete multi-step tasks under limited supervision. This model depends on two infrastructure layers that did not exist in standardized form before late 2024.

The first is a protocol for connecting a model to external tools and data. The Model Context Protocol, released by Anthropic in November 2024, defines a JSON-RPC client-server interface for context ingestion and structured tool invocation, and is model-agnostic by design. The second is a protocol for communication between agents. The Agent-to-Agent protocol, released by Google in April 2025, uses HTTP transport, JSON-RPC messaging, and Server-Sent Events for streaming, and represents agent capabilities through machine-readable descriptors. The research literature has begun to formalize this layer. A 2025 survey of agent interoperability protocols (Ehtesham et al., arXiv:2505.02279) classifies four protocols, MCP, ACP, A2A, and ANP, by interoperability tier and proposes a phased adoption sequence. A parallel survey of AI agent protocols (Yang et al., arXiv:2504.16736) compares them across discovery, interaction, and security dimensions. Subsequent work has examined the security exposure these protocols introduce (arXiv:2506.19676), since standardized inter-agent messaging expands the attack surface relative to isolated systems.

The coordination of multiple agents operating in parallel is the open engineering problem these protocols are intended to address. It is also the capability the new corporate teams are organized to build.

The consequences for software roles are not settled. Boris Cherny, the creator of Claude Code, stated in early 2026 that routine code generation is largely a solved problem and that the title of software engineer may be replaced by broader descriptions. Dario Amodei, the chief executive of Anthropic, told the World Economic Forum in January 2026 that AI systems could perform most software engineering work within six to twelve months. A contrasting analysis by Steven Sinofsky observes that prior platform transitions, including personal computing and cloud infrastructure, were expected to reduce technical employment and instead expanded it, with effort shifting toward architecture, evaluation, and the direction of automated systems. Usage studies find a mix of augmentation and automation that varies by task.

What the new structures require is not the elimination of engineering judgment but its relocation. Specifying agent behavior, evaluating agent output, and architecting multi-agent systems are distinct from the skills that defined entry-level engineering hiring for most of the past decade. The accuracy of the current restructuring depends on how quickly that capability is developed.

This analysis was published by GradientCast, which produces technical interview-preparation material for machine learning and software engineering roles. Its recent walkthroughs include the design of multi-agent coding systems, covering the coordination of background coding agents through Model Context Protocol servers.

Sources

Layoff figures: Reuters, Bloomberg, CNBC, CNN, GeekWire, TechSpot reporting, May 2026; layoffs.fyi aggregated data.

Protocol literature:

Ehtesham, Singh, Gupta, Kumar (2025). A Survey of Agent Interoperability Protocols: MCP, ACP, A2A, and ANP. arXiv:2505.02279
Yang et al. (2025). A Survey of AI Agent Protocols. arXiv:2504.16736
A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures (2025). arXiv:2506.19676
Model Context Protocol specification, Anthropic, November 2024
Agent-to-Agent Protocol announcement, Google, April 2025

u/GradientCastTeam — 1 day ago

▲ 1 r/AIDeveloperNews

Stop grinding 500 LeetCode problems for FAANG ML. Here's what the loop actually tests in 2026

Here's something I've been wanting to write down for a while.

I run ML system design and behavioral rounds at one of the FAANG companies. Been doing it for about two years. Before that I was at another major tech firm. I'm not a recruiter, I'm an engineer who sits on hiring committees and conducts a lot of interviews. So this is one staff engineer's view, not my employer's official position, but I talk to enough peers at the other FAANG companies and AI labs to know my read isn't unusual.

Last month I rejected a candidate who solved every coding question cleanly. The medium in 8 minutes, the hard in 20. The kind of candidate who would've gotten an automatic hire rec in 2022. He didn't get the offer.

The round that sank him was system design. I gave him a problem that's now pretty standard in our loop: a recommender that started producing biased outputs overnight, hundreds of millions of users impacted, what do you do. He started drawing boxes. He talked about rebuilding the ranking pipeline. I asked him "what do you do first?" and he gave me more architecture. I asked again. More architecture. By the third time I was pretty sure I was writing a no-hire, and I did.

I don't think he was a bad engineer. I think he was prepping for a loop that doesn't exist anymore.

If you're a student or early-career engineer trying to break into FAANG ML, this is the part most prep guides aren't telling you yet.

What changed

By late 2024, GPT-class models could solve about 80% of LeetCode mediums on the first try. By mid-2025 every engineer I work with was using AI in their daily flow. The signal "writes correct fast code under pressure" stopped predicting anything useful about how someone performs on the job. So companies adapted. They started weighting the rounds where AI can't fake it for you. System design. Behavioral. The deep dive into your past work.

Nobody put out a press release. There's no official blog post saying "behavioral now matters 3x more than coding." But every interviewer doing this for two years has felt it. The debriefs sound different than 2023. The reasons we say no are different. The reasons we say strong-hire are different.

Coding round: still there, but not where you win

You still need to be solid. But it's now a "don't fail" round, not a "show off" round. After about 80 well-chosen LeetCode mediums you have very steep diminishing returns. The 200th problem gives you almost nothing the 80th didn't. If you're at problem 250 in grinding mode right now, you're optimizing the wrong thing.

I know that's hard to hear because LeetCode feels like the "real" prep. It's been the cultural center of FAANG prep for like a decade. But it's not where the loop is won now.

System design: where the loop is actually won

The question used to be "design a recommender system" with whiteboard freedom and a friendly interviewer who'd let you pick your favorite architecture. Now it's something like "your main feed ranking system is showing a 3% engagement drop in the last 24 hours, walk me through what you'd do, latency budget is 100ms, you can't change the ranking model this quarter." Real constraints. Real production decision. We're not testing whether you can recite a recsys architecture. We're testing whether you can hold the constraints of an actual system in your head while reasoning about a problem.

The thing that flips this round is reasoning out loud. And I don't mean narrating, like "first I'll do this, then I'll do that." I mean actual reasoning. Like "I think we should start by checking whether this is data quality, because the symptoms look like bias, but bias and feedback loops at this scale can look identical from the metrics alone, so the way I'd disambiguate is..." Visible thinking. You're telling me how your brain is working through the problem.

Every interviewer I work with grades for this and it's the single biggest thing that separates a hire from a no-hire. School trains you to give clean right answers. The interview rewards showing your work, especially when you don't have the right answer yet.

Behavioral: the most under-discussed change

Behavioral used to be a vibe check with a STAR-method shell. Now it's closer to "tell me about a decision you made that you later thought was wrong, and what you did about it." The questions probe judgment, not accomplishments. STAR recitations get spotted instantly and read as canned.

If you're a student you might think you don't have huge stories to tell. That's fine. We know your level. What we're looking for is whether you can talk honestly about your judgment and your mistakes. A student who tells a real specific story about a class project where they made a hard call and partly got it wrong is in way better shape than someone with a polished "I led the team to a 10% improvement" story. We see a lot of polished stories. They all blur together.

ML depth: about what you've built, not what you've read

Not textbook theory. Not "explain backprop." If you have one real ML project (even a strong internship, even a class project you took seriously) you should be able to talk about every decision: why that model, what you tried first, what surprised you, what you'd do differently.

If you don't have a project like that yet, this is the highest-leverage thing you can do. Forget the next 100 LeetCode problems. Build one ML system end-to-end (data pipeline, model, eval, deployment if you can) and write up what you learned. That's a better use of three months than any amount of grinding.

The new thing nobody's talking about: AI-collaboration literacy

This is new in the last 12 months. Interviewers, especially at the frontier AI labs and increasingly at the bigger tech companies, are watching whether you treat AI tools as a peer or as an oracle. Can you tell when a model's suggestion is plausible but wrong? Do you verify or trust? When you describe how you'd build something, are you describing how you'd build it with AI in the loop, or thinking like it's 2022?

This is becoming an explicit scoring dimension at the AI-first companies and it's going to keep mattering more. Practice using AI while reasoning out loud. Practice the moments where you catch the AI being wrong and say out loud how you caught it. It's a separable, learnable skill and most candidates aren't training it.

Time allocation if I were prepping right now:

30% system design (real-system reasoning under constraints, not whiteboarding)
25% behavioral (real story practice, not STAR memorization)
20% ML depth on systems you've actually built
15% coding (around 80 well-chosen mediums)
10% AI-collaboration practice

The biggest difference from standard internet advice is that 70% of your time should be on non-coding rounds. That feels wrong to most students. I get why. But it's where the loop is decided in 2026.

Bottom line

The interview isn't testing what you know. It's testing how you reason. LeetCode was pattern recognition. Modern ML loops aren't. They're about reasoning through new problems where the pattern doesn't exist. The candidates who get offers slow down when they don't know, ask clarifying questions, name assumptions, propose multiple approaches, pick one with explicit tradeoffs, and do all that out loud.

If you've been a strong student, you already have the reasoning ability the loop is testing. You just haven't practiced expressing it under pressure in a 45-minute window with someone watching. That's the actual skill. It's much shorter to learn than 500 LeetCode problems. Most candidates never practice it specifically.

Happy to answer questions in the comments.

If you want to dig deeper, I run gradientcast.com : deep, staff-level walkthroughs of every major ML system design pattern

reddit.com

u/GradientCastTeam — 4 days ago

▲ 112 r/MachineLearningJobs

The algo round is dead at FAANG for ML Engineers. What replaced it (from someone running the loops)

Two years ago I helped a friend prep for a Meta ML engineer loop. We did 200 LeetCode problems together. He was sharp, fast, would solve mediums in under twelve minutes. He didn't get the offer.

When the recruiter walked him through the debrief, the feedback was strange. The coding round had been "fine". Not a red flag, not a strong signal. What sank him was the system design round. The interviewer had given him an ambiguous problem about a recommender producing biased outputs and asked what he'd do in the next thirty minutes. He defaulted to architecture. The interviewer kept pulling him back: "what would you do first?"

I've been on the other side of that table for the past two years, running ML system design and behavioral rounds at Meta. What happened to my friend is happening to a lot of strong engineers, and most of them don't know why. They're prepping for a loop that doesn't exist anymore.

What changed: the algo round isn't dead, but its weight in hire/no-hire collapsed. The reason isn't that algorithms stopped mattering — it's that AI got too good at them. By 2025 GPT-class models could solve ~80% of LeetCode mediums first try. The skill "produce a clean implementation of two-sum in 12 minutes" stopped predicting anything useful. So companies shifted weight to what AI can't fake yet — judgment, system reasoning, AI-collaboration literacy, communication.

This shift wasn't announced. But anyone on a hiring committee in the past 18 months can tell you it happened. The debriefs sound different. The candidates who get strong-hire recs look different from the ones who got them in 2023.

What's actually being tested in 2026:

Reasoning under uncertainty. Deliberately ambiguous problems with no clean answer. The interviewer watches how you decompose, prioritize, name tradeoffs. Pattern-matching to a template = filtered out. Slowing down, asking questions, reasoning out loud = offer.
System design at production scale. Not "design Twitter" — "design Twitter with a 100ms latency budget, a degrading existing model, a downstream team you can't change, and a vendor you can't swap." Real constraints, real production decisions.
AI-collaboration literacy. New in the last 12 months. Can you tell when a model's output is plausible-but-wrong? Do you verify or trust? Becoming an explicit scoring dimension at Anthropic, OpenAI, increasingly Google/Meta.
Communication / thinking out loud. The candidates who get strong-hire reports almost always reason out loud constantly. Not narrating — *reasoning*. Making assumptions visible. Flagging uncertainty.

What an E5 Meta ML loop looks like now vs 2022:

2022: two LeetCode rounds, one broad system design, one ML theory round, one behavioral.

2026: one shorter coding round (pseudocode often fine for harder parts), one heavily-constrained system design round, one ML deep-dive on a system you've actually built, one meaningfully harder behavioral.

If I were prepping for a FAANG ML loop today, time allocation:

- 30% system design (real-system reasoning, not whiteboarding)
- 25% behavioral (real story practice, not STAR memorization)
- 20% ML depth on systems you've actually shipped
- 15% coding (~60–80 well-chosen mediums is enough; diminishing returns past that)
- 10% AI-collaboration practice (new — practice reasoning out loud while using AI tools)

The mindset shift: the interview is no longer testing what you know. It's testing how you reason.

LeetCode was about pattern recognition. Modern ML loops aren't. They're about reasoning through novel problems where the pattern doesn't exist. The candidates who get offers slow down when they don't know, ask clarifying questions, name assumptions, propose multiple approaches, pick one with explicit tradeoffs — out loud.

If you've been shipping production ML systems, you already have the skills. You just need to practice expressing them out loud, under pressure, in 45 minutes.

Grind less. Reason more. Talk out loud.

---

Full version with the worked example (Meta E5 round breakdown phase-by-phase) at gradientcast.com/insights/why-faang-killed-the-algo-round

---

reddit.com

u/GradientCastTeam — 9 days ago