u/Such-Run-4412

Alibaba Just Dropped Qwen3.7-Max — A New Flagship Model Built for AI Agents

Alibaba’s Qwen team released Qwen3.7-Max, its latest proprietary flagship model built specifically for the “agent era.”

This is not being pitched as just another chatbot. Qwen3.7-Max is designed for long-running AI agents that can write code, debug, use tools, work across files, automate office tasks, and coordinate multi-step workflows.

The biggest claim is long-horizon autonomy. Qwen says the model ran a 35-hour kernel optimization task with 1,000+ tool calls and no hand-holding. That is the kind of workload AI labs are now using to prove whether a model can actually stay coherent over many steps, not just answer a prompt well.

Coding is the main focus. Qwen3.7-Max is aimed at frontend prototypes, multi-file refactors, real debugging, and end-to-end coding agent work. It is also designed to work across different agent scaffolds, including Claude Code, OpenClaw, Qwen Code, Hermes, or custom stacks.

The benchmark claims are strong. Coverage of the Qwen release says Qwen3.7-Max scored 69.7 on Terminal-Bench 2.0 Terminus-2, 80.4 on SWE-bench Verified, 60.8 on MCP-Mark, and 76.4 on MCP-Atlas. It reportedly beat or came close to major competitors across several agent and coding benchmarks.

The model also showed strong kernel optimization performance. In KernelBench L3, it reportedly reached a 1.98x median speedup over PyTorch reference implementations and produced code faster than torch.compile in 96% of cases.
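To make those two numbers concrete, here is a minimal Python sketch of how a median speedup and a "faster than baseline" rate could be computed. The runtimes are made-up illustration values, not Qwen's results, and the way KernelBench actually aggregates its metrics is an assumption here.

```python
from statistics import median

# Hypothetical per-task runtimes in milliseconds (NOT real KernelBench data).
reference_ms = [10.0, 8.0, 12.0, 20.0, 5.0]   # plain PyTorch reference
compiled_ms = [7.0, 6.0, 9.0, 10.0, 4.5]      # torch.compile baseline
generated_ms = [5.0, 5.0, 6.0, 12.0, 2.0]     # model-written kernels


def median_speedup(baseline, candidate):
    """Median of per-task speedups, where speedup = baseline time / candidate time."""
    return median(b / c for b, c in zip(baseline, candidate))


def fraction_faster(baseline, candidate):
    """Share of tasks where the candidate runs strictly faster than the baseline."""
    wins = sum(1 for b, c in zip(baseline, candidate) if c < b)
    return wins / len(baseline)


print(median_speedup(reference_ms, generated_ms))   # median speedup vs. reference
print(fraction_faster(compiled_ms, generated_ms))   # win rate vs. torch.compile
```

A median is less sensitive to a few outlier kernels than a mean, which is probably why a median speedup is the number being quoted.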

The really important detail is that Qwen is emphasizing “scaffold-agnostic” performance. A lot of models look good inside their own optimized toolchain, then fall apart when moved into another agent framework. Qwen is trying to show that 3.7-Max learned the actual task-solving behavior, not just tricks for one harness.

There is also a business/productivity angle. Qwen3.7-Max supports MCP integrations and multi-agent orchestration, so it can be used as an office assistant for workflows involving documents, spreadsheets, productivity tools, and enterprise systems.

Developers can use it through Alibaba Model Studio, and it is also available to try in Qwen Studio.

Big takeaway: Qwen is no longer just competing on open model releases or benchmark flexing. Alibaba is clearly aiming at the agent market — coding agents, office agents, MCP workflows, and long-running autonomous tasks.

The frontier model race is turning into an agent race. Qwen3.7-Max is Alibaba’s strongest signal yet that it wants to be a serious player there.

Source: https://qwen.ai/blog?id=qwen3.7

u/Such-Run-4412 — 22 hours ago

▲ 1 r/AIGuild

Anthropic May Run Claude on Microsoft’s AI Chips — Another Sign Nvidia Won’t Be the Only Game in Town

Anthropic is reportedly in talks to rent servers powered by Microsoft’s own AI chips, giving Claude another possible source of compute as demand for AI services keeps rising.

The talks are still early and may not lead to a final deal. But if a deal does happen, it would be a major win for Microsoft’s custom chip ambitions.

The chip in question is Maia 200, Microsoft’s second-generation in-house AI accelerator. Microsoft built it for large-scale AI inference, meaning the expensive everyday work of running AI models after they are trained — answering prompts, powering assistants, and serving millions of users.

This matters because the AI race is no longer just about who has the smartest model. It is also about who can secure enough compute, lower inference costs, and avoid being fully dependent on Nvidia GPUs.

Anthropic already works with Amazon and Google for AI infrastructure, so adding Microsoft chips would make its compute strategy even more diversified. That is important because Claude’s usage is growing, and frontier AI companies cannot afford to rely on one cloud provider or one chip supplier.

For Microsoft, this would be a big credibility boost. Google has TPUs. Amazon has Trainium. Microsoft wants Maia to become a serious part of the AI cloud stack too.

The bigger takeaway: AI labs are turning into infrastructure strategists. The companies that win may not just be the ones with the best models, but the ones that can get enough chips, power, and data center capacity to actually serve those models at scale.

Source: https://www.bloomberg.com/news/articles/2026-05-21/anthropic-in-talks-to-use-microsoft-ai-chips-information-says

u/Such-Run-4412 — 22 hours ago

▲ 5 r/AIGuild

Anthropic May Use Microsoft’s Own AI Chips — The Compute Race Is Getting Weird Fast

Anthropic is reportedly in early talks to rent servers powered by Microsoft-designed AI chips to meet rising demand for Claude and its other AI services. The discussions are still early and may not become a final deal.

The key chip here is Maia 200, Microsoft’s second-generation in-house AI accelerator. Microsoft built it mainly for inference, meaning the everyday work of running AI models after they are trained — answering prompts, generating text, powering assistants, and serving users at scale.

Inference is becoming one of the biggest costs in AI. Training a frontier model is expensive, but once millions of people start using it every day, serving all those requests can become a massive ongoing compute bill.

If Anthropic ends up using Maia servers, it would be a big win for Microsoft’s custom chip strategy. Microsoft is trying to follow the path of Google and Amazon, which built custom AI chips for themselves and then turned them into cloud products for outside AI companies.

It would also make Anthropic’s compute strategy even more diversified. Anthropic already has major infrastructure ties with Amazon and Google, and now it may add Microsoft’s own silicon to the mix. That suggests Claude’s demand is large enough that Anthropic does not want to depend on one cloud, one chip supplier, or one hardware roadmap.

The Nvidia angle is important too. Nvidia still dominates AI chips, but its GPUs are expensive and supply-constrained. Custom chips from Microsoft, Google, Amazon, and others are becoming more attractive because AI labs need cheaper and more available compute.

Microsoft’s Maia 200 is built on TSMC’s 3nm process and includes 216GB of HBM3e memory, 7 TB/s of memory bandwidth, and 272MB of on-chip SRAM. Microsoft says the chip is designed to improve the economics of AI token generation and support large-scale model serving.

There is also a bigger relationship shift happening. Microsoft has been adding Anthropic models into products like Copilot while its long-standing OpenAI partnership becomes less exclusive. So this is not just a chip story. It is also another sign that Microsoft wants a broader AI stack: OpenAI, Anthropic, its own models, Azure, and now its own chips.

Big takeaway: the AI race is no longer just about who has the best model. It is about who can secure enough compute, lower inference costs, and avoid being trapped by Nvidia shortages or one cloud provider.

Anthropic talking to Microsoft about Maia chips is another sign that frontier AI labs are becoming infrastructure strategists, not just model builders.

Source: https://www.theinformation.com/articles/anthropic-talks-use-microsofts-ai-chips?rc=mf8uqd

u/Such-Run-4412 — 22 hours ago

▲ 1 r/AIGuild

OpenAI Launches ChatGPT for PowerPoint — AI Slide Creation Is Now Inside PowerPoint

OpenAI released ChatGPT for PowerPoint in beta, a new app that lets users create and edit presentations directly inside Microsoft PowerPoint.

The main idea is simple: instead of copying content back and forth between ChatGPT and PowerPoint, you can now ask ChatGPT to work inside the deck itself.

It can create new slides, update existing decks, rewrite slide titles, turn notes into a board update, clean up dense slides, add chart slides, summarize the story of a deck, and turn screenshots or source material into editable PowerPoint slides.

The feature works with notes, documents, spreadsheets, prompts, existing decks, images, and text files. It can also review a presentation and point out where the story is weak, what is missing, and what an executive audience might ask.

This is available globally in beta for ChatGPT Free, Go, Plus, Pro, Business, Enterprise, Edu, Teachers, and K-12 users. You install it from PowerPoint’s Add-ins menu, open ChatGPT from the PowerPoint ribbon, then sign in with your OpenAI account.

The important part is that ChatGPT is not just generating a static slide image. It is designed to help preserve editable slide content, so users can still revise the deck inside PowerPoint.

There are limitations. Since this is beta, OpenAI warns that results may be incomplete or incorrect, and users should review formatting, numbers, claims, and important details before sharing. Some advanced PowerPoint formatting, templates, and font handling may not work perfectly yet.

Still, this is a pretty big productivity move. PowerPoint is one of the most used business tools in the world, and now ChatGPT is moving from “help me write slide content” to “help me build and edit the actual deck.”

The bigger trend is obvious: AI is being embedded directly into work apps, not just sitting in a separate chatbot window.

Source: https://chatgpt.com/apps/powerpoint/

u/Such-Run-4412 — 22 hours ago

▲ 2 r/AIGuild

Google’s AI Plan Is Bigger Than Gemini — It Wants Agents Inside Search, Gmail, YouTube, Glasses, and Shopping

Google’s latest AI push looks less like a normal model update and more like a full ecosystem play.

The headline model is Gemini 3.5 Flash, which is becoming the default model for the Gemini app and AI Mode in Search. It is built for faster output, better coding, stronger agentic task handling, improved multimodal work, richer interactive UI generation, and better safety guardrails.

But the bigger Gemini 3.5 Pro model is not out yet. Google says it is still being tested and is expected next month.

The most interesting part is how aggressively Google is pushing agents everywhere.

Gemini Spark is Google’s new personal AI agent that can work across Gmail, Docs, Sheets, Slides, and other Google apps. It can run in the background even when your phone is off or laptop is closed, because it runs on Google’s virtual machines. The goal is basically a 24/7 personal assistant that can track tasks, handle recurring workflows, and act across your digital life.

Google is also adding a Daily Brief, which sounds like a lighter version of Spark. It looks at your email, calendar, tasks, and other personal context, then prioritizes what matters and suggests next steps.

Search is changing too. AI Mode now runs on Gemini 3.5 Flash and supports text, images, files, videos, and even full Chrome tabs. The big shift is that Google Search is moving away from just “ten blue links” and toward interactive answers, generated tools, agents, and ongoing task completion.

That has huge implications for publishers. More answers inside Google means more zero-click searches, where users get what they need without visiting the websites that created the original content. This could be great for users, but rough for websites that depend on Google traffic.

Google is also pushing agentic shopping through something called a universal cart. The idea is a Gemini-powered shopping layer that works across Search, Gemini, YouTube, Gmail, and major retailers like Nike, Target, Walmart, Sephora, Wayfair, and Shopify. It can track prices, find coupons, alert you when items are back in stock, and possibly warn you if something is incompatible with your setup.

Then there are the AI glasses. Google is working with Samsung, Qualcomm, Gentle Monster, Warby Parker, and others on Android XR eyewear. The first version seems focused on audio, Gemini voice assistance, live translation, navigation, and hands-free interaction. Display glasses are still coming later.

Google is also turning more of its apps into AI surfaces. Gmail gets AI Inbox and Gmail Live. Docs gets voice-powered editing through Docs Live. Keep gets Keep Live for capturing messy thoughts and turning them into organized notes. YouTube gets Ask YouTube, which lets users ask questions about videos and jump to relevant timestamps.

The infrastructure side may be just as important. Google is partnering with Blackstone on a TPU-powered AI cloud venture, with Blackstone reportedly putting in $5 billion in equity. The goal is to expand compute capacity as AI agents become more common and more expensive to run.

Video URL: https://youtu.be/cSZ-y49e5q8?si=reUrkXPqT4H-9rgj

u/Such-Run-4412 — 2 days ago

▲ 1 r/AIGuild

Nous Hermes Agent Adds “Skill Bundles” — Basically Workflow Packs for AI Agents

Nous Research’s Hermes Agent now has a full skills system, and the most interesting part is skill bundles.

Skills are reusable instruction files that an agent can load only when needed. Instead of stuffing every workflow into the system prompt, Hermes keeps skills separate and pulls them in on demand. That makes the agent more token-efficient, meaning it wastes less context space on instructions it does not need.

Each skill works like a slash command. For example, you can call a coding workflow, a planning workflow, or a GitHub PR workflow directly inside Hermes. The agent can also discover skills naturally during chat.

The bigger update is skill bundles. These are small YAML files that group several skills together under one slash command. So instead of loading three separate skills one by one, you can create something like /backend-dev and have it load code review, test-driven development, and PR workflow instructions all at once.
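The bundle idea can be sketched in a few lines of Python. This is only an illustration of the concept: the dictionaries stand in for skill files and a parsed bundle YAML, and every name here (`SKILLS`, `BUNDLES`, `resolve_command`, `build_context`, the skill IDs) is hypothetical. Hermes's real file layout and schema may look different, so check the docs before copying anything.

```python
# Hypothetical stand-ins for skill files on disk (not Hermes's actual format).
SKILLS = {
    "code-review": "Instructions for reviewing a diff.",
    "tdd": "Instructions for test-driven development.",
    "pr-workflow": "Instructions for opening a GitHub PR.",
    "incident-response": "Instructions for triaging an outage.",
}

# Hypothetical stand-in for a parsed bundle file: one slash command, several skills.
BUNDLES = {
    "backend-dev": ["code-review", "tdd", "pr-workflow"],
}


def resolve_command(command: str) -> list[str]:
    """Return the skill names a slash command should load."""
    name = command.lstrip("/")
    if name in BUNDLES:
        return list(BUNDLES[name])
    if name in SKILLS:
        return [name]
    raise KeyError(f"unknown skill or bundle: {command}")


def build_context(system_prompt: str, command: str) -> str:
    """Append only the requested skills to the prompt, leaving the rest unloaded."""
    parts = [system_prompt] + [SKILLS[s] for s in resolve_command(command)]
    return "\n\n".join(parts)


print(resolve_command("/backend-dev"))
print(build_context("You are a coding agent.", "/tdd"))
```

The point of the design is that skills nobody asked for (like the incident-response one above) never take up context space.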

That matters because real agent work usually needs more than one capability. A backend task might need planning, testing, code review, and GitHub steps. An incident response task might need logs, debugging, deployment, and documentation. Skill bundles turn those repeated combinations into one reusable workflow.

Hermes also supports external skill directories, meaning teams can share skills across machines or repos. It can install skills from hubs and registries, and agents can even create or update their own skills after learning a useful workflow.

The big idea here is procedural memory for agents. Instead of treating every task like it is brand new, Hermes can save the steps that worked and reuse them later.

This feels like another sign that AI agents are moving away from simple chatbots and toward configurable work systems. The next wave may not just be “smarter models.” It may be agents with reusable playbooks, team workflows, and shared skill libraries.

Source: https://hermes-agent.nousresearch.com/docs/user-guide/features/skills#skill-bundles

u/Such-Run-4412 — 2 days ago

▲ 1 r/AIGuild

OpenAI May File for IPO Within Days — The AI Race Is Officially Becoming a Wall Street Race

OpenAI is preparing to confidentially file for an IPO, possibly within days or weeks, setting up what could become one of the biggest public listings in tech history.

The company is working with major banks, including Goldman Sachs and Morgan Stanley, on the filing. The target is reportedly to be ready for a public debut as early as September 2026, though the timeline could still change.

This comes after OpenAI cleared a major legal hurdle. Elon Musk’s lawsuit over OpenAI’s shift away from its nonprofit roots was dismissed, removing one of the biggest risks hanging over a potential IPO. Musk still plans to appeal, so the legal drama may not be fully over.

The timing is interesting because OpenAI is both incredibly powerful and under pressure. It was recently valued at around $852 billion, but it is also facing stronger competition from Anthropic and Google. Internally, it has reportedly missed some user and revenue targets, while investors are watching its massive data-center spending closely.

There also seems to be some tension around timing. Sam Altman is pushing forward, while CFO Sarah Friar has reportedly been more cautious about going public too soon.

The bigger picture: AI labs are no longer just research companies. They are becoming giant infrastructure businesses that need public-market money to fund compute, chips, energy, and data centers.

If OpenAI goes public this year, it could reshape the entire AI market. Investors would finally get direct exposure to ChatGPT, but they would also get a front-row seat to how expensive the AI race really is.

Source: https://www.wsj.com/tech/ai/openai-ipo-filing-date-0ec95af5

u/Such-Run-4412 — 2 days ago

▲ 11 r/AIGuild

Anthropic Is About to Hand SpaceX a $45B Compute Check

Anthropic has agreed to pay SpaceX nearly $45 billion over the next three years for AI computing resources, making this one of the biggest AI infrastructure deals we’ve seen so far.

The deal works out to about $1.25 billion per month, or roughly $15 billion per year, running through May 2029. The compute will support Anthropic’s AI products, especially Claude, as demand for inference keeps exploding.
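The per-month and per-year figures follow directly from the headline number. A quick Python check, assuming a flat 36-month schedule and rounding "nearly $45 billion" up to an even $45 billion (the actual payment schedule is reportedly not flat):

```python
# Assumptions: exactly $45B total, spread evenly over 36 months.
TOTAL_USD = 45_000_000_000
MONTHS = 36


def average_monthly(total_usd: float, months: int) -> float:
    """Average payment per month under a flat schedule."""
    return total_usd / months


per_month = average_monthly(TOTAL_USD, MONTHS)
per_year = per_month * 12

print(f"per month: ${per_month / 1e9:.2f}B")
print(f"per year:  ${per_year / 1e9:.2f}B")
```

Since early fees are lower while capacity ramps up, the later months would have to run above this average for the total to land near $45 billion.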

The interesting part is that this pushes SpaceX deeper into the AI infrastructure business. SpaceX is not just launching rockets anymore. It is becoming a major compute provider through its large AI data centers, reportedly including Colossus and Colossus II in Tennessee.

There are some flexible terms. Either Anthropic or SpaceX can walk away with 90 days’ notice, and the fees are lower during the early ramp-up period while capacity comes online.

This also shows how expensive the AI race has become. Even a top AI lab like Anthropic now needs massive outside infrastructure deals just to keep up with usage. The bottleneck is no longer only model quality. It is power, GPUs, data centers, and who can secure enough compute.

For SpaceX, this could become a huge new revenue stream before its IPO. For Anthropic, it gives Claude more compute at a time when the company is reportedly growing fast and moving closer to profitability.

The big takeaway: AI companies are turning into infrastructure companies by necessity, and infrastructure companies are turning into AI companies by opportunity.

Source: https://www.bloomberg.com/news/articles/2026-05-20/anthropic-to-pay-spacex-nearly-45-billion-for-computing-deal

u/Such-Run-4412 — 2 days ago

▲ 1 r/AIGuild

Google Launches Gemini 3.5 Flash — A Faster Agent Model Built to Actually Do Things

Google just introduced Gemini 3.5, starting with Gemini 3.5 Flash, and the main pitch is clear: this model is built for action, not just chat.

The big upgrade is agentic work. Gemini 3.5 Flash is designed to plan, use tools, write code, manage long tasks, and run multi-step workflows with less waiting. Google says it is its strongest model yet for agents and coding, while still keeping the speed people expect from the Flash line.

It is already available in the Gemini app, AI Mode in Google Search, Google Antigravity, Gemini API in AI Studio, Android Studio, and Google’s enterprise AI platforms. That means this is not just a lab preview. Google is pushing it straight into consumer apps, developer tools, and business workflows.

For developers, the biggest angle is coding agents. Google says 3.5 Flash can help transform old codebases, build apps, create interactive graphics, generate UI concepts, and even coordinate subagents through Antigravity. One demo had agents synthesize the AlphaZero paper and build a playable game in six hours.

For businesses, Google is pitching this as a practical automation engine. Shopify is using it for merchant growth forecasts. Macquarie Bank is testing it on long customer onboarding documents. Salesforce is adding it to Agentforce. Ramp is using it for smarter invoice OCR. Xero is using agents for admin tasks like supplier research and 1099 tax prep. Databricks is using it to help diagnose data issues.

The consumer side is also interesting. Gemini 3.5 Flash is now the default model for the Gemini app and AI Mode in Search. It also powers Gemini Spark, Google’s new personal AI agent that can run in the background and take action under your direction. Spark is rolling out first to trusted testers, with a beta planned for Google AI Ultra subscribers in the U.S.

Google also says Gemini 3.5 was built with stronger safeguards around cyber and CBRN risks, plus interpretability tools to better understand the model’s internal reasoning before it answers.

One more thing: Gemini 3.5 Pro is coming next month and is already being used internally.

Overall, this feels like Google moving Gemini deeper into the agent era. The headline is not just “better model.” It is “faster model that can operate across apps, code, Search, business tools, and personal workflows.”

Source: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/

u/Such-Run-4412 — 2 days ago

▲ 1 r/AIGuild

Google I/O 2026 “Agentic Gemini” Keynote

Google’s I/O 2026 keynote was all about turning Gemini from a chatbot into an agent layer across Search, Android, Workspace, YouTube, shopping, coding, and smart glasses.

The biggest model news was Gemini 3.5 Flash, which Google says brings frontier-level performance for agents and coding while staying fast. It is rolling out across the Gemini app, Search, Antigravity 2.0, and the Gemini API. Google also teased Gemini 3.5 Pro for next month.

Google also introduced Gemini Omni, a new multimodal creation model that can take text, images, audio, and video as inputs and generate or edit video. The first version, Gemini Omni Flash, is rolling out to the Gemini app, Google Flow, and YouTube Shorts.

The biggest assistant shift is Gemini Spark, Google’s new personal agent. It can take actions on your behalf across Gmail, Docs, Workspace apps, and eventually third-party tools through MCP. Google is positioning Spark as the move from “assistant that answers” to “agent that does work.”

Search is also getting rebuilt around AI. Google announced a new intelligent Search box, AI-powered query suggestions, background information agents, and custom dashboards that can monitor topics like finance, shopping, sports, news, or web changes over time.

Shopping got its own agentic layer too: Universal Cart. It can handle deals, stock alerts, price history, product compatibility, loyalty perks, and payment benefits across Google products and merchants.

Google also pushed AI deeper into YouTube, Workspace, and hardware. Ask YouTube can answer complex video-search questions with follow-ups, Gemini Omni is coming to YouTube Shorts Remix, Gmail and Docs are getting conversational “Live” features, and Google showed new Android XR intelligent eyewear made with partners like Samsung, Qualcomm, Gentle Monster, and Warby Parker.

Source: https://www.youtube.com/live/wYSncx9zLIU?si=RkR5raKrYYavmr6t

u/Such-Run-4412 — 3 days ago

▲ 9 r/AIGuild

Claude Managed Agents Can Now Run Inside Your Own Infrastructure

Anthropic upgraded Claude Managed Agents with self-hosted sandboxes and MCP tunnels, making the platform much more enterprise-friendly. The big idea: Claude can still manage the agent loop, but the actual tool execution can happen inside infrastructure the company controls.

Self-hosted sandboxes let agents run tools, access files, install packages, execute code, and do compute-heavy work without sensitive repositories or internal services leaving the company’s perimeter. Companies can use their own infrastructure or managed sandbox providers like Cloudflare, Daytona, Modal, and Vercel.

The second big feature is MCP tunnels. These let Claude agents connect to private MCP servers inside a company’s network without exposing them to the public internet. That means internal databases, private APIs, ticketing systems, knowledge bases, and other tools can become available to agents through an encrypted outbound connection.

The trust angle is the real story. Enterprises want agents that can use internal systems, but they do not want to hand over sensitive files, credentials, or private infrastructure. This update gives companies more control over runtime, networking, audit logs, resource limits, and security boundaries.

Source: https://claude.com/blog/claude-managed-agents-updates

u/Such-Run-4412 — 3 days ago

▲ 1 r/AIGuild

SpaceX Reportedly Plans to Buy Cursor Right After Its IPO

SpaceX reportedly expects to move forward with its acquisition of Cursor about 30 days after SpaceX begins public trading, according to Bloomberg. The IPO is expected around June 12, which would put a possible Cursor deal timeline sometime in July.

This builds on earlier reporting that SpaceX had secured the option to either acquire Cursor for around $60 billion or pursue a smaller partnership instead. Reuters previously framed the Cursor deal as one of the stranger but most important parts of SpaceX’s IPO story because it ties Musk’s space company directly to AI coding infrastructure.

The logic is pretty clear: Cursor is one of the most important AI coding tools right now, and Musk’s companies need massive software automation across SpaceX, Tesla, xAI, Starlink, robotics, chips, and possibly orbital compute. Buying Cursor would give SpaceX a direct role in the developer-tool layer of the AI race.

It also makes SpaceX’s IPO even more unusual. Investors are not just being asked to value rockets and satellites. They are being asked to value a company that could include space launch, Starlink, xAI, AI infrastructure, and now potentially one of the hottest AI coding platforms.

Source: https://www.bloomberg.com/news/articles/2026-05-19/spacex-is-said-to-plan-to-buy-startup-cursor-30-days-after-ipo

u/Such-Run-4412 — 3 days ago

▲ 1 r/AIGuild

SpaceX Is Becoming the Center of Musk’s AI Empire

SpaceX is suddenly much more than a rocket and satellite company. It is now being pitched as a massive AI infrastructure play, with investors reportedly preparing for one of the biggest IPOs in history.

The company is targeting a raise of around $75 billion at a valuation near $1.75 trillion, which could make it the largest stock market debut ever. BlackRock has reportedly discussed investing $5 billion to $10 billion in the offering.

SpaceX shareholders also approved a 5-for-1 stock split, lowering the fair market value per share from $526.59 to $105.32 ahead of the planned listing. The company is reportedly aiming to list on Nasdaq as early as June 12, 2026.
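The split math is easy to verify. A small Python sketch using `Decimal` for the money values; the 100-share holding is a made-up example, not a reported figure:

```python
from decimal import Decimal, ROUND_HALF_UP

PRE_SPLIT_PRICE = Decimal("526.59")  # fair market value per share before the split
SPLIT_RATIO = 5                      # 5-for-1


def post_split_price(price: Decimal, ratio: int) -> Decimal:
    """Per-share value after a ratio-for-1 split, rounded to cents."""
    return (price / ratio).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


new_price = post_split_price(PRE_SPLIT_PRICE, SPLIT_RATIO)
print(new_price)

# Hypothetical holder with 100 shares: share count goes up 5x, total value is unchanged.
shares_before = 100
shares_after = shares_before * SPLIT_RATIO
value_before = shares_before * PRE_SPLIT_PRICE
value_after = shares_after * (PRE_SPLIT_PRICE / SPLIT_RATIO)
print(value_before, value_after)
```

A split changes the price per share and the share count, not what the company or any single holding is worth.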

The bigger story is AI. SpaceX now includes xAI, Starlink, rocket launches, and Musk’s long-term vision for orbital data centers. Reuters’ Breakingviews framed SpaceX as four buckets: rockets, Starlink, xAI, and “Musk’s imagination.”

That imagination is doing a lot of work. Investors are not just betting on launches and satellite internet anymore. They are betting that SpaceX could become a full-stack infrastructure company for AI: chips, data centers, satellites, orbital compute, energy, and possibly space-based AI systems.

Video URL: https://youtu.be/j-FWwcnDC9k?si=zN_aRTgZ6weT453B

u/Such-Run-4412 — 4 days ago

▲ 1 r/AIGuild

Codex Can Now Be Controlled From Your Phone

OpenAI added remote connections for Codex, letting developers control Codex from another device, including the ChatGPT mobile app. You can connect your phone to a Codex host, like a Mac, and keep working with the same projects, files, credentials, plugins, browser setup, and local tools.

The main idea is that Codex is becoming a long-running coding worker. From your phone, you can start or continue threads, send follow-up instructions, approve commands, review diffs, check test results, see terminal output, and get notified when Codex needs attention.

OpenAI also supports remote development environments through SSH hosts. That means Codex can work against files and shells on a remote devbox, not just your local machine. OpenAI warns teams to use normal SSH security practices like trusted keys, least-privilege accounts, and avoiding public unauthenticated access.

A useful detail: if you want Codex reachable for longer-running work, OpenAI recommends using an always-on Mac or remote host. If your Mac sleeps, loses network, or closes Codex, remote access stops.

Source: https://developers.openai.com/codex/remote-connections

u/Such-Run-4412 — 4 days ago

▲ 1 r/AIGuild

Anthropic Bought Stainless to Make Claude Agents Better at Using Tools

Anthropic acquired Stainless, a developer tools startup that builds SDKs, CLIs, and MCP server tooling. In simple terms, Stainless helps turn APIs into clean developer libraries and connectors that apps — and AI agents — can actually use.

This matters because the next AI race is not just about smarter models. It is about agents that can reliably connect to real systems: databases, SaaS tools, internal APIs, coding platforms, enterprise apps, and workflows.

Stainless has powered every official Anthropic SDK since the early days of the Claude API, and hundreds of companies use it to generate SDKs across languages like TypeScript, Python, Go, Java, and Kotlin.

The bigger angle is MCP. Anthropic created the Model Context Protocol to help agents connect to outside tools and data. Stainless strengthens that layer by making it easier to build the connectors, SDKs, and interfaces agents need to act beyond a chat window.

Source: https://www.anthropic.com/news/anthropic-acquires-stainless

u/Such-Run-4412 — 4 days ago

▲ 1 r/AIGuild

Cursor Just Made Composer Better at Long Coding Tasks

Cursor released Composer 2.5, a new version of its coding model that is now available inside Cursor.

The main upgrade is sustained work. Composer 2.5 is better at long-running coding tasks, following complex instructions, communicating clearly, and knowing how much effort to spend on a task. Cursor says it improved both intelligence and behavior, not just benchmark scores.

Composer 2.5 is built on the same open-source base checkpoint as Composer 2 (Moonshot’s Kimi K2.5), but Cursor improved it with more difficult training tasks, new reinforcement learning methods, and much more synthetic coding data.

One big training change is targeted RL with textual feedback. In simple terms, instead of only telling the model whether a whole long task succeeded or failed, Cursor can now give feedback at the exact moment where the model made a bad tool call, confusing explanation, or style mistake.

Composer 2.5 was also trained with 25x more synthetic tasks than Composer 2. Cursor says this helped make the model stronger, but also created new reward-hacking issues, like the model finding hidden caches or decompiling bytecode to recover deleted information.

Pricing starts at $0.50 per million input tokens and $2.50 per million output tokens. There is also a faster version at $3 per million input tokens and $15 per million output tokens, which is now the default.
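To see what those rates mean for a real session, here is a small Python cost estimator. The prices come from the post; the tier labels `"standard"` and `"fast"` and the token counts are made up for illustration.

```python
# USD per million tokens as (input, output). Tier names are hypothetical labels.
PRICES = {
    "standard": (0.50, 2.50),
    "fast": (3.00, 15.00),
}


def cost_usd(tier: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one session at the listed per-million-token rates."""
    input_rate, output_rate = PRICES[tier]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Hypothetical long agent session: 2M input tokens, 400k output tokens.
for tier in PRICES:
    print(tier, cost_usd(tier, 2_000_000, 400_000))
```

Both the input and output rates on the faster version are 6x the base rates, so any session costs 6x more there regardless of the input/output mix.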

Source: https://cursor.com/blog/composer-2-5

u/Such-Run-4412 — 4 days ago

▲ 1 r/AIGuild

Microsoft Is Worried GitHub’s AI Coding Lead Is Slipping

Microsoft executives are reportedly warning that GitHub’s early lead in AI coding is eroding fast, as rivals like Cursor and Claude Code gain ground with developers. GitHub Copilot was once the obvious leader, but newer agentic coding tools are now threatening both Copilot and GitHub’s core developer workflow.

The timing is awkward because Microsoft is also pulling back most internal Claude Code licenses and moving its own developers toward GitHub Copilot CLI. Claude Code had become popular inside Microsoft, but the company wants to standardize around its own tool and reduce dependence on a rival coding agent.

The pressure is not just product quality. AI coding agents are changing how developers work: they can write code, open pull requests, review changes, and run longer tasks. That means GitHub has to evolve from a code-hosting platform into a full AI development workspace, or risk developers spending more time in competing tools.

There is also a business-model problem. GitHub is moving Copilot to usage-based billing starting June 1, 2026, which shows how expensive agentic coding is becoming compared with simple autocomplete. As developers run longer AI sessions, GitHub has to balance growth, margins, reliability, and pricing.

Source: https://www.theinformation.com/articles/microsoft-executives-sound-alarm-githubs-eroding-ai-lead?rc=mf8uqd

u/Such-Run-4412 — 4 days ago

▲ 7 r/AIGuild

Mistral CEO Says Europe Has 2 Years to Avoid Becoming America’s AI “Vassal State”

Mistral CEO Arthur Mensch warned that Europe has about two years to build its own AI infrastructure before it becomes permanently dependent on U.S. tech giants. He made the comments during a French National Assembly hearing on digital sovereignty and AI.

His main point: the AI race is no longer just about who has the best model. It is about who controls the chips, data centers, energy, cloud platforms, and deployment pipelines behind those models. If Europe keeps importing all of that from American companies, it loses leverage.

Mensch argued that Europe needs its own sovereign AI stack, including models, GPU compute, cloud capacity, and energy infrastructure. Mistral is trying to position itself as Europe’s AI champion, but even Mensch admitted its goal of reaching a gigawatt of AI compute by 2029 is still not enough on its own.

He also criticized Europe’s fragmented capital markets and regulation, saying they make it harder for startups to scale fast enough against U.S. giants. That is a major problem when OpenAI, Anthropic, Google, Microsoft, Amazon, Meta, and xAI are all locking up massive compute deals.

Source: https://www.businessinsider.com/mistral-ceo-warns-europe-2-years-avoid-us-ai-dependence-2026-5

u/Such-Run-4412 — 4 days ago

▲ 12 r/AIGuild

Musk Just Lost His OpenAI Lawsuit

Elon Musk lost his lawsuit against OpenAI, Sam Altman, Greg Brockman, and Microsoft after a California jury ruled that he brought his claims too late. The jury reportedly took less than two hours to decide, and Judge Yvonne Gonzalez Rogers accepted the finding.

Musk had accused OpenAI of abandoning its original nonprofit mission, shifting toward profit, and enriching itself through its Microsoft partnership. He was seeking major damages and leadership changes, and wanted OpenAI pushed back toward its charitable mission.

The jury never decided whether OpenAI betrayed its mission. The case ended on timing. OpenAI argued Musk already knew about the company’s for-profit direction years earlier, meaning the statute of limitations had expired before he filed.

This is a major win for OpenAI because it removes one of the biggest legal threats hanging over the company as it prepares for a possible IPO. Reuters says the ruling clears a path for OpenAI to move forward with public-market plans that could value it around $1 trillion.

Musk’s lawyer said he plans to appeal, so the fight may not be fully over. But for now, OpenAI and Altman scored a huge courtroom victory.

Source: https://www.theinformation.com/articles/musk-loses-openai-lawsuit-victory-altman?rc=mf8uqd

u/Such-Run-4412 — 4 days ago

▲ 1 r/AIGuild

Microsoft’s AI Chief Says White-Collar Automation Is Coming Fast

Microsoft AI CEO Mustafa Suleyman says AI could automate most computer-based professional tasks within the next 12 to 18 months. He specifically pointed to work in accounting, law, marketing, project management, and software engineering as areas where AI could reach human-level performance very soon.

The bigger point is not that every office worker disappears overnight. It is that many tasks done by people sitting at a computer — writing, analyzing, coding, planning, reporting, organizing, reviewing documents — could become automatable much faster than companies and workers expect.

Suleyman is also pushing Microsoft toward more AI self-sufficiency. Microsoft still works closely with OpenAI, but it is building its own foundation models, agents, and infrastructure so it is not fully dependent on one partner.

This fits the broader AI shift: companies are no longer just adding AI assistants. They are trying to build AI systems that can operate inside real workflows, handle repetitive office tasks, and eventually act like specialized digital workers.

Source: https://fortune.com/article/why-microsoft-ai-chief-mustafa-suleyman-predicts-ai-automation-18-months/

u/Such-Run-4412 — 4 days ago