Alibaba Just Dropped Qwen3.7-Max — A New Flagship Model Built for AI Agents
Alibaba’s Qwen team released Qwen3.7-Max, its latest proprietary flagship model built specifically for the “agent era.”
This is not being pitched as just another chatbot. Qwen3.7-Max is designed for long-running AI agents that can write code, debug, use tools, work across files, automate office tasks, and coordinate multi-step workflows.
The biggest claim is long-horizon autonomy. Qwen says the model ran a 35-hour kernel optimization task with 1,000+ tool calls and no hand-holding. That is the kind of workload AI labs are now using to prove whether a model can actually stay coherent over many steps, not just answer a prompt well.
Coding is the main focus. Qwen3.7-Max is aimed at frontend prototypes, multi-file refactors, real debugging, and end-to-end coding agent work. It is also designed to work across different agent scaffolds, including Claude Code, OpenClaw, Qwen Code, Hermes, or custom stacks.
The benchmark claims are strong. Coverage of the Qwen release says Qwen3.7-Max scored 69.7 on Terminal-Bench 2.0 Terminus-2, 80.4 on SWE-bench Verified, 60.8 on MCP-Mark, and 76.4 on MCP-Atlas. It reportedly beat or came close to major competitors across several agent and coding benchmarks.
The model also showed strong kernel optimization performance. In KernelBench L3, it reportedly reached a 1.98x median speedup over PyTorch reference implementations and produced code faster than torch.compile in 96% of cases.
The really important detail is that Qwen is emphasizing “scaffold-agnostic” performance. A lot of models look good inside their own optimized toolchain, then fall apart when moved into another agent framework. Qwen is trying to show that 3.7-Max learned the actual task-solving behavior, not just tricks for one harness.
There is also a business/productivity angle. Qwen3.7-Max supports MCP integrations and multi-agent orchestration, so it can be used as an office assistant for workflows involving documents, spreadsheets, productivity tools, and enterprise systems.
Developers can use it through Alibaba Model Studio, and it is also available to try in Qwen Studio.
Big takeaway: Qwen is no longer just competing on open model releases or benchmark flexing. Alibaba is clearly aiming at the agent market — coding agents, office agents, MCP workflows, and long-running autonomous tasks.
The frontier model race is turning into an agent race. Qwen3.7-Max is Alibaba’s strongest signal yet that it wants to be a serious player there.
Source: https://qwen.ai/blog?id=qwen3.7