Alibaba's Qwen3.7-Max launches with enhanced agentic and reasoning skills

By PulseAugur Editorial · [13 sources] · 2026-04-18 02:00

Alibaba's Qwen has released Qwen3.7-Max, a new flagship model designed for the Agent Era. This model demonstrates significant improvements in scientific reasoning, coding, and agentic capabilities, achieving a score of 56.6 on the Artificial Analysis Intelligence Index. Qwen3.7-Max also showcases enhanced performance in autonomous execution and generalization across various benchmarks, with features like implicit caching now live. AI

IMPACT Sets a new benchmark for agentic capabilities and reasoning, potentially accelerating the development of autonomous AI systems.

RANK_REASON Frontier-lab model release with system card and benchmark results.

Read on Qwen tech blog →

AI-generated summary · Google Gemini · from 13 sources. How we write summaries →

Alibaba's Qwen3.7-Max launches with enhanced agentic and reasoning skills

COVERAGE [13]

X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-27 01:23

🚀🚀 Qwen3.7-Max just hit #4 on Code Arena, on par with Claude Opus 4.6 ，top-ranked Chinese lab on the board! @arena

🚀🚀 Qwen3.7-Max just hit #4 on Code Arena, on par with Claude Opus 4.6 ，top-ranked Chinese lab on the board! @arena More to ship. Stay tuned. 🕶️
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-25 15:26

✅Implicit caching is now live on Qwen3.7-Max — kicks in automatically, no setup needed.

✅Implicit caching is now live on Qwen3.7-Max — kicks in automatically, no setup needed. ⚡️Faster + cheaper out of the box. Need higher, more deterministic hit rates? Try explicit caching instead. 🙌 🔗Best practices 🔗 ：https://t.co/3hSs6zquBH
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:20

🚀Qwen3.7-Max just landed at 56.6 on the Artificial Analysis Intelligence Index — a solid 4.8pt jump over Qwen3.6-Max-Preview. @ArtificialAnlys

🚀Qwen3.7-Max just landed at 56.6 on the Artificial Analysis Intelligence Index — a solid 4.8pt jump over Qwen3.6-Max-Preview. @ArtificialAnlys ⚡️Sharper sci reasoning, stronger agentic chops, better coding, and it hallucinates less.
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

Cowork Productivity Assistant：Qwen3.7-Max serves as your advanced coworker for real-world productivity. https://t.co/zFOjvNJAhT

Cowork Productivity Assistant：Qwen3.7-Max serves as your advanced coworker for real-world productivity. https://t.co/zFOjvNJAhT
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

Self-Evolving in the Wild：Over the course of ~35 hours of continuous autonomous execution, the model performed 432 kernel evaluations across 1,158 tool calls. I

Self-Evolving in the Wild：Over the course of ~35 hours of continuous autonomous execution, the model performed 432 kernel evaluations across 1,158 tool calls. It wrote, compiled, profiled, and iteratively improved the Extend Attention Kernel entirely on its own — 10.0x geometric …
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

Cross-Harness Generalization：Across QwenClawBench and CoWorkBench, Qwen3.7-Max delivers strong, consistent performance regardless of the harness used at evaluat

Cross-Harness Generalization：Across QwenClawBench and CoWorkBench, Qwen3.7-Max delivers strong, consistent performance regardless of the harness used at evaluation time, confirming that the model has learned to solve tasks — not to exploit particular harnesses. https://t.co/aSZaO…
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

Agent Scaling：Building on Qwen3.5's environment scaling approach, we've aggressively expanded the quality and diversity of agentic training environments in Qwen

Agent Scaling：Building on Qwen3.5's environment scaling approach, we've aggressively expanded the quality and diversity of agentic training environments in Qwen3.7 — agentic capabilities generalize from diverse environments, just as language models do from diverse text. The https…
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

Performance：Qwen3.7-Max performs strongly across benchmarks in coding agents , and improves massively in general-purpose agents. Qwen3.7-Max also demonstrates e

Performance：Qwen3.7-Max performs strongly across benchmarks in coding agents , and improves massively in general-purpose agents. Qwen3.7-Max also demonstrates exceptional strength on the hardest reasoning benchmarks, and stands out in general capabilities and multilingualism. htt…
X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen · 2026-05-21 13:15

📣Meet Qwen3.7-Max — our latest flagship, made for the Agent Era.

📣Meet Qwen3.7-Max — our latest flagship, made for the Agent Era. A versatile foundation for agents that actually get things done: 🧑‍💻 Coding agent, end to end. Frontend prototypes, multi-file refactors, real debugging — nails it. 🗂️ A reliable office and productivity assistant. …
Qwen tech blog TIER_1 English(EN) · QwenTeam · 2026-04-30 04:00

Qwen-Scope: Decoding Intelligence, Unleashing Potential

Interpretability research has emerged as a critical area for understanding LLM behaviors, informing performance optimization, and enabling more controllable model outputs. Today, we are excited to introduce Qwen-Scope, an interpretability toolkit trained on the Qwen3 and Qwen3.5 …
Qwen tech blog TIER_1 English(EN) · QwenTeam · 2026-04-18 02:00

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

Following the release of Qwen3.6-Plus, we are sharing an early preview of our next proprietary model: Qwen3.6-Max-Preview. Compared to Qwen3.6-Plus, this preview release brings stronger world knowledge and instruction following, along with significant agentic coding improvements …
Mastodon — sigmoid.social TIER_1 English(EN) · [email protected] · 2026-05-24 14:02

🧠 Qwen3.7-Max exposes a thousand-tool-call agent runtime 35-hour kernel optimization run, 1,000+ tool calls, 1M context, Qwen Studio/API access. Take: evaluate

🧠 Qwen3.7-Max exposes a thousand-tool-call agent runtime 35-hour kernel optimization run, 1,000+ tool calls, 1M context, Qwen Studio/API access. Take: evaluate limits before production routing. 🧠 Claude Compliance API telemetry reaches security tools Anthropic pushes Enterprise a…
r/LocalLLaMA TIER_1 English(EN) · /u/Yes-Scale-9723 · 2026-05-27 18:32

Qwen3.6 huge quality gain from Q4 to Q6 for coding agent

<div class="md"><p>So, last week I tried to update my unused local LLM setup. I had to stop using it because quality was too low and deepseek was too cheap.</p> <p>First thing I stopped using Ollama and now I only use llama.cpp built in server that works really gre…

COVERAGE [13]

RELATED ENTITIES

RELATED TOPICS