PulseAugur
EN
LIVE 21:40:29

Alibaba's Qwen3.7-Max launches with enhanced agentic and reasoning skills

Alibaba's Qwen has released Qwen3.7-Max, a new flagship model designed for the Agent Era. This model demonstrates significant improvements in scientific reasoning, coding, and agentic capabilities, achieving a score of 56.6 on the Artificial Analysis Intelligence Index. Qwen3.7-Max also showcases enhanced performance in autonomous execution and generalization across various benchmarks, with features like implicit caching now live. AI

IMPACT Sets a new benchmark for agentic capabilities and reasoning, potentially accelerating the development of autonomous AI systems.

RANK_REASON Frontier-lab model release with system card and benchmark results.

Read on X — Qwen (Alibaba) →

AI-generated summary · Google Gemini · from 11 sources. How we write summaries →

Alibaba's Qwen3.7-Max launches with enhanced agentic and reasoning skills

COVERAGE [11]

  1. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    ✅Implicit caching is now live on Qwen3.7-Max — kicks in automatically, no setup needed.

    ✅Implicit caching is now live on Qwen3.7-Max — kicks in automatically, no setup needed. ⚡️Faster + cheaper out of the box. Need higher, more deterministic hit rates? Try explicit caching instead. 🙌 🔗Best practices 🔗 :https://t.co/3hSs6zquBH

  2. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    🚀Qwen3.7-Max just landed at 56.6 on the Artificial Analysis Intelligence Index — a solid 4.8pt jump over Qwen3.6-Max-Preview. @ArtificialAnlys

    🚀Qwen3.7-Max just landed at 56.6 on the Artificial Analysis Intelligence Index — a solid 4.8pt jump over Qwen3.6-Max-Preview. @ArtificialAnlys ⚡️Sharper sci reasoning, stronger agentic chops, better coding, and it hallucinates less.

  3. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    Cowork Productivity Assistant:Qwen3.7-Max serves as your advanced coworker for real-world productivity. https://t.co/zFOjvNJAhT

    Cowork Productivity Assistant:Qwen3.7-Max serves as your advanced coworker for real-world productivity. https://t.co/zFOjvNJAhT

  4. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    Self-Evolving in the Wild:Over the course of ~35 hours of continuous autonomous execution, the model performed 432 kernel evaluations across 1,158 tool calls. I

    Self-Evolving in the Wild:Over the course of ~35 hours of continuous autonomous execution, the model performed 432 kernel evaluations across 1,158 tool calls. It wrote, compiled, profiled, and iteratively improved the Extend Attention Kernel entirely on its own — 10.0x geometric …

  5. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    Cross-Harness Generalization:Across QwenClawBench and CoWorkBench, Qwen3.7-Max delivers strong, consistent performance regardless of the harness used at evaluat

    Cross-Harness Generalization:Across QwenClawBench and CoWorkBench, Qwen3.7-Max delivers strong, consistent performance regardless of the harness used at evaluation time, confirming that the model has learned to solve tasks — not to exploit particular harnesses. https://t.co/aSZaO…

  6. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    Agent Scaling:Building on Qwen3.5's environment scaling approach, we've aggressively expanded the quality and diversity of agentic training environments in Qwen

    Agent Scaling:Building on Qwen3.5's environment scaling approach, we've aggressively expanded the quality and diversity of agentic training environments in Qwen3.7 — agentic capabilities generalize from diverse environments, just as language models do from diverse text. The https…

  7. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    Performance:Qwen3.7-Max performs strongly across benchmarks in coding agents , and improves massively in general-purpose agents. Qwen3.7-Max also demonstrates e

    Performance:Qwen3.7-Max performs strongly across benchmarks in coding agents , and improves massively in general-purpose agents. Qwen3.7-Max also demonstrates exceptional strength on the hardest reasoning benchmarks, and stands out in general capabilities and multilingualism. htt…

  8. X — Qwen (Alibaba) TIER_1 English(EN) · Alibaba_Qwen ·

    📣Meet Qwen3.7-Max — our latest flagship, made for the Agent Era.

    📣Meet Qwen3.7-Max — our latest flagship, made for the Agent Era. A versatile foundation for agents that actually get things done: 🧑‍💻 Coding agent, end to end. Frontend prototypes, multi-file refactors, real debugging — nails it. 🗂️ A reliable office and productivity assistant. …

  9. Qwen tech blog TIER_1 English(EN) · QwenTeam ·

    Qwen-Scope: Decoding Intelligence, Unleashing Potential

    Interpretability research has emerged as a critical area for understanding LLM behaviors, informing performance optimization, and enabling more controllable model outputs. Today, we are excited to introduce Qwen-Scope, an interpretability toolkit trained on the Qwen3 and Qwen3.5 …

  10. Qwen tech blog TIER_1 English(EN) · QwenTeam ·

    Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

    Following the release of Qwen3.6-Plus, we are sharing an early preview of our next proprietary model: Qwen3.6-Max-Preview. Compared to Qwen3.6-Plus, this preview release brings stronger world knowledge and instruction following, along with significant agentic coding improvements …

  11. Mastodon — sigmoid.social TIER_1 English(EN) · [email protected] ·

    🧠 Qwen3.7-Max exposes a thousand-tool-call agent runtime 35-hour kernel optimization run, 1,000+ tool calls, 1M context, Qwen Studio/API access. Take: evaluate

    🧠 Qwen3.7-Max exposes a thousand-tool-call agent runtime 35-hour kernel optimization run, 1,000+ tool calls, 1M context, Qwen Studio/API access. Take: evaluate limits before production routing. 🧠 Claude Compliance API telemetry reaches security tools Anthropic pushes Enterprise a…