Alibaba's Qwen 3.7 Max has demonstrated impressive autonomous capabilities by completing a 35-hour task. During this extended run, the model successfully executed 1,158 tool calls. This sustained performance has garnered positive attention from overseas developers. AI
IMPACT Demonstrates advanced long-task autonomy and tool use, potentially enabling more complex AI agent applications.
RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=2 ai=1.0]
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →