StepFun has released Step 3.7 Flash, a 198-billion-parameter Mixture-of-Experts vision-language model designed for coding agents and search workflows. The model features native multimodal understanding, improved tool-use reliability, and selectable reasoning depths that trade speed against compute. It shows significant gains on coding benchmarks such as SWE-Bench Pro and offers an "Advisor Mode" that approaches Claude Opus 4.6 performance at a fraction of the cost.
AI IMPACT Sets a new benchmark for multimodal agentic coding performance and cost-efficiency, potentially influencing future agent development.
RANK_REASON New model release from a frontier lab (StepFun) with detailed technical specifications and benchmark results.
- Claude Opus 4.6
- Gemini 3 Flash
- GLM 5V Turbo
- GPT 5.5
- Kimi K2.6
- Mixture-of-Experts
- SimpleVQA
- Step 3.5 Flash
- Step 3.7 Flash
- StepFun
- SWE-Bench Pro
AI-generated summary · Google Gemini · from 6 sources.