English(EN) Gemini 3.5 Flash beat 3.1 Pro on coding and agents

Google 的 Gemini 3.5 Flash 在编码和代理任务上超越 3.1 Pro

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-23 02:32

Google 的 Gemini 3.5 Flash 模型在多项关键基准测试中超越了其前身 Gemini 3.1 Pro，尤其是在编码和代理任务方面。这一新层级相比 3.1 Pro 提供了显著的成本降低 40%，并且输出生成速度大约快四倍。虽然 Gemini 3.5 Flash 在工具使用和代理性能方面表现出色，但 Gemini 3.1 Pro 在纯粹推理和新颖问题解决基准测试中仍保持优势。 AI

影响加速在代理任务中采用更便宜、更快的模型，可能降低 AI 驱动应用程序的成本。

排序理由来自前沿实验室（Google）的新模型发布，并与先前模型进行了性能基准测试。[lever_c_demoted from frontier_release: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

Google 的 Gemini 3.5 Flash 在编码和代理任务上超越 3.1 Pro

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · Thousand Miles AI · 2026-05-23 02:32

Gemini 3.5 Flash 在编码和代理方面击败 3.1 Pro

<p>Gemini 3.5 Flash scored 76.2% on Terminal-Bench 2.1. Gemini 3.1 Pro — the tier above it in Google's own lineup — scored 70.3%.</p> <p>Google shipped Flash at I/O 2026 on May 19. It costs $2.50 per million input tokens and $15 per million output, which is 40% cheaper than 3.1 P…

报道来源 [1]

Gemini 3.5 Flash 在编码和代理方面击败 3.1 Pro

相关实体

相关话题