Google DeepMind has released Gemini 3 Deep Think V2, a new reasoning mode for Google AI Ultra subscribers and available via API early access. This model achieves new state-of-the-art results on benchmarks like ARC-AGI-2 with 84.6% accuracy and demonstrates strong performance on Humanity's Last Exam and competitive coding. The model is also highlighted for its efficiency, being 82% cheaper per task, and its practical applications in scientific and engineering workflows, including error detection in papers and 3D modeling. AI
排序理由 Frontier-lab model release with new benchmark results and efficiency claims.
- Anthropic
- ARC-AGI-2
- Claude Code
- Codeforces
- François Chollet
- Gemini 3 Deep Think V2
- Gemini API
- Gemini app
- Google AI Ultra
- Google DeepMind
- GPT-5.3-Codex-Spark
- Humanity's Last Exam
- Jeff Dean
- MiniMax
- OpenAI
- Vertex AI
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →