DeepSeek has released its R2 model, a 32 billion parameter dense transformer. This new model achieves 92.7% accuracy on the AIME 2025 benchmark and can operate on a single RTX 4090 graphics card. The R2 model is also significantly more cost-effective, costing approximately 70% less than GPT-5 for reasoning tasks, and is available under an MIT license for self-hosting. AI
影响 Offers a cost-effective, high-performance alternative for reasoning tasks, potentially impacting enterprise adoption and research.
排序理由 New model release from a significant AI lab with strong benchmark performance.
在 Mastodon — fosstodon.org 阅读 →
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →