DeepSeek V3.1 model outperforms Claude 4 Sonnet at a fraction of the cost

作者 PulseAugur 编辑部 · [1 个来源] · 2025-08-20 05:44

DeepSeek has released its V3.1 model, which has undergone continued pre-training on 840 billion tokens. This new model reportedly outperforms Anthropic's Claude 4 Sonnet while operating at a significantly lower cost, approximately 11% of Sonnet's price. The release highlights advancements in efficient large language model training and performance. AI

排序理由 Frontier-lab model release with system card.

在 Smol AINews 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Smol AINews TIER_1 English(EN) · 2025-08-20 05:44

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

**DeepSeek** released **DeepSeek V3.1**, a quietly rolled out open model with an **128K context window** and improvements in **token efficiency**, coding, and agentic benchmarks. **ByteDance** launched the permissive **Seed-OSS 36B** model on Hugging Face, noted for long-context …

报道来源 [1]

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

相关实体

相关话题