PulseAugur

DeepSeek V3.1 model outperforms Claude 4 Sonnet at a fraction of the cost

DeepSeek has released its V3.1 model, produced by continued pre-training on 840 billion tokens. The model reportedly outperforms Anthropic's Claude 4 Sonnet at approximately 11% of Sonnet's cost. The release highlights progress in efficient large language model training and performance.

Ranking rationale: Frontier-lab model release with system card.

Read on Smol AINews →

AI-generated summary · Google Gemini · From 1 source. How we write summaries →

Sources [1]

  1. Smol AINews · Tier 1 · English (EN)

    DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

    **DeepSeek** released **DeepSeek V3.1**, a quietly rolled out open model with a **128K context window** and improvements in **token efficiency**, coding, and agentic benchmarks. **ByteDance** launched the permissive **Seed-OSS 36B** model on Hugging Face, noted for long-context …