DeepSeek V3.1 model outperforms Claude 4 Sonnet at a fraction of the cost

By PulseAugur Editorial · [1 sources] · 2025-08-20 05:44

DeepSeek has released its V3.1 model, which has undergone continued pre-training on 840 billion tokens. This new model reportedly outperforms Anthropic's Claude 4 Sonnet while operating at a significantly lower cost, approximately 11% of Sonnet's price. The release highlights advancements in efficient large language model training and performance. AI

RANK_REASON Frontier-lab model release with system card.

Read on Smol AINews →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

Smol AINews TIER_1 English(EN) · 2025-08-20 05:44

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

**DeepSeek** released **DeepSeek V3.1**, a quietly rolled out open model with an **128K context window** and improvements in **token efficiency**, coding, and agentic benchmarks. **ByteDance** launched the permissive **Seed-OSS 36B** model on Hugging Face, noted for long-context …

COVERAGE [1]

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

RELATED ENTITIES

RELATED TOPICS