DeepSeek has released its V3.1 model, which has undergone continued pre-training on 840 billion tokens. This new model reportedly outperforms Anthropic's Claude 4 Sonnet while operating at a significantly lower cost, approximately 11% of Sonnet's price. The release highlights advancements in efficient large language model training and performance. AI
排序理由 Frontier-lab model release with system card.
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →