PulseAugur
实时 22:51:35

DeepSeek releases V4 Pro and Flash models with 1M context, runs on Huawei chips

DeepSeek has released its new V4 family of models, including V4 Pro and V4 Flash, which boast a 1 million token context window. These models were trained on 32 trillion tokens and feature a novel hybrid attention system for improved efficiency. Notably, the V4 Pro is designed for complex tasks, while V4 Flash offers a faster alternative, and both are released under an MIT license, with compatibility for Huawei Ascend chips. AI

影响 Advances open-weight long-context and agentic coding performance, potentially challenging closed frontier models and enabling more complex AI applications.

排序理由 Release of a new major model family from a significant AI lab with advanced capabilities like 1M context window and novel attention mechanisms.

在 Latent Space (swyx) 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

DeepSeek releases V4 Pro and Flash models with 1M context, runs on Huawei chips

报道来源 [2]

  1. Latent Space (swyx) TIER_1 English(EN) ·

    [AINews] DeepSeek V4 Pro (1.6T-A49B) and Flash (284B-A13B), Base and Instruct — runnable on Huawei Ascend chips

    The prodigal Tiger returns... but is no longer the benchmarks leader.

  2. Mastodon — mastodon.social TIER_1 العربية(AR) · bidjadtech ·

    DeepSeek released its new models DeepSeek V4 Pro and DeepSeek V4 Flash, featuring a context window of up to one million tokens, enabling longer, more coherent conversations at a lower cost.

    أصدرت DeepSeek نماذجها الجديدة DeepSeek V4 Pro و DeepSeek V4 Flash، والتي تتميز بنافذة سياق تصل إلى مليون رمز، مما يسمح بمحادثات أطول وأكثر تماسكاً بتكلفة أقل. تظل النماذج مفتوحة المصدر، مع توجيه V4 Pro للمهام المعقدة و V4 Flash كخيار أسرع. يأتي هذا الإصدار وسط تدقيق متزايد بسبب …