PulseAugur
实时 23:39:22

DeepSeek V4 models offer high performance with reduced inference costs and NPU support

DeepSeek has released its V4 family of open-weight large language models, featuring a 1.6 trillion parameter model and a smaller 284 billion parameter Flash MoE model. These new models claim to rival top proprietary LLMs in performance while significantly reducing inference costs. Key to this efficiency are architectural innovations like a hybrid attention mechanism and the use of lower precision datatypes (FP8 and FP4), enabling a million-token context window with substantially less memory. AI

影响 Sets new efficiency benchmarks for open-weight models, potentially lowering inference costs and enabling larger context windows for a wider range of applications.

排序理由 Release of new open-weight LLMs from a notable AI lab with claims of rivaling proprietary models and significant efficiency gains.

在 The Register — AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

DeepSeek V4 models offer high performance with reduced inference costs and NPU support

报道来源 [2]

  1. The Register — AI TIER_1 English(EN) · Tobias Mann ·

    DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs

    <h4>Now available in preview, DeepSeek V4 cuts inference costs to a fraction of R1</h4> <p>Chinese AI darling DeepSeek is back with a new open weights large language model that promises performance to rival the best proprietary American LLMs. Perhaps more importantly, it claims t…

  2. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs # AI https://www. theregister.com/2026/04/24/dee pseek_v4/?td

    DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs # AI https://www. theregister.com/2026/04/24/dee pseek_v4/?td=rt-3a