English(EN) DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs

DeepSeek V4 模型提供高性能，同时降低推理成本并支持 NPU

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-24 21:25

DeepSeek 发布了其 V4 系列开源大语言模型，其中包括一个拥有 1.6 万亿参数的模型和一个拥有 2840 亿参数的较小 Flash MoE 模型。这些新模型声称在性能上可与顶级的专有 LLM 相媲美，同时显著降低了推理成本。实现这一效率的关键在于架构创新，例如混合注意力机制和使用较低精度的数据类型（FP8 和 FP4），从而能够以大大减少的内存实现百万级 token 的上下文窗口。 AI

影响为开源模型设定了新的效率基准，有可能降低推理成本并为更广泛的应用实现更大的上下文窗口。

排序理由一家知名 AI 实验室发布了新的开源 LLM，声称可与专有模型相媲美并实现了显著的效率提升。

在 The Register — AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

The Register — AI TIER_1 English(EN) · Tobias Mann · 2026-04-24 21:25

DeepSeek 的新模型效率极高，甚至能在烤面包机上运行……我们的意思是华为的 NPU 上

<h4>Now available in preview, DeepSeek V4 cuts inference costs to a fraction of R1</h4> <p>Chinese AI darling DeepSeek is back with a new open weights large language model that promises performance to rival the best proprietary American LLMs. Perhaps more importantly, it claims t…
Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-04-27 13:47

DeepSeek 新模型效率极高，可在 toaster 上运行……我们的意思是可在华为 NPU 上运行 # AI https://www. theregister.com/2026/04/24/dee pseek_v4/?td

DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs # AI https://www. theregister.com/2026/04/24/dee pseek_v4/?td=rt-3a

报道来源 [2]

DeepSeek 的新模型效率极高，甚至能在烤面包机上运行……我们的意思是华为的 NPU 上

DeepSeek 新模型效率极高，可在 toaster 上运行……我们的意思是可在华为 NPU 上运行 # AI https://www. theregister.com/2026/04/24/dee pseek_v4/?td

相关实体

相关话题