NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.

NVIDIA releases Nemotron-TwoTower diffusion language model

By the PulseAugur editorial team · [1 source] · 2026-06-25 08:34

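NVIDIA has introduced Nemotron-TwoTower-30B-A3B-Base-BF16, a novel diffusion-based language model. Instead of generating text one token at a time, the model uses a diffusion denoiser tower to process blocks of tokens simultaneously. NVIDIA reports that this approach significantly increases generation speed while keeping quality nearly on par with its autoregressive model.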
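To make the speed argument concrete, here is a minimal Python sketch that counts sequential model passes for token-by-token decoding versus block-wise denoising. It is an illustration only: the function names, the block size, and the number of denoising steps are hypothetical and are not taken from NVIDIA's release, and the count of sequential passes is only a rough proxy for wall-clock speed.

```python
import math


def autoregressive_passes(num_tokens: int) -> int:
    """Token-by-token decoding: one sequential model pass per generated token."""
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    return num_tokens


def block_parallel_passes(num_tokens: int, block_size: int, denoise_steps: int) -> int:
    """Block-wise denoising: each block of tokens is refined together.

    Assumption (not from the source): every block needs a fixed number of
    denoising steps, and blocks are produced one after another.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    if block_size <= 0 or denoise_steps <= 0:
        raise ValueError("block_size and denoise_steps must be positive")
    num_blocks = math.ceil(num_tokens / block_size)
    return num_blocks * denoise_steps


if __name__ == "__main__":
    # Hypothetical settings chosen only to show the arithmetic.
    tokens = 256
    ar = autoregressive_passes(tokens)
    bp = block_parallel_passes(tokens, block_size=32, denoise_steps=8)
    print(f"autoregressive passes: {ar}")
    print(f"block-parallel passes: {bp}")
    print(f"ratio: {ar / bp:.1f}x")
```

Under these made-up settings the block-wise scheme needs fewer sequential passes whenever the number of denoising steps per block is smaller than the block size; whether that translates into a real speedup depends on the cost of each pass, which the source does not describe.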

Impact: This novel diffusion-based approach could increase LLM generation speed while maintaining high quality.

Ranking rationale: NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built on the Nemotron 3 Nano 30B-A3B backbone.


Model release

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

r/LocalLLaMA · /u/nikhilprasanth · 2026-06-25 08:34

NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.

https://www.reddit.com/r/LocalLLaMA/comments/1uf4azy/nvidia_has_released/

