PulseAugur

NVIDIA releases Nemotron-Labs-TwoTower for faster text generation · 2 sources tracked

NVIDIA has introduced Nemotron-Labs-TwoTower, an open-weight diffusion language model built on a frozen autoregressive Nemotron-3-Nano-30B-A3B backbone and designed to raise text generation throughput. The model splits context representation and token denoising between two distinct "towers." According to the tracked coverage, it retains 98.7% of baseline autoregressive quality while generating text 2.42 times faster.

IMPACT This novel architecture could significantly accelerate inference for large language models, potentially lowering costs and enabling new real-time applications.

RANK_REASON Frontier-lab model release with system card

Read on MarkTechPost →

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

  1. MarkTechPost TIER_1 English(EN) · Asif Razzaq ·

    NVIDIA Releases Nemotron-Labs-TwoTower: an Open-Weight Diffusion Language Model Built on a Frozen Autoregressive Nemotron-3-Nano-30B-A3B Backbone

    NVIDIA has released Nemotron-Labs-TwoTower, a diffusion language model built on a pretrained autoregressive backbone. It ships as open weights under the NVIDIA Nemotron Open Model License. The release targets a throughput bottleneck in text generation. Autoregressive (AR) mode…

  2. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    NVIDIA has released TwoTower, an open-weight diffusion language model built on a frozen autoregressive backbone. It retains 98.7% of baseline quality while achieving 2.42x faster text generation. https://www.marktechpost.com/2026/07/01/nvidia-releases-nemotron-labs-twotower/ # …