PulseAugur
EN
LIVE 10:47:56

NVIDIA releases Nemotron 3 Ultra, a 550B parameter open-weights model

NVIDIA has released Nemotron 3 Ultra, a 550-billion-parameter open-weights model that sets a new benchmark for US-based releases. This hybrid Mamba-Transformer mixture-of-experts model features a 1M-token context window and is optimized for agent harnesses. While it achieves a high score on the Artificial Analysis Intelligence Index, it trails behind some Chinese and closed-source models in raw capability but excels in speed, processing over 300 tokens per second. AI

IMPACT Sets a new high-water mark for US open-weights models, particularly in speed, potentially influencing agent development.

RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 English(EN) · Creeta ·

    Nemotron 3 Ultra went live June 4. Here's the call that works.

    <p>NVIDIA shipped Nemotron 3 Ultra on June 4, 2026 — its largest open-weights model and the new high-water mark for US open releases. Before you wire it into an agent harness, here is exactly what landed and where it sits on the leaderboard.</p> <h2> What NVIDIA Launched on June …