NVIDIA releases Nemotron 3 Ultra, a 550B-parameter open-weights model

By PulseAugur Editorial · [1 source] · 2026-06-18 07:57

NVIDIA has released Nemotron 3 Ultra, a 550-billion-parameter open-weights model that sets a new high-water mark for US open releases. The hybrid Mamba-Transformer mixture-of-experts model has a 1M-token context window and is optimized for agent harnesses. It scores highly on the Artificial Analysis Intelligence Index but trails some Chinese and closed-source models in raw capability; its main strength is speed, at more than 300 tokens per second.

IMPACT Sets a new high-water mark for US open-weights models, particularly in speed, potentially influencing agent development.

RANK_REASON Frontier-lab model release with system card.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

dev.to — LLM tag TIER_1 English(EN) · Creeta · 2026-06-18 07:57

Nemotron 3 Ultra went live June 4. Here's the call that works.

NVIDIA shipped Nemotron 3 Ultra on June 4, 2026. It is NVIDIA's largest open-weights model and the new high-water mark for US open releases. Before you wire it into an agent harness, here is exactly what landed and where it sits on the leaderboard.

What NVIDIA Launched on June …
