NVIDIA has released Nemotron 3 Ultra, a 550-billion-parameter open-weights model that sets a new benchmark for US-based releases. This hybrid Mamba-Transformer mixture-of-experts model features a 1M-token context window and is optimized for agent harnesses. While it achieves a high score on the Artificial Analysis Intelligence Index, it trails behind some Chinese and closed-source models in raw capability but excels in speed, processing over 300 tokens per second. AI
IMPACT Sets a new high-water mark for US open-weights models, particularly in speed, potentially influencing agent development.
RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=1 ai=1.0]
- Anthropic
- Artificial Analysis Intelligence Index
- DeepSeek
- Hermes Agent
- Kimi K2.6
- LangChain Deep Agents
- Moonshot
- Nemotron 3 Ultra
- NVIDIA
- OpenCode
- OpenHands
- Opus 4.8
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →