NVIDIA has released Nemotron 3.5 Content Safety, a multimodal safety model for enterprises that supports customizable policies and global compliance. ServiceNow-AI launched EVA-Bench Data 2.0, an expanded evaluation benchmark for AI agents covering tool use, reasoning, and error recovery. JetBrains introduced Mellum2, a 12B parameter MoE model optimized for software development tasks, and Dharma-AI published research extending Direct Preference Optimization beyond chatbots to areas like code generation and creative writing. AI
IMPACT New models and benchmarks are released, advancing AI safety, agent capabilities, and software development tools.
RANK_REASON Cluster contains multiple research papers and model releases from various organizations. [lever_c_demoted from research: ic=1 ai=1.0]
- Dharma-AI
- Direct Preference Optimization
- EVA-Bench Data 2.0
- Hcompany
- Holo3.1
- JetBrains
- Mellum2
- Nemotron 3.5 Content Safety
- NVIDIA
- ServiceNow-AI
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →