NVIDIA has open-sourced parts of its cuDNN library, a significant move after 12 years as closed-source software. The release includes more than 20 Mixture-of-Experts (MoE) kernels as well as NSA sparse attention kernels. Most of the kernel code is written in Python using CuTe-DSL, and public documentation is now available.
AI IMPACT Open-sourcing cuDNN kernels could accelerate research and development in AI infrastructure and model optimization.
RANK_REASON Open-sourcing of a significant software library component by a major tech company.
AI-generated summary · Google Gemini · from 3 sources.