ENTITY CuTe-DSL

CuTe-DSL

PulseAugur coverage of CuTe-DSL — every cluster mentioning CuTe-DSL across labs, papers, and developer communities, ranked by signal.

Total · 30d

3

3 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

0

0 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 3 TOTAL

RESEARCH · CL_104070 · Jun 22 · 17:00

GB200 NVL72 serving costs slashed 2.5x via software upgrades

Software optimizations for the GB200 NVL72 have drastically reduced serving costs by 2.5 times in under 70 days. These improvements, particularly the rewriting of the NVFP4 MoE kernel using CuTe-DSL and leveraging the N…
TOOL · CL_86322 · Jun 11 · 12:00

Modal optimizes FlashAttention-4 for faster LLM inference

Modal has enhanced the FlashAttention-4 kernel to improve inference speed for large language models, particularly for decode-heavy workloads. Their contributions focused on adjusting parallelism strategies, such as shif…
RESEARCH · CL_18472 · May 6 · 04:00

NVIDIA open-sources cuDNN kernels after 12 years, including MoE and sparse attention

NVIDIA has open-sourced parts of its cuDNN library, a significant move after 12 years of it being closed-source. This release includes over 20 Mixture-of-Experts (MoE) kernels and NSA sparse attention kernels. The codeb…