ENTITY
Kimi Delta Attention
Kimi Delta Attention
PulseAugur coverage of Kimi Delta Attention — every cluster mentioning Kimi Delta Attention across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
FG^2-GDN enhances long-context understanding with adaptive learning rates
Researchers have introduced FG$^2$-GDN, a novel approach to enhance long-context understanding in neural networks. This method improves upon existing Gated Delta Networks by replacing a scalar learning rate with a chann…
-
Moonshot AI open-sources FlashKDA, boosting Kimi Delta Attention 2.5x on H200 GPUs
Moonshot AI has released FlashKDA, an open-source implementation of Kimi Delta Attention. This new kernel achieves up to 2.5 times faster inference speeds on NVIDIA H200 GPUs. It is built using CUTLASS and optimized for…