FlashAttention-2
PulseAugur coverage of FlashAttention-2 — every cluster mentioning FlashAttention-2 across labs, papers, and developer communities, ranked by signal.
1 day with sentiment data
-
Stanford's ThunderKittens DSL optimizes AI kernel performance
A new article details ThunderKittens, a compact domain-specific language (DSL) developed at Stanford's Hazy Research Lab for creating high-performance AI kernels. The DSL aims to strike a balance between research produc…
-
Sigmoid attention improves biological foundation models with faster, stable training
Researchers have developed Sigmoid Attention, an attention mechanism that makes training biological foundation models faster and more stable. The approach leads to better learned representations…
-
Google AI optimizes cloud computing with LAVA, Together AI expands GPU cloud, and Modal streamlines AI/ML deployment
Google DeepMind researchers have developed LAVA, a new AI-driven scheduling algorithm designed to optimize resource allocation in cloud data centers. LAVA continuously re-predicts virtual machine (VM) lifetimes, adaptin…