ENTITY FlashAttention-2

FlashAttention-2

PulseAugur coverage of FlashAttention-2 — every cluster mentioning FlashAttention-2 across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

8 over 90d

Releases · 30d

0 over 90d

Papers · 30d

4 over 90d

TIER MIX · 90D

significant 1
research 4
tool 3

TOPICS

SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL

TOOL · CL_113992 · Jun 27 · 16:44

Picotron framework enables LLM training on older GPUs

A developer has created Picotron, an LLM training framework designed to run on older GPUs without crashing. This framework eliminates mandatory GPU-specific dependencies, allowing it to function on any GPU supporting Py…
RESEARCH · CL_109474 · Jun 24 · 00:00

New Causal-rCM recipe accelerates autoregressive video diffusion

Researchers have introduced Causal-rCM, a novel open recipe for autoregressive video diffusion distillation. This framework unifies teacher-forcing and self-forcing paradigms to enhance streaming video generation and in…
SIGNIFICANT · CL_94984 · Jun 16 · 15:04

Subquadratic AI unveils SubQ 1.1 Small with 12M token context

Subquadratic AI has released its new model, SubQ 1.1 Small, which utilizes Smart Sparse Attention to achieve near-perfect long-context retrieval up to 12 million tokens. This model significantly reduces computational re…
SIGNIFICANT · CL_95036 · Jun 16 · 14:50

SubQ unveils SubQ 1.1 Small with 12M-token context and sparse attention

SubQ has released its SubQ 1.1 Small model, featuring a new Subquadratic Sparse Attention (SSA) architecture designed to overcome the quadratic scaling limitations of traditional attention mechanisms. This new architect…
SIGNIFICANT · CL_65070 · Jun 1 · 03:04

ByteDance releases Bernini open-source video generation framework

ByteDance has released Bernini, an open-source framework for video generation and editing. The system combines a multimodal large language model for semantic planning with a DiT-based renderer. Bernini reportedly achiev…
RESEARCH · CL_43418 · May 22 · 05:38

Stanford's ThunderKittens DSL optimizes AI kernel performance

A new article details ThunderKittens, a compact domain-specific language (DSL) developed at Stanford's Hazy Research Lab for creating high-performance AI kernels. The DSL aims to strike a balance between research produc…
RESEARCH · CL_11887 · May 1 · 04:00

Sigmoid attention improves biological foundation models with faster, stable training

Researchers have developed a new attention mechanism called Sigmoid Attention, which offers significant improvements for training biological foundation models. This novel approach leads to better learned representations…
RESEARCH · CL_00277 · Mar 7 · 20:00

Google AI optimizes cloud computing with LAVA, Together AI expands GPU cloud, and Modal streamlines AI/ML deployment

Google DeepMind researchers have developed LAVA, a new AI-driven scheduling algorithm designed to optimize resource allocation in cloud data centers. LAVA continuously re-predicts virtual machine (VM) lifetimes, adaptin…

Picotron framework enables LLM training on older GPUs

New Causal-rCM recipe accelerates autoregressive video diffusion

Subquadratic AI unveils SubQ 1.1 Small with 12M token context

SubQ unveils SubQ 1.1 Small with 12M-token context and sparse attention

ByteDance releases Bernini open-source video generation framework

Stanford's ThunderKittens DSL optimizes AI kernel performance

Sigmoid attention improves biological foundation models with faster, stable training

Google AI optimizes cloud computing with LAVA, Together AI expands GPU cloud, and Modal streamlines AI/ML deployment