ENTITY
Gaussian Mixture Attention
PulseAugur coverage of Gaussian Mixture Attention: every cluster that mentions it across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
SENTIMENT · 30D
1 day with sentiment data
RECENT · 2 TOTAL
- Gaussian Mixture Attention enhances LLM long-term context
Researchers have developed a new method called Gaussian Mixture Attention (GMA) to enhance long-term context understanding in large language models. This approach utilizes probabilistic attention mechanisms, moving away…
- Gaussian Mixture Attention offers linear-time sequence mixing
Researchers have introduced Gaussian Mixture Attention (GMA), a novel sequence mixing technique designed to overcome the quadratic scaling bottleneck of standard Transformer attention. GMA replaces explicit token-to-tok…