ENTITY
Decoder-Only Transformers
PulseAugur coverage of Decoder-Only Transformers — every cluster mentioning Decoder-Only Transformers across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
Sentiment · 30d: 1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- Transformer models gain absolute position awareness from causal mask and residual stream
  Researchers have identified two key architectural components in decoder-only Transformers that contribute to the model's ability to distinguish absolute position, despite positional encoding methods like RoPE primarily …
- Researchers explore efficient transformers via attention control and algorithmic capture
  Researchers are exploring ways to make transformers more efficient and better understood. One paper introduces Budgeted Attention Allocation, a head-gating mechanism that trades cost against quality. Another study d…