PulseAugur
EN
LIVE 07:24:01
ENTITY Language Modeling

Language Modeling

PulseAugur coverage of Language Modeling — every cluster mentioning Language Modeling across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
4
4 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
4
4 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL
  1. RESEARCH · CL_115231 ·

    Flexformer introduces learnable attention kernels for efficient Transformers

    Researchers have introduced Flexformer, a novel linear Transformer architecture designed to overcome the quadratic complexity limitations of traditional Transformers. Flexformer achieves this by learning attention kerne…

  2. RESEARCH · CL_95870 ·

    Researchers analyze transformer expressivity using formal grammars

    A new research paper analyzes the expressivity of deep transformer models by examining their ability to represent hierarchical structures. The study uses bounded-depth, non-recursive context-free grammars to construct t…

  3. RESEARCH · CL_70263 ·

    Transformer study finds QKV projection sharing slashes memory use

    Researchers have investigated the necessity of three distinct projections (query, key, and value) in Transformer models. Their study found that sharing projections, particularly the Q-K=V variant, can significantly redu…

  4. RESEARCH · CL_06711 ·

    Switch Attention dynamically routes between full and sliding window attention

    Researchers have introduced Switch Attention (SwiAttn), a novel hybrid transformer architecture designed to address the computational bottleneck of standard full attention mechanisms in long-context language modeling. S…