ENTITY
Billion Parameter Pretrained Transformers
Billion Parameter Pretrained Transformers
PulseAugur coverage of Billion Parameter Pretrained Transformers — every cluster mentioning Billion Parameter Pretrained Transformers across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
2 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New framework enables linear merging of billion-parameter transformers
Researchers have developed a new framework for merging large pretrained transformers, specifically those with billions of parameters. This method addresses limitations of previous approaches by optimizing interpolation …
-
New research explores merging large transformers and improving looped model stability
Two new research papers explore novel techniques for enhancing the capabilities and stability of large transformer models. The first paper introduces a scalable framework for linear mode connectivity (LMC) that allows f…