LM1B
PulseAugur coverage of LM1B — every cluster mentioning LM1B across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New 7B Uniform Diffusion Language Model 'Sumi' Released, Alongside Diffusion Model Advancements
Researchers have introduced Sumi, a 7-billion parameter uniform diffusion language model (UDLM) pretrained from scratch on 1.5 trillion tokens. This open-source model demonstrates competitive performance against autoreg…
-
K-Forcing accelerates LLM inference by decoding multiple tokens at once
Researchers have introduced K-Forcing, a new paradigm for accelerating language model inference by decoding multiple tokens simultaneously. This push-forward approach distills an existing autoregressive model into a map…
-
AI text evaluation methods criticized in new research papers
Two new research papers highlight significant issues with current methods for evaluating AI-generated text. One paper reveals widespread under-reporting of human evaluation protocols in NLP conferences, hindering reprod…
-
New LLM training methods boost efficiency and error recovery
Researchers have developed new techniques for improving the efficiency of training large language models (LLMs). One method, Step Rejection Fine-Tuning (SRFT), leverages unsuccessful training trajectories by assessing t…