Dapo
PulseAugur coverage of Dapo — every cluster mentioning Dapo across labs, papers, and developer communities, ranked by signal.
3 天有情绪数据
-
Anyscale launches skill to automate LLM post-training runs
Anyscale has introduced a new Anyscale Agent Skill designed to simplify and automate the process of generating LLM post-training runs. This skill assists users in selecting the most appropriate post-training method, suc…
-
New method stabilizes LLM reasoning by rescuing near-boundary signals
Researchers have identified a key bottleneck in Reinforcement Learning from Verifiable Rewards (RLVR) that hinders LLM reasoning optimization. The study pinpoints rigid clipping decisions in standard hard-clipping metho…
-
New PRISM framework corrects SFT flaws in multimodal LLM training
New research from institutions including the Hong Kong University of Science and Technology (Guangzhou) reveals a critical flaw in the common post-training paradigm for multimodal large language models (MLLMs). The stan…
-
IBM releases Granite 4.1 LLMs with 512K context and Apache 2.0 license
IBM has released the Granite 4.1 family of large language models, comprising 3B, 8B, and 30B parameter versions. These models were trained on approximately 15 trillion tokens through a five-stage pre-training process th…