New training method combats LLM diversity loss

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 05:11

Researchers have developed a new method called annotation-anchored training to address semantic mode collapse in large language models. This technique involves pretraining models on documents paired with semantic annotations, which helps maintain the diversity of the original pretraining data during fine-tuning. The approach allows models to generate more diverse outputs by using these annotations as anchors, reportedly reducing diversity collapse by six times compared to standard supervised fine-tuning and showing improved performance with increased model scale. AI

影响 Mitigates semantic diversity loss in LLMs, potentially leading to more varied and robust model outputs.

排序理由 The cluster contains an academic paper detailing a new method for training language models. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Aditi Raghunathan · 2026-05-11 05:11

Annotations Mitigate Post-Training Mode Collapse

Post-training (via supervised fine-tuning) improves instruction-following, but often induces semantic mode collapse by biasing models toward low-entropy fine-tuning data at the expense of the high-entropy pretraining distribution. Crucially, we find this trade-off worsens with sc…

报道来源 [1]

Annotations Mitigate Post-Training Mode Collapse

相关话题