LLM co-evolution boosted by vocabulary dropout for sustained diversity

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-29 04:00

Researchers have developed a technique called vocabulary dropout to address diversity collapse in co-evolutionary language model training. This method involves applying a random mask to the proposer model's output logits, preventing it from generating repetitive problems. Experiments with Qwen3-4B and Qwen3-8B models on mathematical reasoning tasks showed that vocabulary dropout maintained proposer diversity and led to significant solver improvements, particularly on challenging benchmarks. AI

影响 Introduces a method to improve LLM training diversity and performance on reasoning tasks.

排序理由 This is a research paper detailing a new technique for LLM training.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Jacob Dineen, Aswin RRV, Zhikun Xu, Ben Zhou · 2026-04-29 04:00

Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution

arXiv:2604.03472v2 Announce Type: replace Abstract: Co-evolutionary self-play, where one language model generates problems and another solves them, promises autonomous curriculum learning without human supervision. In practice, the proposer quickly converges to a narrow distribut…

报道来源 [1]

Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution

相关实体

相关话题