English(EN) When Good Verifiers Go Bad: Self-Improving VLMs Can Regress on New Tasks

研究发现：自改进的视觉语言模型可能在新任务上出现性能回退

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-12 16:55

一项新的研究论文揭示，自改进的视觉语言模型（VLMs）可能在新任务上出现性能回退，这与更强的验证器总是能带来更强的学生模型的假设相反。研究发现，验证器的质量高度依赖于具体任务，在一个任务上提高性能的验证器实际上会在另一个任务上降低性能。这种回退是悄无声息发生的，即使在性能下降时训练损失也在减少，并且会被自信错误的偏好对所放大。 AI

影响强调了自改进VLMs技术中的一个关键缺陷，表明需要更鲁棒的验证和特定任务的评估方法。

排序理由该集群包含一篇研究论文，详细介绍了关于自改进VLMs行为的新发现。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Jianzhe Lin · 2026-06-15 04:00

When Good Verifiers Go Bad: Self-Improving VLMs Can Regress on New Tasks

arXiv:2606.14629v1 Announce Type: cross Abstract: Verifier-driven self-DPO is a common recipe for self-improving production visual-language models. In this setup, a frozen verifier scores candidate generations, the top- and bottom-scoring candidates form a preference example, and…
arXiv cs.AI TIER_1 English(EN) · Jianzhe Lin · 2026-06-12 16:55

当优秀的验证器变坏：自改进的VLMs在新任务上可能出现退化

Verifier-driven self-DPO is a common recipe for self-improving production visual-language models. In this setup, a frozen verifier scores candidate generations, the top- and bottom-scoring candidates form a preference example, and DPO updates the learner. The deployment-time assu…

报道来源 [2]

When Good Verifiers Go Bad: Self-Improving VLMs Can Regress on New Tasks

当优秀的验证器变坏：自改进的VLMs在新任务上可能出现退化

相关话题