English(EN) Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting

图像修复研究揭示奖励模型偏差

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-03 04:00

研究人员利用直接偏好优化框架和公开可用的奖励模型，重新审视了图像修复的首选项对齐。他们的研究表明，虽然大多数奖励模型提供了有效的信号，但一些模型在亮度、构图和颜色方面存在偏差，导致奖励破解。这些奖励模型的集成有效地缓解了这些偏差，在标准指标和人类评估方面均取得了改进的性能，甚至显示出对物体移除任务的可迁移性。 AI

影响识别当前图像生成任务奖励模型的偏差，并提出集成方法以获得更鲁棒和可泛化的结果。

排序理由这是一篇详细介绍图像修复技术和奖励模型研究的论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

Direct Preference Optimization

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Junkun Yuan, Yutao Shen, Toru Aonishi, Hideki Nakayama, Yue Ma · 2026-06-03 04:00

Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting

arXiv:2606.03216v1 Announce Type: new Abstract: We study preference alignment for image inpainting. Rather than proposing yet another method, we revisit the problem from first principles and reassess its core challenges. We adopt the widely used direct preference optimization fra…

报道来源 [1]

Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting

相关实体

相关话题