New metric preserves diversity in AI image generation

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-12 13:29

Researchers have identified a critical flaw in Reinforcement Learning from Human Feedback (RLHF) when applied to flow-matching text-to-image models, where standard policy entropy fails to prevent a collapse in perceptual diversity. They propose a new metric, perceptual entropy, to accurately capture diversity in the perceptual space, addressing the limitations of policy entropy which remains constant despite diversity loss. Experiments demonstrate that strategies based on perceptual entropy significantly improve the quality-diversity trade-off in image generation models. AI

影响 Introduces a novel metric to address diversity collapse in AI image generation, potentially improving the quality and variety of outputs.

排序理由 The cluster contains an academic paper introducing a new metric and methodology for AI model training. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Feng Zheng · 2026-05-12 13:29

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

RLHF is widely used to align flow-matching text-to-image models with human preferences, but often leads to severe diversity collapse after fine-tuning. In RL, diversity is often assumed to correlate with policy entropy, motivating entropy regularization. However, we show this int…

报道来源 [1]

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

相关实体

相关话题