New metric preserves diversity in AI image generation

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have identified a critical flaw in Reinforcement Learning from Human Feedback (RLHF) when applied to flow-matching text-to-image models, where standard policy entropy fails to prevent a collapse in perceptual diversity. They propose a new metric, perceptual entropy, to accurately capture diversity in the perceptual space, addressing the limitations of policy entropy which remains constant despite diversity loss. Experiments demonstrate that strategies based on perceptual entropy significantly improve the quality-diversity trade-off in image generation models. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Introduces a novel metric to address diversity collapse in AI image generation, potentially improving the quality and variety of outputs.

RANK_REASON The cluster contains an academic paper introducing a new metric and methodology for AI model training. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

COVERAGE [1]

arXiv cs.CV TIER_1 · Feng Zheng · 2026-05-12 13:29

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

RLHF is widely used to align flow-matching text-to-image models with human preferences, but often leads to severe diversity collapse after fine-tuning. In RL, diversity is often assumed to correlate with policy entropy, motivating entropy regularization. However, we show this int…

COVERAGE [1]

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

RELATED ENTITIES

RELATED TOPICS