English(EN) Exploring the Design Space of Reward Backpropagation for Flow Matching

新的FlowBP框架增强了文本到图像模型的对齐能力

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-09 16:36

研究人员推出了一种名为FlowBP的新框架，旨在提高文本到图像模型与人类偏好的对齐度。该方法通过创建替代的反向轨迹，解决了直接奖励反向传播的内存限制和梯度膨胀等局限性。FlowBP提供了三种变体，可以限制内存使用并减少梯度链式反应，在SD3.5-M和FLUX等模型上展示了各项指标的改进。 AI

影响引入了一个新颖的框架，以提高生成模型与人类偏好对齐的效率和有效性。

排序理由该集群包含一篇研究论文，详细介绍了一种改进文本到图像模型的新方法。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Ruoyu Wang, Boye Niu, Xiangxin Zhou, Yushi Huang, Tongliang Liu, Chi Zhang · 2026-06-10 04:00

Exploring the Design Space of Reward Backpropagation for Flow Matching

arXiv:2606.11075v1 Announce Type: new Abstract: Aligning text-to-image flow matching models with human preferences via direct reward backpropagation is sample-efficient but hampered by two well-known pathologies: activations cannot be stored across the full sampling trajectory at…
arXiv cs.LG TIER_1 English(EN) · Chi Zhang · 2026-06-09 16:36

Exploring the Design Space of Reward Backpropagation for Flow Matching

Aligning text-to-image flow matching models with human preferences via direct reward backpropagation is sample-efficient but hampered by two well-known pathologies: activations cannot be stored across the full sampling trajectory at modern model scale, and chained Jacobian produc…