English(EN) Drifting Preference Optimization for One-Step Generative Models

新的漂移偏好优化微调单步图像生成器

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-01 17:31

研究人员开发了漂移偏好优化（DrPO）方法，这是一种用于微调单步文本到图像生成模型的新方法。该技术能够对确定性单步生成器进行高效的偏好调整，而确定性单步生成器因其速度而备受青睐。DrPO 从高分和低分图像样本中合成更新方向，从而能够在不要求可微性的情况下使用各种奖励函数进行训练。 AI

影响能够实现对单步图像生成模型更快、更灵活的微调。

排序理由该集群包含一篇详细介绍生成模型新方法的论文。

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.LG TIER_1 English(EN) · Zhou Jiang, Yandong Wen, Zhen Liu · 2026-06-02 04:00

面向一步生成模型的漂移偏好优化

arXiv:2606.02521v1 Announce Type: new Abstract: One-step text-to-image generators are attractive for deployment because they generate an image with a single forward pass, but preference finetuning them remains difficult: standard alignment methods often rely on policy likelihoods…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-01 17:31

面向一步生成模型的漂移偏好优化

One-step text-to-image generators are attractive for deployment because they generate an image with a single forward pass, but preference finetuning them remains difficult: standard alignment methods often rely on policy likelihoods, denoising trajectories, differentiable reward …
arXiv cs.LG TIER_1 English(EN) · Zhen Liu · 2026-06-01 17:31

面向一步生成模型的漂移偏好优化

One-step text-to-image generators are attractive for deployment because they generate an image with a single forward pass, but preference finetuning them remains difficult: standard alignment methods often rely on policy likelihoods, denoising trajectories, differentiable reward …