新研究通过随机最大值原理构建伴随匹配

作者 PulseAugur 编辑部 · [1 个来源] · 2026-07-03 04:00

一篇题为“Adjoint Matching through the Lens of the Stochastic Maximum Principle in Optimal Control”的新研究论文，由Jiequn Han撰写，严格推导并推广了随机最优控制问题的伴随匹配方法。该工作构建了一个哈密顿伴随匹配目标，并证明了其与Hamilton-Jacobi-Bellman平稳性条件的关系。对于扩散项与状态和控制无关的情况，该论文恢复了先前引入的精简伴随匹配损失，同时强调了在扩散项与状态相关的需要额外项。该研究为传统随机最大值原理算法提供了一种实用、可实现的替代方案，特别是在鞅项构成挑战的随机环境中。 AI

影响为优化生成模型和采样技术提供了一个新框架。

排序理由关于一种新颖控制方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Carles Domingo-Enrich, Jiequn Han · 2026-07-03 04:00

Adjoint Matching through the Lens of the Stochastic Maximum Principle in Optimal Control

arXiv:2604.08580v2 Announce Type: replace-cross Abstract: Reward fine-tuning of diffusion and flow models and sampling from tilted or Boltzmann distributions can both be formulated as stochastic optimal control (SOC) problems, where learning an optimal generative dynamics corresp…

报道来源 [1]

Adjoint Matching through the Lens of the Stochastic Maximum Principle in Optimal Control

相关实体

相关话题