English(EN) What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models

视觉生成模型通过连续训练提高组合泛化能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-28 04:00

研究人员调查了影响视觉生成模型组合泛化能力的因素，重点关注已知概念的新组合是如何生成的。他们的研究强调了训练目标是使用离散分布还是连续分布，以及训练期间条件提供的信息量的重要性。研究结果表明，在现有的离散模型中，结合使用基于JEPA的连续目标和离散损失（例如在MaskGIT中）可以提高组合性能。 AI

影响确定了提高视觉生成模型中新概念组合能力的关键训练目标特征。

排序理由学术论文，详细介绍了影响视觉生成模型组合泛化能力的因素的系统研究。

在 arXiv cs.CV 阅读 →

MaskGIT

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Karim Farid, Rajat Sahay, Yumna Ali Alnaggar, Simon Schrodi, Volker Fischer, Cordelia Schmid, Thomas Brox · 2026-04-28 04:00

是什么驱动了组合泛化能力？连续训练目标在视觉生成模型中的重要性

arXiv:2510.03075v3 Announce Type: replace Abstract: Compositional generalization, the ability to generate novel combinations of known concepts, is a key ingredient for visual generative models. Yet, not all mechanisms that enable or inhibit it are fully understood. In this work, …

报道来源 [1]

是什么驱动了组合泛化能力？连续训练目标在视觉生成模型中的重要性

相关话题