English(EN) Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

新的Scone模型提高了主题在图像生成中的区分度

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-10 04:00

研究人员推出了一种新颖的主题驱动图像生成方法Scone，解决了区分多个主体时的局限性。Scone集成了组合和区分能力，利用一个理解专家来指导一个生成专家，以保持主体身份。该方法采用两阶段训练过程，并引入了一个新的基准SconeEval，用于评估组合和区分能力。 AI

影响增强了AI模型准确生成具有多个不同主体的图像的能力，提高了真实感和可控性。

排序理由这是一篇描述图像生成新模型和基准的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Yuran Wang, Bohan Zeng, Chengzhuo Tong, Wenxuan Liu, Yang Shi, Xiaochen Ma, Hao Liang, Yuanxing Zhang, Wentao Zhang · 2026-06-10 04:00

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

arXiv:2512.12675v3 Announce Type: replace-cross Abstract: Subject-driven image generation has advanced from single- to multi-subject composition, while neglecting distinction, the ability to distinguish and generate the correct subject when inputs contain multiple candidates. Thi…