PulseAugur
实时 13:13:11
English(EN) Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation

扩散模型蒸馏在高维空间中表现出“复制”行为

研究人员在扩散模型的高维蒸馏中发现了一种称为“复制”的现象。当蒸馏出的学生模型复制教师模型的原始噪声-数据配对时,就会发生这种情况,而在低维设置中并未观察到这种行为。研究表明,这种复制是由于学生模型在蒸馏过程中几何自由度有限而产生的涌现特性,而不是对抗性目标或教师记忆所致。 AI

影响 识别出扩散模型蒸馏中的一种新行为,可能影响压缩模型的效率和泛化能力。

排序理由 该集群包含一篇详细介绍模型蒸馏新发现的学术论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.LG TIER_1 English(EN) · Shucheng Li, Iolo Jones, Alexander Tong, Michael M. Bronstein ·

    Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation

    arXiv:2606.02237v1 Announce Type: new Abstract: Distribution Matching Distillation (DMD) compresses pretrained diffusion models into efficient few-step generators by aligning their noised distributions across all scales. In principle, such distribution-level supervision remains a…

  2. arXiv cs.LG TIER_1 English(EN) · Michael M. Bronstein ·

    Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation

    Distribution Matching Distillation (DMD) compresses pretrained diffusion models into efficient few-step generators by aligning their noised distributions across all scales. In principle, such distribution-level supervision remains agnostic to specific noise-data pairings of the t…