Pool-Select-Refine: Allocation-Aware Generative Dataset Distillation with Soft-Label-Guided Latent Refinement

New framework enhances generative dataset distillation with two-stage refinement

By the PulseAugur editorial team · [1 source] · 2026-06-02 04:00

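Researchers have introduced Pool-Select-Refine, a new framework for generative dataset distillation, a technique that uses diffusion models to compress a large dataset into a smaller synthetic one. The method improves on existing approaches by first generating an overcomplete pool of candidate samples and then selecting a subset that fits a specified budget. The selected samples are then refined in latent space under soft-label supervision, which strengthens semantic alignment while preserving generation quality.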
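The three stages described above (pool, select, refine) can be illustrated with a toy sketch. This is not the authors' implementation: random Gaussian vectors stand in for diffusion latents, a fixed linear map `W` stands in for a teacher classifier, and the selection rule (lowest soft-label cross-entropy) and the anchor penalty that keeps each refined latent near its starting point are assumptions made only for illustration. All function names are hypothetical.

```python
import math
import random


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def logits_of(W, z):
    # Toy linear "teacher" (assumption): one weight row per class.
    return [sum(w * x for w, x in zip(row, z)) for row in W]


def cross_entropy(soft_label, probs):
    return -sum(q * math.log(p + 1e-12) for q, p in zip(soft_label, probs))


def build_pool(num_candidates, dim, rng):
    # Stand-in for sampling latents from a diffusion model (assumption).
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_candidates)]


def select(pool, W, soft_label, budget):
    # Hypothetical criterion: keep the `budget` candidates with the lowest
    # soft-label cross-entropy under the toy teacher.
    ranked = sorted(pool, key=lambda z: cross_entropy(soft_label, softmax(logits_of(W, z))))
    return [list(z) for z in ranked[:budget]]


def refine(z, W, soft_label, steps=20, lr=0.1, anchor_weight=0.1):
    # Gradient descent on CE(soft_label, softmax(W z)) + anchor_weight/2 * ||z - z0||^2.
    # The anchor term is an assumed proxy for "preserving generation quality".
    z0 = list(z)
    z = list(z)
    for _ in range(steps):
        probs = softmax(logits_of(W, z))
        # Gradient of the cross-entropy w.r.t. the logits is probs - soft_label.
        dlogits = [p - q for p, q in zip(probs, soft_label)]
        grad = [
            sum(dlogits[k] * W[k][j] for k in range(len(W))) + anchor_weight * (z[j] - z0[j])
            for j in range(len(z))
        ]
        z = [x - lr * g for x, g in zip(z, grad)]
    return z


def pool_select_refine(W, soft_label, pool_size=32, budget=4, seed=0):
    rng = random.Random(seed)
    pool = build_pool(pool_size, len(W[0]), rng)        # 1. overcomplete pool
    selected = select(pool, W, soft_label, budget)      # 2. subset within budget
    refined = [refine(z, W, soft_label) for z in selected]  # 3. latent refinement
    return selected, refined
```

Calling `pool_select_refine([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]], [0.8, 0.2])` returns four selected latents and their refined versions. In a real pipeline the refined latents would be decoded back to images; that step is omitted here.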

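Impact: The framework could make dataset distillation more efficient and effective, which may improve AI model training by supplying smaller, carefully curated synthetic datasets.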

Ranking rationale: This cluster contains a research paper detailing a new framework for dataset distillation.


AI-generated summary · Google Gemini · based on 1 source.

Sources [1]

arXiv cs.CV · Wenmin Li, Shunsuke Sakai, Zhongkai Zhao, Tatsuhito Hasegawa · 2026-06-02 04:00

Pool-Select-Refine: Allocation-Aware Generative Dataset Distillation with Soft-Label-Guided Latent Refinement

arXiv:2606.01920v1 · Abstract: Diffusion-based dataset distillation has recently emerged as a promising paradigm for condensing large-scale datasets into compact synthetic sets. By leveraging pretrained generative priors, these methods can produce realistic class…

