PulseAugur
实时 12:36:36

Apple 研究人员为扩散模型泛化识别局部分数

Apple 的研究论文探讨了条件扩散模型中组合泛化机制,特别关注这些模型如何处理生成比训练时更多的对象的图像。研究确定“局部条件分数”是实现此能力的关键因素,表明成功进行长度泛化的模型表现出这些分数,而失败的模型则没有。研究还提出了一种强制执行这些局部分数的方法,该方法成功地使先前表现不佳的模型实现了长度泛化。 AI

影响 对扩散模型泛化的研究可能导致更强大、更可控的图像生成系统。

排序理由 该集群包含详细介绍扩散模型及其泛化能力研究的学术论文。

在 Hugging Face Blog 阅读 →

AI 生成摘要 · Google Gemini · 来自 436 个来源。 我们如何撰写摘要 →

Apple 研究人员为扩散模型泛化识别局部分数

报道来源 [436]

  1. Apple Machine Learning Research TIER_1 English(EN) ·

    条件扩散中的组合泛化的局部机制

    Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization,…

  2. Hugging Face Blog TIER_1 English(EN) ·

    使用 TRL 通过 DDPO 微调 Stable Diffusion 模型

  3. Hugging Face Blog TIER_1 English(EN) ·

    在 Intel CPU 上微调 Stable Diffusion 模型

  4. Hugging Face Blog TIER_1 English(EN) ·

    使用 NNCF 和 🤗 Optimum 优化 Intel CPU 上的 Stable Diffusion

  5. Hugging Face Blog TIER_1 English(EN) ·

    使用 InstructPix2Pix 对 Stable Diffusion 进行指令微调

  6. Hugging Face Blog TIER_1 English(EN) ·

    在 Intel CPU 上加速 Stable Diffusion 推理

  7. Hugging Face Blog TIER_1 English(EN) ·

    使用 LoRA 对 Stable Diffusion 进行高效微调

  8. Hugging Face Blog TIER_1 English(EN) ·

    日本Stable Diffusion

  9. Hugging Face Blog TIER_1 English(EN) ·

    The Annotated Diffusion Model

  10. arXiv cs.AI TIER_1 English(EN) · Jianwei Fei, Yunshu Dai, Zhihua Xia, Xiaochun Cao, Jiantao Zhou, Alessandro Piva, Benedetta Tondi ·

    图像扩散模型的有效、鲁棒且抗共谋指纹识别

    arXiv:2606.12977v1 Announce Type: cross Abstract: Model fingerprinting, embedding user-specific identifiers (fingerprints) into generated outputs, has recently emerged as a popular solution to protect the intellectual property rights (IPR) of generative text-to-image (T2I) models…

  11. arXiv cs.AI TIER_1 English(EN) · Rapha\"el Razafindralambo, R\'emy Sun, Fr\'ed\'eric Precioso, Jes Frellsen, Pierre-Alexandre Mattei ·

    利用 Jeffrey Guidance 实现对扩散模型的更通用控制

    arXiv:2606.13240v1 Announce Type: cross Abstract: A key strength of diffusion models lies in their flexibility, since their outputs can be controlled at sampling time through guidance. However, beyond simple cases such as conditional sampling, the target distribution is often lef…

  12. arXiv cs.LG TIER_1 English(EN) · Metod Jazbec ·

    分子扩散模型的不确定性估计

    Diffusion models have seen wide adoption for 3D molecular generation, yet they offer no principled signal of when a generated molecule is likely to be of low quality. We propose a post-hoc method for estimating per-sample uncertainty in pretrained molecular diffusion models. Buil…

  13. Hugging Face Daily Papers TIER_1 English(EN) ·

    分子扩散模型的不确定性估计

    Diffusion models have seen wide adoption for 3D molecular generation, yet they offer no principled signal of when a generated molecule is likely to be of low quality. We propose a post-hoc method for estimating per-sample uncertainty in pretrained molecular diffusion models. Buil…

  14. arXiv cs.LG TIER_1 English(EN) · Muneendra Ojha ·

    通过改进的逆扩散采样增强分类器引导扩散模型中的低密度区域探索

    Diffusion models have emerged as state-of-the-art generative models for high-fidelity image synthesis, particularly in their classifier-free guided and classifier-guided forms. However, standard classifier guidance concentrates probability mass around high-density class mean, lea…

  15. Hugging Face Daily Papers TIER_1 English(EN) ·

    DuET: Diffusion图像编辑的双专家轨迹

    Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the result appears, especially when the target scene d…

  16. arXiv cs.CL TIER_1 English(EN) · Lexington Whalen, Yuki Ito, Ryo Sakamoto ·

    教导Diffusion模型进行从左到右的推测

    arXiv:2606.11552v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decodi…

  17. arXiv cs.LG TIER_1 English(EN) · Zhongxin Yang, Yuanwei Bin, Xiang I. A. Yang, Shiyi Chen ·

    基于最小作用量引导的物理外推扩散模型

    arXiv:2606.11277v1 Announce Type: new Abstract: Reliable extrapolation remains a central challenge for generative models in computational physics, because models trained over finite ranges of time, parameters, or geometries may produce physically inconsistent predictions outside …

  18. arXiv cs.LG TIER_1 English(EN) · Yi Hu, Leying Yi, Emily Davis, Finn Carter ·

    在扩散模型编辑的可控性-保真度前沿

    arXiv:2606.09901v1 Announce Type: cross Abstract: Diffusion-based generative models enable powerful image editing capabilities, but achieving precise control while maintaining fidelity and safety remains challenging. We present a comprehensive theoretical and empirical study of c…

  19. arXiv cs.LG TIER_1 English(EN) · Huijie Zhang, Jinfan Zhou, Yifu Lu, Minzhe Guo, Peng Wang, Liyue Shen, Qing Qu ·

    扩散模型中可复现性和泛化性的出现

    arXiv:2310.05264v5 Announce Type: replace Abstract: In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion mo…

  20. arXiv cs.AI TIER_1 English(EN) · Matina Mahdizadeh Sani, Nima Jamali, Mohammad Jalali, Farzan Farnia ·

    MMD 指导:通过最大均值差异指导实现无训练的扩散模型分布适应

    arXiv:2601.08379v2 Announce Type: replace-cross Abstract: Pre-trained diffusion models have emerged as powerful generative priors for both unconditional and conditional sample generation, yet their outputs often deviate from the characteristics of user-specific target data. Such …

  21. arXiv cs.LG TIER_1 English(EN) · Peng Wang, Huijie Zhang, Zekai Zhang, Siyi Chen, Yi Ma, Qing Qu ·

    打破维度灾难:扩散模型高效学习低维分布

    arXiv:2409.02426v5 Announce Type: replace Abstract: Despite their empirical success across a wide range of generative tasks, the fundamental principles underlying the ability of diffusion models to learn data distributions are poorly understood. In this work, we develop a new mat…

  22. Hugging Face Daily Papers TIER_1 English(EN) ·

    教导Diffusion模型进行从左到右的推测

    Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decoding addresses this bottleneck by employing a ligh…

  23. arXiv cs.CL TIER_1 English(EN) · Ryo Sakamoto ·

    教导Diffusion模型进行从左到右的推测

    Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decoding addresses this bottleneck by employing a ligh…

  24. arXiv cs.LG TIER_1 English(EN) · Shigui Li, Delu Zeng ·

    通过 Stein 稳定化缓解扩散 ODE 中的收缩陷阱

    arXiv:2606.07835v1 Announce Type: new Abstract: A fundamental tension exists in the large-step inference of diffusion models via their deterministic probability flow ordinary differential equation (PF-ODE) trajectories, which we identify as the contractivity trap: efficient infer…

  25. arXiv cs.AI TIER_1 English(EN) · Danqi Zhuang, Jisui Huang, Xiaoyue Xi, Andrew Kiggins, Xiaojie Wang, Ke Chen, Yue Wu ·

    PTL-Diffusion:具有周期性终端定律的流形感知扩散

    arXiv:2606.09816v1 Announce Type: cross Abstract: Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explic…

  26. arXiv cs.LG TIER_1 English(EN) · Xiao Li, Yixuan Jia, Zekai Zhang, Xiang Li, Lianghe Shi, Jinxin Zhou, Zhihui Zhu, Liyue Shen, Qing Qu ·

    通过自监督原理评估扩散模型的表示空间

    arXiv:2606.09718v1 Announce Type: new Abstract: Diffusion models have demonstrated remarkable generative capabilities and have also emerged as powerful self-supervised representation learners, yet the connection between these two abilities remains less explored. Drawing inspirati…

  27. arXiv cs.LG TIER_1 English(EN) · Meher Chaitanya, Sebastian Dalleiger, Luana Ruiz ·

    阈值化局部超流扩散

    arXiv:2606.09340v1 Announce Type: new Abstract: Local Hyper-Flow Diffusion (HFD) gives an edge-size-independent Cheeger-type guarantee for seeded clustering in general submodular hypergraphs, but existing HFD solvers do not keep intermediate computation local at every iteration. …

  28. arXiv cs.LG TIER_1 English(EN) · Emma Finn, Binxu Wang, T. Anderson Keller, Demba E. Ba ·

    分数何处寻:小波视角下的扩散模型

    arXiv:2606.08309v1 Announce Type: new Abstract: Score-based generative models have had remarkable success over the last decade in generating a diverse set of visually plausible images. A variety of architectures including CNNs, U-Nets, and Transformers have been used as the score…

  29. arXiv cs.AI TIER_1 English(EN) · Runze Li, Hanchen Wang, Wenjie Zhang, Binghao Li, Yu Zhang, Xuemin Lin, Ying Zhang ·

    FADTI:傅里叶与注意力驱动扩散用于多元时间序列插补

    arXiv:2512.15116v2 Announce Type: replace-cross Abstract: Multivariate time series imputation is fundamental in applications such as healthcare, traffic forecasting, and biological modeling, where sensor failures and irregular sampling lead to pervasive missing values. However, e…

  30. arXiv cs.AI TIER_1 English(EN) · Yue Wu ·

    PTL-Diffusion:具有周期性终端定律的流形感知扩散

    Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explicit structure for data concentrated near low-dimens…

  31. Hugging Face Daily Papers TIER_1 English(EN) ·

    PTL-Diffusion:具有周期性终端定律的流形感知扩散

    Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explicit structure for data concentrated near low-dimens…

  32. arXiv cs.LG TIER_1 English(EN) · Qing Qu ·

    通过自监督原理评估扩散模型的表示空间

    Diffusion models have demonstrated remarkable generative capabilities and have also emerged as powerful self-supervised representation learners, yet the connection between these two abilities remains less explored. Drawing inspiration from self-supervised learning (SSL), we intro…

  33. arXiv cs.LG TIER_1 English(EN) · Luana Ruiz ·

    阈值化局部超流扩散

    Local Hyper-Flow Diffusion (HFD) gives an edge-size-independent Cheeger-type guarantee for seeded clustering in general submodular hypergraphs, but existing HFD solvers do not keep intermediate computation local at every iteration. We introduce Thresholded Local HFD (TL-HFD), a f…

  34. arXiv cs.LG TIER_1 English(EN) · Zekai Zhang, Xiao Li, Xiang Li, Lianghe Shi, Meng Wu, Molei Tao, Qing Qu ·

    扩散模型在平衡表示空间中实现泛化

    arXiv:2512.20963v3 Announce Type: replace Abstract: Diffusion models excel at generating high-quality, diverse samples, yet they risk memorizing training data when overfit to the training objective. We analyze the distinctions between memorization and generalization in diffusion …

  35. arXiv cs.AI TIER_1 English(EN) · Renzo G. Soatto, Anders Hoel, Greycen Ren, Shorna Alam, Stephen Bates, Nikolaos P. Daskalakis, Caroline Uhler, Maria Skoularidou ·

    CountsDiff: 一种基于自然数的扩散模型,用于计数数据的生成和插补

    arXiv:2604.03779v2 Announce Type: replace-cross Abstract: Diffusion models have excelled at generative tasks for both continuous and token-based domains, but their application to discrete ordinal data remains underdeveloped. We present CountsDiff, a diffusion framework designed t…

  36. arXiv cs.LG TIER_1 English(EN) · Lee Hyoseok, Sohwi Lim, Eunju Cha, Tae-Hyun Oh ·

    用于稳定潜在扩散逆问题求解器的测量一致性 Langevin 修正器

    arXiv:2601.04791v4 Announce Type: replace-cross Abstract: While latent diffusion models (LDMs) have emerged as powerful priors for inverse problems, existing LDM-based solvers frequently suffer from instability. In this work, we first identify the instability as a discrepancy bet…

  37. Hugging Face Daily Papers TIER_1 English(EN) ·

    MaskAlign:用于高效扩散训练的Token子集表示对齐

    Token-subset representation alignment method called MaskAlign improves diffusion transformer training by reducing reliance on complete token sets and maintaining stable alignment behavior under perturbations.

  38. arXiv cs.LG TIER_1 English(EN) · Demba E. Ba ·

    分数何处寻:小波视角下的扩散模型

    Score-based generative models have had remarkable success over the last decade in generating a diverse set of visually plausible images. A variety of architectures including CNNs, U-Nets, and Transformers have been used as the score-approximation network in such diffusion modelin…

  39. arXiv cs.LG TIER_1 English(EN) · Aditya Shankar, Yuandou Wang, Rihan Hai, Lydia Y. Chen ·

    Harpoon:用于条件表格扩散的通用流形引导

    arXiv:2602.07875v3 Announce Type: replace Abstract: Generating tabular data under conditions is critical to applications requiring precise control over the generative process. Existing methods rely on training-time strategies that do not generalise to unseen constraints during in…

  40. arXiv cs.LG TIER_1 English(EN) · Parsa Esmati, Somjit Nath, Katja Hofmann, Derek Nowrouzezahrai, Samira Ebrahimi Kahou, Majid Mirmehdi ·

    物理学的看不见的手:当视频扩散模型比它们展示的知道得更多

    arXiv:2606.05328v1 Announce Type: cross Abstract: Modern video diffusion models generate increasingly realistic and temporally coherent videos, motivating their use as candidate world simulators. Yet it remains unclear whether these models internally encode physical structure, or…

  41. arXiv cs.LG TIER_1 English(EN) · Hongkun Dou, Zike Chen, Fengji Li, Hongjue Li, Yue Deng ·

    通过梯度信息logit校正实现离散扩散模型的即插即用引导

    arXiv:2606.06303v1 Announce Type: new Abstract: Controllable generation with discrete diffusion models is often hindered by high computational overhead or the need for retraining. In this paper, we present \underline{\textbf{G}}radient-\underline{\textbf{I}}nformed \underline{\te…

  42. arXiv cs.LG TIER_1 English(EN) · Haoyang Cao, Minshuo Chen, Yinbin Han, Renyuan Xu ·

    用于自适应序列数据生成的扩散模型

    arXiv:2606.06007v1 Announce Type: new Abstract: Generating realistic synthetic sequential data is critical in real-world applications across operations research, finance, healthcare, energy systems, and scientific computing, where time-indexed observations are used for prediction…

  43. arXiv cs.AI TIER_1 English(EN) · Yue Deng ·

    通过梯度信息logit校正实现离散扩散模型的即插即用引导

    Controllable generation with discrete diffusion models is often hindered by high computational overhead or the need for retraining. In this paper, we present \underline{\textbf{G}}radient-\underline{\textbf{I}}nformed \underline{\textbf{L}}ogit \underline{\textbf{C}}orrection (\t…

  44. Hugging Face Daily Papers TIER_1 English(EN) ·

    用于自适应序列数据生成的扩散模型

    Generating realistic synthetic sequential data is critical in real-world applications across operations research, finance, healthcare, energy systems, and scientific computing, where time-indexed observations are used for prediction, simulation, risk assessment, and data-driven d…

  45. arXiv cs.CL TIER_1 English(EN) · Xinrui Song, Zhuoran Wang, Mingju Gao, Hao Tang ·

    SemBlock:用于扩散式大语言模型的语义边界动态块

    arXiv:2606.04964v1 Announce Type: new Abstract: Diffusion language models (DLMs) generate text through iterative denoising, and blockwise decoding improves their practicality by committing tokens in local blocks. However, existing blockwise methods typically rely on fixed block s…

  46. arXiv cs.AI TIER_1 English(EN) · Omer Dahary, Benaya Koren, Daniel Garibi, Daniel Cohen-Or ·

    在扩散 Transformer 的上下文空间中进行即时排斥以实现丰富的多样性

    arXiv:2603.28762v2 Announce Type: replace-cross Abstract: Modern Text-to-Image (T2I) diffusion models have achieved remarkable semantic alignment, yet they often suffer from a significant lack of variety, converging on a narrow set of visual solutions for any given prompt. This t…

  47. arXiv cs.LG TIER_1 English(EN) · Ram\'on Nartallo-Kaluarachchi, Renaud Lambiotte, Alain Goriely ·

    漂移-扩散匹配:不对称神经网络潜在流形中的嵌入动力学

    arXiv:2602.14885v2 Announce Type: replace-cross Abstract: Recurrent neural networks (RNNs) provide a theoretical framework for understanding computation in biological neural circuits, yet classical results, such as Hopfield's model of associative memory, rely on symmetric connect…

  48. arXiv cs.LG TIER_1 English(EN) · Haojun Qiu, Kiriakos N. Kutulakos, David B. Lindell ·

    高效且无需训练的单图像扩散模型

    arXiv:2606.04299v1 Announce Type: cross Abstract: We consider the problem of generating images whose internal structure -- defined by the distribution of patches across multiple scales -- matches that of a single reference image. Recent approaches address this problem by training…

  49. arXiv cs.LG TIER_1 English(EN) · Lixing Zhang, Yidong Ouyang, Weifu Li, Shixiang Zhu, Guang Cheng, Liyan Xie ·

    学习不进行插补:一种面向有意义缺失的感知不确定性的扩散框架

    arXiv:2606.05073v1 Announce Type: new Abstract: Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise fr…

  50. Hugging Face Daily Papers TIER_1 English(EN) ·

    Complexity-Balanced Diffusion Splitting

    Complexity-Balanced Splitting (CBS) allocates generative capacity across specialized sub-networks by partitioning the diffusion timeline based on local complexity measures, improving synthesis quality without increasing inference cost.

  51. arXiv cs.LG TIER_1 English(EN) · Liyan Xie ·

    学习不进行插补:一种面向有意义缺失的感知不确定性的扩散框架

    Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise from two distinct sources: some entries are meanin…

  52. Hugging Face Daily Papers TIER_1 English(EN) ·

    学习不进行插补:一种面向有意义缺失的感知不确定性的扩散框架

    Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise from two distinct sources: some entries are meanin…

  53. arXiv cs.CL TIER_1 English(EN) · Hao Tang ·

    SemBlock:用于扩散大模型的语义边界动态块

    Diffusion language models (DLMs) generate text through iterative denoising, and blockwise decoding improves their practicality by committing tokens in local blocks. However, existing blockwise methods typically rely on fixed block sizes or delimiter-based runtime signals, which d…

  54. arXiv cs.LG TIER_1 English(EN) · Zerui Tao, Qibin Zhao ·

    具有扩散模型先验的贝叶斯张量分解

    arXiv:2606.03212v1 Announce Type: new Abstract: Low-rank tensor decomposition (TD) is usually effective on clean, fully observed data, but it often degrades under severe missingness or noise. Low-rankness is itself a useful but limited structural prior, and additional handcrafted…

  55. arXiv cs.AI TIER_1 English(EN) · Siva Rajesh Kasa, Yasong Dai, Sumit Negi, Hongdong Li ·

    Fast-dLLM++:用于更快扩散 LLM 推理的 Fréchet Profile 解码

    arXiv:2606.02955v1 Announce Type: cross Abstract: Diffusion large language models promise parallel token generation, yet inference remains bottlenecked by deciding which masked tokens can be safely committed together. Fast-dLLM addressed this with KV caching and confidence-guided…

  56. arXiv cs.AI TIER_1 English(EN) · David Turtora Zagardo ·

    几何感知表格扩散

    arXiv:2606.02607v1 Announce Type: cross Abstract: Tabular synthesis is critical for privacy-preserving sharing and augmentation, yet diffusion models rely on implicit mechanisms to capture inter-column relationships. We introduce Geometry-Aware Tabular Diffusion (GATD), which aug…

  57. arXiv cs.LG TIER_1 English(EN) · Kexiang Mao ·

    Flicker-DDPM:通过 1/f 颜色噪声注入加速去噪扩散

    arXiv:2606.03393v1 Announce Type: new Abstract: We propose a novel diffusion model, Flicker-DDPM, which incorporates flicker (1/f) noise inspired by self-organized criticality (SOC), a widely observed phenomenon in natural systems. Unlike denoising diffusion probabilistic models …

  58. arXiv cs.AI TIER_1 English(EN) · Shaokun Lan, Haoran Dou, Jinghan Huang, Arezoo Zakeri, Fengming Lin, Zherui Zhou, Jinming Duan, Alejandro F. Frangi ·

    用于虚拟人群合成的带傅里叶运动建模的条件潜在扩散模型

    arXiv:2606.03827v1 Announce Type: cross Abstract: In-silico trials of medical devices require the generation of virtual populations of anatomies. In cardiovascular applications, virtual anatomy is typically represented as a 3D+t mesh sampled from a generative model. However, most…

  59. arXiv cs.AI TIER_1 English(EN) · Yifu Luo, Yongzhe Chang, Xueqian Wang ·

    Wavelet Fourier Diffuser:用于强化学习的频率感知扩散模型

    arXiv:2509.19305v2 Announce Type: replace-cross Abstract: Diffusion probability models have shown significant promise in offline reinforcement learning by directly modeling trajectory sequences. However, existing approaches primarily focus on time-domain features while overlookin…

  60. arXiv cs.LG TIER_1 English(EN) · Niccol\`o Perrone, Fanny Lehmann, Stefania Fresca, Filippo Gatti ·

    通过稀疏观测的扩散后验采样纠正神经算子谱偏差

    arXiv:2606.03936v1 Announce Type: new Abstract: Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale s…

  61. arXiv cs.AI TIER_1 English(EN) · Davide Gallon, Philippe von Wurstemberger, Patrick Cheridito, Arnulf Jentzen ·

    谱空间中的物理信息扩散模型

    arXiv:2602.09708v2 Announce Type: replace-cross Abstract: We propose physics-informed spectral diffusion (PISD), a methodology that combines generative latent diffusion models with physics-informed machine learning to generate solutions of partial differential equations (PDEs) co…

  62. Hugging Face Daily Papers TIER_1 English(EN) ·

    面向具有不可达边界的扩散贝叶斯推断的神经伽辽金归一化流

    One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observation times, which is needed to derive the likelih…

  63. Hugging Face Daily Papers TIER_1 English(EN) ·

    通过稀疏观测的扩散后验采样纠正神经算子谱偏差

    Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale structure matters. Sparse sensor measurements of …

  64. arXiv cs.LG TIER_1 English(EN) · Filippo Gatti ·

    通过稀疏观测的扩散后验采样纠正神经算子谱偏差

    Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale structure matters. Sparse sensor measurements of …

  65. arXiv cs.AI TIER_1 English(EN) · Alejandro F. Frangi ·

    用于虚拟人群合成的基于傅里叶运动建模的条件潜在扩散模型

    In-silico trials of medical devices require the generation of virtual populations of anatomies. In cardiovascular applications, virtual anatomy is typically represented as a 3D+t mesh sampled from a generative model. However, most existing mesh generators focus on static anatomy,…

  66. arXiv cs.AI TIER_1 English(EN) · Longxuan Yu, Yunshu Wu, Yu Fu, Siheng Xiong, Rob Brekelmans, Hui Liu, Yue Dong, Greg Ver Steeg ·

    DSL-LLaDA:将连续去噪扩展到 8B 掩码扩散语言模型

    arXiv:2606.01024v1 Announce Type: cross Abstract: Discrete Masked diffusion language models generate text by iterative parallel decoding, but few-step decoding suffers from a tradeoff between length and quality: with a fixed step budget, standard methods can generate a short, hig…

  67. arXiv cs.AI TIER_1 English(EN) · Naoki Murata, Yuhta Takida, Chieh-Hsin Lai, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, Yuki Mitsufuji ·

    GUDA:通过遗忘实现扩散模型的反事实分组训练数据归因

    arXiv:2601.22651v2 Announce Type: replace-cross Abstract: Training-data attribution for vision generative models aims to identify which training data influenced a given output. While most methods score individual examples, practitioners often need group-level answers (e.g., artis…

  68. arXiv cs.AI TIER_1 English(EN) · Abdullah Al Shafi, Kazi Saeed Alam, Sk Imran Hossain, Engelbert Mephu Nguifo ·

    DASH:用于引导校准的紧凑型扩散模型的双分支分数蒸馏

    arXiv:2606.00798v1 Announce Type: cross Abstract: Parameter compression of class-conditional diffusion models reveals an underexplored limitation in output-level distillation: the unconditional score branch remains unsupervised, leaving the classifier-free guidance gap underdeter…

  69. arXiv cs.AI TIER_1 English(EN) · Zihan Li, Jialan Zheng, Ziyu Li, Xun Yuan, Kasidit Anmahapong, Ziang Wang, Mingxuan Liu, Hongjia Yang, Yifei Chen, Zhuhao Wang, Yuhang He, Fang Chen, Rui Li, Huaiqiang Sun, Yi Liao, Congyu Liao, Yang Yang, Haibo Qu, Xue Zhang, Hongen Liao, Qiyuan Tian ·

    面向定量弥散MRI的物理信息基础模型

    arXiv:2606.00156v1 Announce Type: cross Abstract: Understanding the human brain requires access to its microscopic tissue architecture. Diffusion magnetic resonance imaging (MRI) provides the only noninvasive window into whole-brain microstructure in vivo, yet reliable quantitati…

  70. arXiv cs.AI TIER_1 English(EN) · Duoduo Xue, Zhiyu Zhu, Junhui Hou ·

    具有显式数据流形几何建模的扩散图像生成

    arXiv:2606.00094v1 Announce Type: cross Abstract: Image generative models aim to sample data points from the underlying data manifold, a task that requires learning and decoding a dense, low-dimensional, and compact parameterization space. To achieve this, we propose the Data Man…

  71. arXiv cs.AI TIER_1 English(EN) · Renhao Zhang, Haotian Fu, Mingxi Jia, George Konidaris, Yilun Du, Bruno Castro da Silva ·

    从噪声到控制:参数化扩散策略

    arXiv:2606.00336v1 Announce Type: new Abstract: We propose Parameterized Diffusion Policy (PDP), a framework for learning diffusion policies conditioned on low-dimensional, continuous parameters embedded in a learned behavior manifold. By constructing this manifold so that distan…

  72. arXiv cs.LG TIER_1 English(EN) · Daniela Breitman, Andrei Mesinger, Steven G. Murray, Ivan Nikolic, Roberto Trotta ·

    21cmEMUv3:21cmFAST摘要可观测量的混合扩散-LSTM模拟器

    arXiv:2606.00219v1 Announce Type: cross Abstract: We are witnessing a surge in observations of the cosmic dawn (CD) and epoch of reionisation (EoR), driving an increasing demand for fast and robust theoretical interpretation frameworks. In response, machine learning (ML), and emu…

  73. arXiv cs.AI TIER_1 English(EN) · Zhiying Jiang, Raihan Seraj, Marcos Villagra, Bidhan Roy ·

    异构去中心化扩散模型

    arXiv:2603.06741v2 Announce Type: replace-cross Abstract: Training frontier-scale diffusion models often requires substantial computational resources concentrated in tightly-coupled clusters, limiting participation to well-resourced institutions. While Decentralized Diffusion Mod…

  74. arXiv cs.LG TIER_1 English(EN) · Jama Hussein Mohamud, Mohsin Hasan, Mirco Ravanelli, Yoshua Bengio ·

    面向掩码扩散的自适应订单策略

    arXiv:2606.00295v1 Announce Type: new Abstract: Masked diffusion models have seen great success in capturing data distributions over discrete sequences in domains such as text and proteins. These models generate data by iteratively unmasking tokens starting from a fully masked se…

  75. arXiv cs.LG TIER_1 English(EN) · Anjian Li, Bartolomeo Stellato, Ryne Beeson ·

    GLENS:通过学习求解器迭代和扩散模型进行全局搜索

    arXiv:2606.00366v1 Announce Type: new Abstract: We consider the problem of generating a large collection of initial guesses for local minima of multimodal non-convex continuous optimization problems. The goal is for these initial guesses to be high-quality (i.e., a numerical solv…

  76. arXiv cs.LG TIER_1 English(EN) · Simon De Reuver, Tamas Kristof Toth, Teddy Lazebnik ·

    使用扩散模型对符号回归进行数据丰富

    arXiv:2606.00988v1 Announce Type: new Abstract: Symbolic regression (SR) offers a route to scientific discovery by converting observations into interpretable governing equations. However, despite its promise, its reliability degrades sharply when spatiotemporal measurements are s…

  77. arXiv cs.LG TIER_1 English(EN) · Guanyu Zhou, Yao Liu, Yanglei Gan, Yuxiang Cai, Peng He, Run Lin, Yuxiang Liu, Qiao Liu ·

    GLIDE:图引导的飞跃推理用于时空点过程的扩散估计

    arXiv:2606.01273v1 Announce Type: new Abstract: Spatio-temporal point processes (STPPs) provide a principled framework for modeling asynchronous events in continuous time and space. Recent diffusion-based approaches offer a flexible alternative to deterministic prediction by mode…

  78. arXiv cs.LG TIER_1 English(EN) · Hamza Cherkaoui, H\'el\`ene Halconruy, Antonio Ocello ·

    重尾有助于扩散吗?关于初始化与训练之间的微妙权衡

    arXiv:2605.13175v2 Announce Type: replace Abstract: Recent works have proposed incorporating heavy-tailed (HT) noise into diffusion- and flow-based generative models, with the goals of better recovering the tails of target distributions and improving generative diversity. This mo…

  79. arXiv cs.LG TIER_1 English(EN) · Constant Bourdrez, Alexandre V\'erine, Olivier Capp\'e ·

    通过逆强化学习从扩散模型中学习采样

    arXiv:2602.08689v2 Announce Type: replace Abstract: Diffusion models generate samples through an iterative denoising process guided by a pretrained neural network. Once the denoiser is fixed, the sampling algorithm itself (noise schedules, guidance scales, stochasticity profiles)…

  80. arXiv cs.LG TIER_1 English(EN) · Pengfei Jin, Yiqi Tian, Kailong Fan, Bingjie Qi, Quanzheng Li ·

    面向逆问题的幻觉感知扩散采样与鲁棒先验更新

    arXiv:2606.02331v1 Announce Type: cross Abstract: Diffusion-based inverse problem solvers can produce realistic reconstructions, but realism alone does not ensure that the recovered details are supported by the measurement. We study this failure as measurement-conditioned halluci…

  81. arXiv cs.AI TIER_1 English(EN) · Sajad Ebrahimi, Bhaskar Mitra, Negar Arabzadeh, Ye Yuan, Haolun Wu, Fattane Zarrinkalam, Ebrahim Bagheri ·

    从噪声到有序:通过去噪扩散学习排序

    arXiv:2602.11453v2 Announce Type: replace-cross Abstract: In information retrieval (IR), learning-to-rank (LTR) methods have traditionally limited themselves to discriminative machine learning approaches that model the probability of the document being relevant to the query given…

  82. arXiv cs.AI TIER_1 English(EN) · Yeongmin Kim, Donghyeok Shin, Byeonghu Na, Minsang Park, Richard Lee Kim, Il-Chul Moon ·

    用于扩散模型测试时缩放的前瞻样本奖励指导

    arXiv:2602.03211v2 Announce Type: replace-cross Abstract: Diffusion models have demonstrated strong generative performance; however, generated samples often fail to fully align with human intent. This paper studies an efficient test-time scaling method for sampling from regions w…

  83. arXiv cs.AI TIER_1 English(EN) · Ziseok Lee, Minyeong Hwang, Wooyeol Lee, Sanghyun Jo, Jihyung Ko, Young Bin Park, Jae-Mun Choi, Eunho Yang, Kyungsu Kim ·

    生成路径崩溃:扩散模型引导的标准与修正

    arXiv:2512.10339v2 Announce Type: replace Abstract: Inference-time steering adapts pretrained diffusion and flow models to new tasks without retraining, often utilizing ratio-of-densities constructions that reweight time-indexed marginals with fixed exponents. We identify Margina…

  84. arXiv cs.LG TIER_1 English(EN) · Quanzheng Li ·

    通过鲁棒先验更新实现面向逆问题的幻觉感知扩散采样

    Diffusion-based inverse problem solvers can produce realistic reconstructions, but realism alone does not ensure that the recovered details are supported by the measurement. We study this failure as measurement-conditioned hallucination: visually meaningful content that is either…

  85. Hugging Face Daily Papers TIER_1 English(EN) ·

    基于扩散模型的漂移估计器的误差界限

    Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drift when the diffusion parameter is known, using d…

  86. arXiv cs.LG TIER_1 English(EN) · Alireza Kheirandish, Jihoon Hong, Sara Fridovich-Keil ·

    KLIP:基于KL散度和扩散先验的逆问题中的局部分布偏移检测

    arXiv:2605.31596v1 Announce Type: cross Abstract: Diffusion models have shown promising performance as data-driven priors for computational imaging, as well as some capacity to detect out-of-distribution (OOD) images. However, existing approaches to OOD detection often require so…

  87. arXiv cs.AI TIER_1 English(EN) · Enrico Cassano, Riccardo Renzulli, Marco Nurisso, Mirko Zaffaroni, Alan Perotti, Marco Grangetto ·

    SAEmnesia:在监督稀疏自编码器中擦除扩散模型中的概念

    arXiv:2509.21379v3 Announce Type: replace-cross Abstract: Concept unlearning in diffusion models is hampered by feature splitting, where concepts are distributed across many latent features, making their removal challenging and computationally expensive. We introduce SAEmnesia, a…

  88. arXiv cs.CL TIER_1 English(EN) · Zekai Li, Ji Liu, Yiqing Huang, Ziqiong Liu, Dong Li, Emad Barsoum ·

    通过时空并行解码和置信外推实现高效扩散大模型

    arXiv:2605.30753v1 Announce Type: new Abstract: Diffusion-based large language models (dLLMs) support parallel text generation via iterative denoising, yet inference remains latency-heavy because many steps are spent on redundant refinement and repeated remasking of tokens whose …

  89. arXiv cs.AI TIER_1 English(EN) · Jinwoo Kim, S\'ekou-Oumar Kaba, Jiyun Park, Seunghoon Hong, Siamak Ravanbakhsh ·

    通过扩散采样反转数据变换

    arXiv:2602.08267v2 Announce Type: replace-cross Abstract: We study the problem of transformation inversion on general Lie groups: a datum is transformed by an unknown group element, and the goal is to recover an inverse transformation that maps it back to the original data distri…

  90. arXiv cs.LG TIER_1 English(EN) · Victor M. Yeom-Song, Severi Rissanen, Arno Solin, Samuel Kaski, Mingfei Sun ·

    Softly Constrained Denoisers for Diffusion Models Applied to Partial Differential Equations

    arXiv:2512.14980v4 Announce Type: replace Abstract: Diffusion models have become a powerful generative prior for solutions of partial differential equations (PDEs). Existing approaches enforce physical constraints either by adding the PDE residuals as loss regularizers or through…

  91. Hugging Face Daily Papers TIER_1 English(EN) ·

    用于统一且数据高效的图像到图像翻译的解耦残差去噪扩散模型

    Decoupled Residual Denoising Diffusion models (DRDD) improve unified image-to-image translation by separating noise diffusion for domain harmonization from residual diffusion for semantic mapping, enhancing data efficiency and performance.

  92. arXiv cs.LG TIER_1 English(EN) · Sara Fridovich-Keil ·

    KLIP:基于KL散度和扩散先验的逆问题中的局部分布偏移检测

    Diffusion models have shown promising performance as data-driven priors for computational imaging, as well as some capacity to detect out-of-distribution (OOD) images. However, existing approaches to OOD detection often require some knowledge of the shifted distribution, fail to …

  93. arXiv cs.LG TIER_1 English(EN) · Leyi Qi, Yiming Li, Siyuan Liang, Zhengzhong Tu, Dacheng Tao ·

    Cert-LAS:面向文本到图像扩散模型的认证层自适应平滑方法,实现模型所有权认证

    arXiv:2605.29809v1 Announce Type: cross Abstract: Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingl…

  94. arXiv cs.LG TIER_1 English(EN) · Itamar Levine, Yair Weiss ·

    扩散模型、去噪器架构与创造力

    arXiv:2605.16415v2 Announce Type: replace-cross Abstract: The creativity of diffusion models refers to their ability to generate highly realistic images that are different from their training data. Creativity is somewhat surprising since it is known that if the denoiser used in t…

  95. arXiv cs.LG TIER_1 English(EN) · Benjamin A. Burns, Sara Fridovich-Keil ·

    何时、为何以及如何失效扩散后验采样器?有限样本视角

    arXiv:2605.30330v1 Announce Type: new Abstract: Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any mea…

  96. arXiv cs.LG TIER_1 English(EN) · Danylo Boiko, Viktoriia Mishkurova ·

    面向神经退行性疾病进展预测的治疗条件扩散模型

    arXiv:2605.29932v1 Announce Type: new Abstract: Forecasting the progression of neurodegenerative diseases, such as Parkinson's disease, is essential for effective long-term planning and personalized therapeutic intervention. Existing systems typically produce scalar clinical scor…

  97. arXiv cs.LG TIER_1 English(EN) · Gabriel Moreira, Manuel Marques, Jo\~ao Paulo Costeira, Chenyan Xiong ·

    Spectral Guidance for Flexible and Efficient Control of Diffusion Models

    arXiv:2605.28900v1 Announce Type: new Abstract: We introduce Spectral Guidance, a framework for controlling diffusion models by leveraging the intrinsic geometry of the generative process. As data is progressively corrupted by noise, only a small number of features remain informa…

  98. arXiv cs.AI TIER_1 English(EN) · Xincheng Wang, Hanchi Sun, Wenjun Sun, Kejun Xue, Wangqiu Zhou, Jianbo Zhang, Wei Sun, Dandan Zhu, Xiongkuo Min, Jun Jia, Zhijun Fang ·

    评估数据集水印技术在定制化扩散模型微调溯源中的应用:综合基准测试与去除方法

    arXiv:2511.19316v2 Announce Type: replace-cross Abstract: Recent fine-tuning techniques for diffusion models enable them to reproduce specific image sets, such as particular faces or artistic styles, but also introduce copyright and security risks. Dataset watermarking has been p…

  99. arXiv cs.AI TIER_1 English(EN) · Antoni Kowalczuk, Dominik Hintersdorf, Lukas Struppek, Kristian Kersting, Adam Dziedzic, Franziska Boenisch ·

    发现 DoRI:扩散模型中保留图像的发现

    arXiv:2507.16880v3 Announce Type: replace-cross Abstract: Text-to-image diffusion models (DMs) have achieved remarkable success in image generation. However, concerns about data privacy and intellectual property remain due to their potential to inadvertently memorize and replicat…

  100. arXiv cs.AI TIER_1 English(EN) · Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang ·

    NaRA:用于扩散式大语言模型参数高效微调的噪声感知LoRA

    arXiv:2605.29716v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising non-autoregressive generative paradigm. Given the prohibitive computational cost of full fine-tuning, Parameter-Efficient Fine-Tuning (PEFT) has become the standard…

  101. arXiv cs.AI TIER_1 English(EN) · Dueun Kim, Albert No ·

    置信度捷径:掩码扩散模型的推理失败模式

    arXiv:2605.29123v1 Announce Type: new Abstract: Masked diffusion language models (MDMs) uniquely support any-order generation, with confidence-based decoding currently serving as the de facto standard inference policy. To optimize for this, recent training schemes attempt to alig…

  102. arXiv cs.AI TIER_1 English(EN) · Yuhao Sun, Lingyun Yu, Haoxiang Xu, Fengyuan Miao, Zhuoer Xu, Hongtao Xie ·

    正交概念擦除用于扩散模型

    arXiv:2605.28902v1 Announce Type: new Abstract: Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations. While training-based methods are effective, their high computa…

  103. arXiv cs.LG TIER_1 English(EN) · Sara Fridovich-Keil ·

    扩散后验采样器何时、为何以及如何失败?一个有限样本视角

    Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any measurement model at inference time but must use an…

  104. arXiv cs.AI TIER_1 English(EN) · Seunghyeok Shin, Minwoo Kim, Dabin Kim, Hongki Lim ·

    具有去噪器-拉回曲率引导和流形对齐阻尼的几何校正扩散后验采样

    arXiv:2605.27990v1 Announce Type: cross Abstract: Diffusion posterior sampling conditions diffusion priors on measurements, but data-consistency updates are typically scaled by hand-tuned guidance weights and can destabilize sampling under stiff, operator-dependent curvature. We …

  105. arXiv cs.LG TIER_1 English(EN) · Abduragim Shtanchaev, Albina Ilina, Yazid Janati, Arip Asadulaev, Martin Takac, Eric Moulines ·

    稀疏调度扩散引导用于逆问题

    arXiv:2603.07860v2 Announce Type: replace Abstract: Pretrained diffusion models are effective priors for Bayesian inverse problems, but posterior sampling with these priors is often costly because data-consistency guidance is applied throughout the full reverse trajectory. Existi…

  106. arXiv cs.AI TIER_1 English(EN) · Chieh-Hsin Lai, Yang Song, Dongjun Kim, Yuki Mitsufuji, Stefano Ermon ·

    扩散模型原理

    arXiv:2510.21890v2 Announce Type: replace-cross Abstract: This book presents the core principles that have guided the development of diffusion models, tracing their origins and showing how diverse formulations arise from shared mathematical ideas. Diffusion modeling starts by def…

  107. arXiv cs.AI TIER_1 English(EN) · Hyunmin Cho, Woo Kyoung Han, Kyong Hwan Jin ·

    通过对称注意力分解平衡扩散模型中的保真度和多样性:从霍普菲尔德视角

    arXiv:2605.27476v1 Announce Type: cross Abstract: We characterize the pre-softmax attention matrix $\mathbf{QK^\top}$ in transformers as an associative memory matrix encoding pairwise associations between input features. By decomposing this matrix into its symmetric and skew-symm…

  108. arXiv cs.AI TIER_1 English(EN) · Calvin Yeung, Prathyush Poduval, Ali Zakeri, Zhuowen Zou, Mohsen Imani ·

    用于解释扩散模型的残差时间稀疏自编码器

    arXiv:2605.27813v1 Announce Type: cross Abstract: Text-to-image diffusion models generate images through an iterative denoising process, so internal neural layers produce trajectories of activations rather than single static representations. Sparse autoencoders (SAEs) have recent…

  109. arXiv cs.LG TIER_1 English(EN) · Andrew Millard, Fredrik Lindsten, Zheng Zhao ·

    用于偏微分方程的粒子引导扩散模型

    arXiv:2601.23262v2 Announce Type: replace Abstract: We introduce a guided stochastic sampling method that augments sampling from diffusion models with physics-based guidance derived from partial differential equation (PDE) residuals and observational constraints, ensuring generat…

  110. arXiv cs.LG TIER_1 English(EN) · Nithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsen ·

    用于引导训练数据生成的表示条件扩散模型

    arXiv:2605.27495v1 Announce Type: cross Abstract: Data availability remains a critical bottleneck in many deep learning applications. Large-scale datasets are often expensive to collect, curate and annotate, which can limit the scalability and applicability of supervised learning…

  111. arXiv cs.LG TIER_1 English(EN) · Zhengyang Liang, Qihang Zhang, Ceyuan Yang ·

    显式批评指导用于对齐扩散模型

    arXiv:2605.27736v1 Announce Type: new Abstract: Online reinforcement learning is becoming increasingly important for aligning diffusion models with non-differentiable objectives. However, existing methods still face limitations in assigning fine-grained credit along denoising tra…

  112. arXiv cs.AI TIER_1 English(EN) · Peiliang Cai, Jiacheng Liu, Haowen Xu, Xinyu Wang, Chang Zou, Linfeng Zhang ·

    LESA:用于扩散模型加速的可学习阶段感知预测器

    arXiv:2602.20497v3 Announce Type: replace-cross Abstract: Diffusion models have achieved remarkable success in image and video generation tasks. However, the high computational demands of Diffusion Transformers (DiTs) pose a significant challenge to their practical deployment. Wh…

  113. arXiv cs.AI TIER_1 English(EN) · Gabriel Raya, Bac Nguyen, Georgios Batzolis, Yuhta Takida, Dejan Stancevic, Naoki Murata, Chieh-Hsin Lai, Yuki Mitsufuji, Luca Ambrogioni ·

    噪声调度作为扩散训练中的信息引导分配

    arXiv:2602.18647v2 Announce Type: replace-cross Abstract: We introduce InfoNoise, an online adaptive noise schedule for diffusion training that reallocates optimization effort toward noise levels where denoising is most informative. Together with loss weighting, a noise schedule …

  114. Hugging Face Daily Papers TIER_1 English(EN) ·

    彩色噪声扩散采样

    Diffusion models exhibit spectral bias in image synthesis, and a new sampling method called Colored Noise Sampling addresses this by dynamically allocating energy based on frequency-dependent schedules, leading to improved image quality metrics.

  115. arXiv cs.AI TIER_1 English(EN) · William Yuan, Sungwon Jeong, Amirali Aghazadeh ·

    离散扩散中随机性的纠错效应

    arXiv:2605.26582v1 Announce Type: cross Abstract: Discrete diffusion models achieve strong performance in text and image generation, but their inference remains slow and must inherently balance sampling efficiency and sample quality. In this work, we present a systematic study of…

  116. arXiv cs.AI TIER_1 English(EN) · Mahdi Farahbakhsh, Vishnu Teja Kunde, Dileep Kalathil, Krishna Narayanan, Jean-Francois Chamberland ·

    使用侧面信息的推理时间搜索用于基于扩散的图像重建

    arXiv:2510.03352v3 Announce Type: replace-cross Abstract: Diffusion models have been used as priors for solving inverse problems. However, existing approaches typically overlook side information that could significantly improve reconstruction quality, especially in severely ill-p…

  117. arXiv cs.LG TIER_1 English(EN) · Xing Cong, Hanlin Tang, Kan Liu, Lan Tao, Lin Qu, Chenhao Xie ·

    RT-Lynx: 将 GEMM 稀疏性以正确方式应用于扩散模型

    arXiv:2605.26632v1 Announce Type: new Abstract: Diffusion Transformers (DiT) achieve strong performance in image generation but incur substantial inference costs. While prior work has reduced this cost via quantization and distillation, semi-structured sparsity, which can nearly …

  118. arXiv cs.LG TIER_1 English(EN) · Loukas Sfountouris, Giannis Daras, Paris Giampouras ·

    对齐与反转:通过表示对齐使用扩散模型和流模型解决逆问题

    arXiv:2511.16870v3 Announce Type: replace-cross Abstract: Enforcing alignment between the internal representations of diffusion or flow-based generative models and those of pretrained self-supervised encoders has recently been shown to provide a powerful inductive bias, improving…

  119. arXiv cs.LG TIER_1 English(EN) · Nicola Novello, Federico Fontana, Luigi Cinque, Deniz Gunduz, Andrea M. Tonello ·

    面向 f-散度扩散模型遗忘的统一框架

    arXiv:2509.21167v2 Announce Type: replace Abstract: Most existing methods for concept unlearning in text-to-image diffusion models minimize a mean squared error (MSE) loss between the denoiser outputs conditioned on a target and an anchor concept, which is implicitly the KL diver…

  120. arXiv cs.LG TIER_1 English(EN) · Nithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsen ·

    通过表征条件化扩散模型实现可控图像生成

    arXiv:2605.27343v1 Announce Type: cross Abstract: Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such…

  121. arXiv cs.LG TIER_1 English(EN) · Funing Fu, Tenghui Wang, Junyong Cen, Qichao Zhu, Guanyu Zhou ·

    JLT:潜在扩散 Transformer 中的清洁潜在预测

    arXiv:2605.27102v1 Announce Type: cross Abstract: Flow matching with clean-data prediction has shown that regressing the clean point can exploit low-dimensional structure more effectively than predicting an ambient noised quantity. We ask whether this principle remains useful aft…

  122. arXiv cs.LG TIER_1 English(EN) · Gwangho Kim, Sungyoon Lee ·

    通过逐坐标曲率差异在扩散模型中本地化记忆区域

    arXiv:2605.26756v1 Announce Type: new Abstract: Diffusion models can unintentionally memorize training samples, raising concerns about privacy and copyright. While recent methods can detect memorization, they often rely on global or model-specific signals and provide limited insi…

  123. arXiv cs.AI TIER_1 English(EN) · Junseo Bang, Joonhee Lee, Kyeonghyun Lee, Haechang Lee, Dong Un Kang, Se Young Chun ·

    用于任意尺度图像超分辨率的自级联扩散模型

    arXiv:2506.07813v2 Announce Type: replace-cross Abstract: Arbitrary-scale image super-resolution aims to upsample images to any desired resolution, offering greater flexibility than traditional fixed-scale super-resolution. Recent approaches based on regression-based or generativ…

  124. arXiv cs.LG TIER_1 English(EN) · Gabriel Eilertsen ·

    通过表示条件扩散模型实现可控图像生成

    Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text prompts or semantic maps, which require e…

  125. arXiv cs.LG TIER_1 English(EN) · Guanyu Zhou ·

    JLT:潜在扩散 Transformer 中的清洁潜在预测

    Flow matching with clean-data prediction has shown that regressing the clean point can exploit low-dimensional structure more effectively than predicting an ambient noised quantity. We ask whether this principle remains useful after images are mapped into a learned latent space, …

  126. Hugging Face Daily Papers TIER_1 English(EN) ·

    离散扩散中随机性的纠错效应

    Discrete diffusion models achieve strong performance in text and image generation, but their inference remains slow and must inherently balance sampling efficiency and sample quality. In this work, we present a systematic study of how the \emph{degree of stochasticity} in Markov …

  127. arXiv cs.AI TIER_1 English(EN) · Matthew Niedoba, Berend Zwartsenberg, Frank Wood ·

    Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization

    arXiv:2605.24192v1 Announce Type: cross Abstract: The neural-network denoising functions which form the backbone of image diffusion models are remarkably consistent in their generalization behaviour across a wide variety of network architectures and training procedure hyperparame…

  128. arXiv cs.AI TIER_1 English(EN) · Sol Park, Soobin Um ·

    超越生成先验:使用JEPA引导的扩散进行少数采样

    arXiv:2605.24631v1 Announce Type: cross Abstract: Minority sampling aims to generate low-density instances on a data manifold and is of central importance in applications such as medical diagnosis, anomaly detection, and creative AI. Existing approaches, however, define minority …

  129. arXiv cs.AI TIER_1 English(EN) · Weixin Wang, Yu Yang, Wei Deng, Pan Xu ·

    通过信任区域迭代扭曲序列蒙特卡洛在推理时对扩散模型进行对齐

    arXiv:2605.25123v1 Announce Type: cross Abstract: We study inference-time alignment for diffusion-based generative models, aiming to steer a base model toward high-reward outputs without updating its weights. Recent Sequential Monte Carlo (SMC)-based steering methods approximate …

  130. arXiv cs.AI TIER_1 English(EN) · Qingyuan Zeng, Pengxiang Cai, Zixin Guan, Ziyang Chen, Anglin Liu, Lang Qin, Xinyao Lai, Jintai Chen ·

    无需重新训练,只需复用:从单目标扩散模型中恢复双目标分子

    arXiv:2605.25681v1 Announce Type: cross Abstract: Designing a single molecule that modulates two targets is a promising strategy for polypharmacology, but it remains substantially harder than standard single-target generation because one candidate must satisfy two binding require…

  131. arXiv cs.LG TIER_1 English(EN) · Arran Carter, Sanghyeok Choi, Kirill Tamogashev, V\'ictor Elvira, Nikolay Malkin ·

    离散扩散采样器与桥接:离策略算法及其在潜在空间中的应用

    arXiv:2602.05961v2 Announce Type: replace Abstract: Sampling from a distribution $p(x) \propto e^{-\mathcal{E}(x)}$ known up to a normalising constant is an important and challenging problem in statistics. Recent years have seen the rise of a new family of amortised sampling algo…

  132. arXiv cs.LG TIER_1 English(EN) · Sungwon Park, Anthony Zhou, Hongjoong Kim, Amir Barati Farimani ·

    生成式神经算子通过扩散最后一层

    arXiv:2602.04139v2 Announce Type: replace Abstract: Neural operators provide a powerful framework for learning discretization invariant mappings between function spaces, but standard deterministic models do not capture predictive uncertainty. We introduce diffusion last layer (DL…

  133. arXiv cs.LG TIER_1 English(EN) · Nishanth Shetty, Madhava Prasath, Chandra Sekhar Seelamantula ·

    Dale meets Langevin:一种乘法去噪扩散模型

    arXiv:2510.02730v2 Announce Type: replace Abstract: Exponentiated gradient descent (EGD), a biologically motivated optimisation algorithm that respects Dale's law, produces log-normally distributed synaptic weights at convergence, in alignment with experimental observations in ne…

  134. arXiv cs.LG TIER_1 English(EN) · Jaeyeon Kim, Seunggeun Kim, Taekyun Lee, David Z. Pan, Hyeji Kim, Sham Kakade, Sitan Chen ·

    微调掩码扩散模型以实现可证明的自我修正

    arXiv:2510.01384v4 Announce Type: replace Abstract: A natural desideratum for generative models is self-correction--detecting and revising low-quality tokens at inference. While Masked Diffusion Models (MDMs) have emerged as a promising approach for generative modeling in discret…

  135. arXiv cs.LG TIER_1 English(EN) · Yanbo Xu, Yu Wu, Sungjae Park, Zhizhuo Zhou, Shubham Tulsiani ·

    扩散模型和流模型中温度采样的时间分数重缩放

    arXiv:2510.01184v2 Announce Type: replace Abstract: We present a mechanism to steer the sampling diversity of denoising diffusion and flow matching models, allowing users to sample from a sharper or broader distribution than the training distribution. We build on the observation …

  136. arXiv cs.LG TIER_1 English(EN) · Zichao Yue, Zhiru Zhang ·

    重新审视预传播GNN:鲁棒的扩散算子和隐藏状态的再传播

    arXiv:2605.25111v1 Announce Type: new Abstract: Pre-propagation graph neural networks (PPGNNs) decouple node feature propagation from transformation: graph diffusion is performed once as preprocessing, and training reduces to dense per-node transformations. This design enables mi…

  137. arXiv cs.CL TIER_1 English(EN) · Ke Lin, Yiyang Luo, Zhaolong Su, Yunya Song, Anyi Rao ·

    Triplet-Block Diffusion RWKV

    arXiv:2605.25969v1 Announce Type: new Abstract: Causal Transformer language models suffer from strictly sequential decoding and a quadratic per-step attention cost. While linear-time causal models and discrete diffusion models each address these weaknesses, their integration rema…

  138. arXiv cs.AI TIER_1 English(EN) · Ziheng Cheng, Yixiao Huang, Hanlin Zhu, Haoran Geng, Somayeh Sojoudi, Jitendra Malik, Pieter Abbeel, Xin Guo ·

    扩散模型的多目标学习:半监督学习下的统计理论

    arXiv:2605.25210v1 Announce Type: cross Abstract: Diffusion models are increasingly used as powerful conditional generators, yet real deployments often involve multiple target distributions arising from different tasks, e.g., diverse prompt domains in text-to-image generation, or…

  139. arXiv cs.LG TIER_1 English(EN) · R\'emi Bourgerie, \v{S}ar\=unas Girdzijauskas, Viktoria Fodor ·

    深度神经网络层扩散

    arXiv:2605.19021v2 Announce Type: replace Abstract: Deep Graph Neural Networks (GNNs) are essential for capturing complex dependencies in graph-structured data. However, scaling GNNs to depth remains challenging, as stacking layers leads to representation collapse and diminishing…

  140. arXiv cs.AI TIER_1 English(EN) · Zixin Jessie Chen, Zhuo Chen, Archer Wang, Jeff Gore, William T. Freeman, Congyue Deng, Marin Solja\v{c}i\'c ·

    万物皆可缩放:具有连续超分辨率的尺度不变扩散模型

    arXiv:2605.26032v1 Announce Type: cross Abstract: Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introd…

  141. arXiv cs.AI TIER_1 English(EN) · Shaorong Zhang, Rob Brekelmans, Greg Ver Steeg ·

    用于扩散模型的局部 MAP 采样

    arXiv:2510.07343v3 Announce Type: replace-cross Abstract: Diffusion Posterior Sampling (DPS) provides a principled Bayesian approach to inverse problems by sampling from $p(x_0 \mid y)$. While posterior sampling is valuable for capturing uncertainty and multi-modality, many class…

  142. Hugging Face Daily Papers TIER_1 English(EN) ·

    面向逆问题的三元动力学感知扩散后验采样:优化引导和随机性调度

    Generative posterior sampling using diffusion models has emerged as a dominant paradigm for solving inverse problems in imaging, which usually consists of three main components: data consistency (DC) guidance, classifier-free guidance (CFG) and stochasticity. While prior arts hav…

  143. Hugging Face Daily Papers TIER_1 English(EN) ·

    JLT:潜在扩散 Transformer 中的清洁潜在预测

    Latent diffusion models using clean-data prediction outperform velocity prediction in compressed representations, demonstrating that prediction targets are geometrically dependent rather than algebraically interchangeable.

  144. Hugging Face Daily Papers TIER_1 English(EN) ·

    RT-Lynx:为扩散模型正确地引入 GEMM 稀疏性

    Diffusion Transformers achieve strong image generation performance but face high inference costs; this work proposes RT-Lynx, which uses activation sparsification and optimized CUDA kernels to accelerate inference while maintaining generation quality.

  145. Hugging Face Daily Papers TIER_1 English(EN) ·

    通过对称注意力分解平衡扩散模型中的保真度和多样性:霍普菲尔德视角

    The symmetric and skew-symmetric components of transformer attention matrices are analyzed as governing energy landscape structure and circulation dynamics, respectively, with implications for generation trade-offs.

  146. Hugging Face Daily Papers TIER_1 English(EN) ·

    万物皆可缩放:具有连续超分辨率的尺度不变扩散模型

    Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introduce $\textbf{SKILD}$, a $\textbf{S}$cale-invariant…

  147. arXiv cs.AI TIER_1 English(EN) · Marin Soljačić ·

    万物皆可缩放:具有连续超分辨率的尺度不变扩散模型

    Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introduce $\textbf{SKILD}$, a $\textbf{S}$cale-invariant…

  148. arXiv cs.CL TIER_1 English(EN) · Anyi Rao ·

    Triplet-Block Diffusion RWKV

    Causal Transformer language models suffer from strictly sequential decoding and a quadratic per-step attention cost. While linear-time causal models and discrete diffusion models each address these weaknesses, their integration remains inherently inconsistent: diffusion requires …

  149. arXiv cs.AI TIER_1 English(EN) · Jintai Chen ·

    无需重新训练,只需重复使用:从单目标扩散模型中恢复双目标分子

    Designing a single molecule that modulates two targets is a promising strategy for polypharmacology, but it remains substantially harder than standard single-target generation because one candidate must satisfy two binding requirements while preserving drug-likeness and synthesiz…

  150. arXiv cs.LG TIER_1 English(EN) · Egor Lifar, Semyon Savkin, Timur Garipov, Shangyuan Tong, Tommi Jaakkola ·

    扩散域扩展:学习协调预训练扩散模型

    arXiv:2605.23275v1 Announce Type: new Abstract: In this paper, we propose Diffusion Domain Expansion (DDE), a method that efficiently extends pre-trained diffusion models to generate larger objects and handle more complex conditioning beyond their original capabilities. Our metho…

  151. arXiv cs.CL TIER_1 English(EN) · Kaisen Yang, Jayden Teoh, Kaicheng Yang, Yitong Zhang, Alex Lamb ·

    通过信息增益改进掩码扩散模型的采样

    arXiv:2602.18176v3 Announce Type: replace Abstract: Masked Diffusion Models (MDMs) enable flexible decoding orders, yet existing samplers remain largely greedy, selecting locally certain tokens without accounting for their downstream effects. We show that this myopia can increase…

  152. arXiv cs.LG TIER_1 English(EN) · Jaihoon Kim, Taehoon Yoon, Prin Phunyaphibarn, Seungjun Kim, Morteza Mardani, Minhyuk Sung ·

    用于离散扩散中摊销序贯蒙特卡洛的对比分布匹配

    arXiv:2605.23346v1 Announce Type: new Abstract: Discrete diffusion models have emerged as powerful frameworks for generating structured categorical data. However, efficiently sampling from reward-tilted distributions remains a fundamental challenge. While Twisted Sequential Monte…

  153. arXiv cs.LG TIER_1 English(EN) · Benjamin Rozonoyer, Jacopo Minniti, Dhruvesh Patel, Neil Band, Avishek Joey Bose, Tim G. J. Rudner, Andrew McCallum ·

    面向前瞻性离散扩散模型的学习式中继表示

    arXiv:2605.22967v1 Announce Type: new Abstract: When Masked Diffusion Models (MDMs) generate sequences through iterative refinement, the rich internal computation over masked positions is discarded, forcing every subsequent refinement step to recompute the valuable internal infor…

  154. Hugging Face Daily Papers TIER_1 English(EN) ·

    万物皆可缩放:具有连续超分辨率的尺度不变扩散模型

    SKILD is a scale-invariant k-space image learning diffusion model that unifies image generation and continuous super-resolution through a single unconditional framework by leveraging scale invariance in image content and physics systems.

  155. Hugging Face Daily Papers TIER_1 English(EN) ·

    Triplet-Block Diffusion RWKV

    B³D-RWKV combines diffusion and RWKV architectures to achieve parallel, bidirectional processing with improved decoding speed while maintaining competitive accuracy.

  156. Hugging Face Daily Papers TIER_1 English(EN) ·

    在推理时将图像引导注入文本条件扩散模型

    Visual Concept Fusion enables dual text and image conditioning in diffusion models through feature alignment and fusion strategies without requiring retraining.

  157. arXiv cs.CL TIER_1 English(EN) · Chunsan Hong, Sanghyun Lee, Jong Chul Ye ·

    统一具有各种生成顺序的掩码扩散模型及其扩展

    arXiv:2602.02112v2 Announce Type: replace-cross Abstract: Masked diffusion models (MDMs) are a potential alternative to autoregressive models (ARMs) for language generation, but generation quality depends critically on the generation order. Prior work either hard-codes an orderin…

  158. arXiv cs.LG TIER_1 English(EN) · Jack Kendall ·

    Equilibrium Propagation and Hamiltonian Inference in the Diffusive Fitzhugh-Nagumo Model

    arXiv:2605.21568v1 Announce Type: new Abstract: In this work, we extend the Equilibrium Propagation framework to skew-gradient systems and show an equivalence between deep Energy-Based Models and Hamiltonian neural networks. We focus on networks of diffusively coupled Fitzhugh-Na…

  159. arXiv cs.LG TIER_1 English(EN) · Mengni Jia, Mengyu Zhou, Yihao Liu, Xiaoxi Jiang, Guanjun Jiang ·

    为扩散模型带来稳定性:分解和降低训练掩码扩散模型的方差

    arXiv:2511.18159v2 Announce Type: replace Abstract: Masked diffusion models (MDMs) are a promising alternative to autoregressive models (ARMs), but they suffer from inherently much higher training variance. High variance leads to noisier gradient estimates and unstable optimizati…

  160. arXiv cs.LG TIER_1 English(EN) · Samuel Koovely, Alexandre Bovet ·

    时间网络上热扩散的条件熵

    arXiv:2605.21514v1 Announce Type: cross Abstract: Many complex systems can be modeled by temporal networks, whose organization often evolves through distinct structural phases. Detecting the change points that delimit these phases is both important and challenging. In this work, …

  161. arXiv cs.LG TIER_1 English(EN) · Seo Taek Kong, Weina Wang, R. Srikant ·

    Diffusion模型噪声调度设计:最优控制视角

    arXiv:2605.21911v1 Announce Type: new Abstract: We develop a principled framework for analyzing and designing noise schedules in diffusion models. We show that one can recast this design problem as an optimal control problem, whose state is the Fisher information of the diffusion…

  162. arXiv cs.AI TIER_1 English(EN) · Wei Huang, Andi Han, Mingyuan Bai, Huanjian Zhou, Qixin Zhang, Taiji Suzuki, Kenji Fukumizu ·

    在流形假设下可证明地学习扩散模型:坍塌与精炼

    arXiv:2605.20235v1 Announce Type: cross Abstract: Diffusion models generate high-dimensional data with remarkable quality, yet how their training efficiently learns the score function, bypassing the curse of dimensionality when data is supported on low-dimensional manifolds, rema…

  163. arXiv cs.AI TIER_1 English(EN) · Chenyang An, Xiaoqian Xu ·

    二阶常微分方程的下界:利用AI生成证明进行探索

    arXiv:2605.20623v1 Announce Type: cross Abstract: We establish explicit lower bounds for advection-diffusion equations in three settings: a polynomial $\dot H^{-1}$ bound for inviscid shears with $u\in L^\infty_t W^{1,1}_y$, a uniform positive lower bound on the mixing scale for …

  164. arXiv cs.AI TIER_1 English(EN) · Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau, Marylou Gabri\'e ·

    扩散模型中的关键减速

    arXiv:2605.12597v2 Announce Type: replace-cross Abstract: Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theor…

  165. arXiv cs.CL TIER_1 English(EN) · Jiayi Fu, Yuxia Wang ·

    扩散理论教程:从微分方程到扩散模型

    arXiv:2605.22586v1 Announce Type: cross Abstract: This tutorial develops diffusion models from the viewpoint of differential equations. We begin with the conditional Gaussian forward process and show that this path admits both an ordinary differential equation (ODE) representatio…

  166. Hugging Face Daily Papers TIER_1 English(EN) ·

    用于离散扩散中摊销序贯蒙特卡洛的对比分布匹配

    Contrastive Distribution Matching addresses efficient sampling from reward-tilted distributions in discrete diffusion models through learned twist functions that reduce computational overhead while maintaining accuracy across diverse applications.

  167. arXiv cs.CL TIER_1 English(EN) · Yuxia Wang ·

    扩散理论教程:从微分方程到扩散模型

    This tutorial develops diffusion models from the viewpoint of differential equations. We begin with the conditional Gaussian forward process and show that this path admits both an ordinary differential equation (ODE) representation and a stochastic differential equation (SDE) rep…

  168. Hugging Face Daily Papers TIER_1 English(EN) ·

    通过输出相似度感知重新思考扩散模型的Token缩减

    Diffusion Transformers (DiTs) achieve superior image generation quality but suffer from quadratic computational complexity relative to token count. While various token reduction (TR) methods have been proposed to mitigate this cost, they overlook the primary objective of generati…

  169. Hugging Face Daily Papers TIER_1 English(EN) ·

    统一扩散模型再探:留一去噪器与吸收态重构

    Discrete diffusion models are often trained through clean-data prediction, but the prediction can be used in different ways to define the reverse dynamics. In Masked Diffusion Models (MDM) these choices largely coincide, whereas in Uniform Diffusion Models (UDM) they do not. We s…

  170. arXiv cs.AI TIER_1 English(EN) · Xiaoqian Xu ·

    二阶常微分方程的下界:基于AI生成证明的探索

    We establish explicit lower bounds for advection-diffusion equations in three settings: a polynomial $\dot H^{-1}$ bound for inviscid shears with $u\in L^\infty_t W^{1,1}_y$, a uniform positive lower bound on the mixing scale for diffusive shears, and an exponential $L^2$ bound f…

  171. Hugging Face Daily Papers TIER_1 English(EN) ·

    关于扩散模型中潜在重用的局限性

    Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a source-target setting in which both datasets are app…

  172. arXiv cs.LG TIER_1 English(EN) · Lu Yu ·

    关于扩散模型中潜在重用的局限性

    Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a source-target setting in which both datasets are app…

  173. arXiv cs.CL TIER_1 English(EN) · Yo-Sub Han ·

    扩散语言模型安全生成中的自适应转向和重掩码

    Diffusion Language Models (DLMs) provide a promising alternative to autoregressive language models by generating text through iterative denoising and bidirectional refinement. However, this iterative generation paradigm also introduces unique safety vulnerabilities when harmful t…

  174. arXiv cs.CL TIER_1 English(EN) · Jong Chul Ye ·

    理解和加速掩码扩散语言模型的训练

    Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models (ARMs) for language modeling. However, MDMs are known to learn substantially more slowly than ARMs, which may become problematic when scaling MDMs to larger models. Therefore, we ask t…

  175. arXiv cs.AI TIER_1 English(EN) · Shubhankar Mohapatra ·

    DriftXpress:通过投影RKHS场实现更快的漂移模型

    Drifting Models have emerged as a new paradigm for one-step generative modeling, achieving strong image quality without iterative inference. The premise is to replace the iterative denoising process in diffusion models with a single evaluation of a generator. However, this create…

  176. arXiv cs.CL TIER_1 English(EN) · Haoliang Li ·

    自蒸馏轨迹感知玻尔兹曼建模:弥合扩散语言模型中的训练-推理差异

    Diffusion Language Models (DLMs) have recently emerged as a promising alternative to autoregressive language models, offering stronger global awareness and highly parallel generation. However, post-training DLMs with standard Negative Evidence Lower Bound (NELBO)-based supervised…

  177. arXiv cs.LG TIER_1 English(EN) · Rui Yu ·

    阐释扩散模型训练中的表示退化问题

    Diffusion models have achieved remarkable success, yet their training remains inefficient due to a severe optimization bottleneck, which we term Representation Degradation. As noise levels increase, the outputs of the trained model exhibit progressive structural distortion, which…

  178. Hugging Face Daily Papers TIER_1 Deutsch(DE) ·

    核梯度漂移模型

    We propose kernel-gradient drifting, a one-step generative modeling framework that replaces the fixed Euclidean displacement direction in drifting models with directions induced by the kernel itself. Standard drifting is attractive because it enables fast, high-quality generation…

  179. arXiv cs.LG TIER_1 Deutsch(DE) · Floor Eijkelboom ·

    核梯度漂移模型

    We propose kernel-gradient drifting, a one-step generative modeling framework that replaces the fixed Euclidean displacement direction in drifting models with directions induced by the kernel itself. Standard drifting is attractive because it enables fast, high-quality generation…

  180. arXiv cs.LG TIER_1 English(EN) · Aaron R. Dinner ·

    通过生成式吉布斯采样,用显式物理上下文组合扩散先验

    Pretrained diffusion models provide powerful learned priors, but in scientific sampling the target distribution often depends on physical context that is not fully represented by one generative model. We introduce Generative Gibbs for Physics-Aware Sampling (GG-PA), a training-fr…

  181. arXiv cs.CL TIER_1 English(EN) · Difan Zou ·

    用于扩散语言模型的相对分数策略优化

    Diffusion large language models (dLLMs) offer a promising route to parallel and efficient text generation, but improving their reasoning ability requires effective post-training. Reinforcement learning with verifiable rewards (RLVR) is a natural choice for this purpose, yet its a…

  182. arXiv cs.AI TIER_1 English(EN) · Andrea M. Tonello ·

    Empty SPACE:扩散模型概念擦除的交叉注意力稀疏性

    Erasing specific concepts from text-to-image diffusion models is essential for avoiding the generation of copyrighted and explicit content. Closed-form concept erasure methods offer a fast alternative to backpropagation-based techniques, but they become less effective when scalin…

  183. arXiv cs.CL TIER_1 English(EN) · Hongsheng Li ·

    面向并行掩码扩散语言模型的基于编辑的精炼

    Masked diffusion language models enable parallel token generation and offer improved decoding efficiency over autoregressive models. However, their performance degrades significantly when generating multiple tokens simultaneously, due to a mismatch between token-level training ob…

  184. arXiv cs.LG TIER_1 English(EN) · Erhan Bayraktar ·

    当扩散模型可以忽略维度:一个基于熵的理论

    Diffusion models perform remarkably well on high-dimensional data such as images, often using only a modest number of reverse-time steps. Despite this practical success, existing convergence theory does not fully explain why such samplers remain efficient in high dimensions. Many…

  185. arXiv cs.CL TIER_1 English(EN) · Dmitry Vetrov ·

    如何联合训练你的潜在扩散语言模型及其潜在空间

    Latent diffusion models offer an attractive alternative to discrete diffusion for non-autoregressive text generation by operating on continuous text representations and denoising entire sequences in parallel. The major challenge in latent diffusion modeling is constructing a suit…

  186. arXiv cs.CL TIER_1 English(EN) · Tim Van de Cruys ·

    引导并非超参数:学习扩散语言模型的动态控制

    Classifier-Free Guidance (CFG) is a widely used mechanism for controlling diffusion-based generative models, yet its guidance scale is typically treated as a fixed hyperparameter throughout generation. This static design yields a suboptimal controllability and quality tradeoff, a…

  187. arXiv cs.LG TIER_1 English(EN) · Chun Kai Ling ·

    无条件扩散模型的推理时属性分布对齐

    Inference-time controllable generation is essential for real-world applications of unconditional diffusion models. However, most existing techniques focus on individual samples, struggling in applications that require the sample population to follow specific attribute distributio…

  188. arXiv cs.LG TIER_1 English(EN) · Benjamin Sterling, Yousef El-Laham, M\'onica F. Bugallo ·

    通过高阶 Langevin 动力学防御扩散模型免受成员推理攻击

    arXiv:2509.14225v3 Announce Type: replace Abstract: Recent advances in generative artificial intelligence applications have raised new data security concerns. This paper focuses on defending diffusion models against membership inference attacks. This type of attack occurs when th…

  189. arXiv cs.LG TIER_1 English(EN) · Canyu Zhao, Hao Chen, Yunze Tong, Yu Qiao, Jiacheng Li, Chunhua Shen ·

    MARBLE:扩散强化学习的多方面奖励平衡

    arXiv:2605.06507v1 Announce Type: cross Abstract: Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task, and multiple evaluation criteria need t…

  190. arXiv cs.LG TIER_1 English(EN) · Flavio Nicoletti, Chenxiao Ma, Enrico Ventura, Luca Saglietti, Stefano Sarao Mannelli ·

    数据结构与不平衡在扩散模型学习动态中的相互作用

    arXiv:2605.06367v1 Announce Type: cross Abstract: Real-world datasets are inherently heterogeneous, yet how per-class structural differences and sampling imbalance shape the training dynamics of diffusion models-and potentially exacerbate disparities-remains poorly understood. Wh…

  191. arXiv cs.LG TIER_1 English(EN) · Meira Iske, Carola-Bibiane Sch\"onlieb ·

    双Lipschitz归一化流的表达能力:基于分数的扩散视角

    arXiv:2605.06172v1 Announce Type: cross Abstract: Many normalizing flow architectures impose regularity constraints, yet their distributional approximation properties are not fully characterized. We study the expressivity of bi-Lipschitz normalizing flows through the lens of scor…

  192. arXiv cs.LG TIER_1 English(EN) · Sankarshana Venugopal (Seoul National University), Mohammad Mostafavi (Seoul National University), Jonghyun Choi (Seoul National University) ·

    DBMSolver:一种无需训练的高质量图像到图像翻译扩散桥采样器

    arXiv:2605.05889v1 Announce Type: cross Abstract: Diffusion-based image-to-image (I2I) translation excels in high-fidelity generation but suffers from slow sampling in state-of-the-art Diffusion Bridge Models (DBMs), often requiring dozens of function evaluations (NFEs). We intro…

  193. arXiv cs.LG TIER_1 English(EN) · Gal Vinograd, Idan Achituve, Ethan Fetaya ·

    Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

    arXiv:2605.06553v1 Announce Type: new Abstract: We present EDDY (Exact-marginal Diversification via Divergence-free dYnamics), a guidance mechanism for diffusion and flow matching models that promotes diversity among samples generated while maintaining quality. EDDY exploits symm…

  194. arXiv cs.LG TIER_1 English(EN) · Matias G. Delgadino, Sebastien Motsch, Advait Parulekar, William Porteous, Sanjay Shakkottai ·

    基于扩散的后验采样:偏差与稳定性的Feynman-Kac分析

    arXiv:2605.06538v1 Announce Type: new Abstract: Diffusion-based posterior samplers use pretrained diffusion priors to sample from measurement- or reward-conditioned posteriors, and are widely used for inverse problems. Yet their theoretical behavior remains poorly understood: eve…

  195. arXiv cs.LG TIER_1 English(EN) · Alexander Conzelmann, Albert Catalan-Tatjer, Shiwei Liu ·

    Diffusion Language Models 中的层坍塌

    arXiv:2605.06366v1 Announce Type: new Abstract: Diffusion language models (DLMs) have recently emerged as competitive alternatives to autoregressive (AR) language models, yet differences in their activation dynamics remain poorly understood. We characterize these dynamics in LLaD…

  196. arXiv cs.LG TIER_1 English(EN) · Eugenio Lomurno, Filippo Balzarini, Francesco Benelle, Francesca Pia Panaccione, Matteo Matteucci ·

    推理时精炼缩小表格扩散中的合成-真实差距

    arXiv:2605.06261v1 Announce Type: new Abstract: Diffusion-based generators set the current state of the art for synthetic tabular data. These methods approach but rarely exceed real-data utility, and closing this synthetic-real gap has so far been pursued exclusively at training …

  197. arXiv cs.CL TIER_1 Français(FR) · Hongcan Guo, Qinyu Zhao, Yian Zhao, Shen Nie, Rui Zhu, Qiushan Guo, Feng Wang, Tao Yang, Hengshuang Zhao, Guoqiang Wei, Yan Zeng ·

    连续潜在扩散语言模型

    arXiv:2605.06548v1 Announce Type: new Abstract: Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve gene…

  198. arXiv cs.LG TIER_1 English(EN) · Ahmad Aghapour, Erhan Bayraktar, Asaf Cohen ·

    线性约束下的条件扩散:Langevin混合与信息论保证

    arXiv:2605.05387v1 Announce Type: new Abstract: We study zero-shot conditional sampling with pretrained diffusion models for linear inverse problems, including inpainting and super-resolution. In these problems, the observation determines only part of the unknown signal. The rema…

  199. arXiv cs.LG TIER_1 English(EN) · Pierre Marion, Yu-Han Wu ·

    理解扩散模型需要(再次)重新思考泛化能力

    arXiv:2605.06077v1 Announce Type: new Abstract: This position paper argues that understanding generalization in diffusion models requires fundamentally new theoretical frameworks that go beyond both classical statistical learning theory and the benign overfitting paradigm develop…

  200. arXiv cs.LG TIER_1 English(EN) · Pengqi Lu ·

    均值模式尖叫:1000层扩散 Transformer 的均值-方差分割残差

    arXiv:2605.06169v1 Announce Type: new Abstract: Scaling Diffusion Transformers (DiTs) to hundreds of layers introduces a structural vulnerability: networks can enter a silent, mean-dominated collapse state that homogenizes token representations and suppresses centered variation. …

  201. arXiv cs.LG TIER_1 English(EN) · Tongda Xu, Mingwei He, Shady Abu-Hussein, Jose Miguel Hernandez-Lobato, Chunhang Zheng, Kai Zhao, Chao Zhou, Ya-Qin Zhang, Yan Wang ·

    使重建FID能够预测扩散生成FID

    arXiv:2603.05630v2 Announce Type: replace-cross Abstract: It is well known that the reconstruction FID (rFID) of a VAE is poorly correlated with the generation FID (gFID) of a latent diffusion model. We propose interpolated FID (iFID), a simple variant of rFID that exhibits a str…

  202. arXiv cs.LG TIER_1 English(EN) · Manyi Li, Yufan Liu, Lai Jiang, Bing Li, Yuming Li, Weiming Hu ·

    遗忘的幻觉:通过初始潜在变量优化攻击未学习的扩散模型

    arXiv:2602.00175v2 Announce Type: replace Abstract: Text-to-image diffusion models (DMs) are frequently abused to produce harmful or copyrighted content, violating public interests. Concept erasure (unlearning) is a promising paradigm to alleviate this issue. However, there exist…

  203. arXiv cs.LG TIER_1 English(EN) · Ethan Fetaya ·

    Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

    We present EDDY (Exact-marginal Diversification via Divergence-free dYnamics), a guidance mechanism for diffusion and flow matching models that promotes diversity among samples generated while maintaining quality. EDDY exploits symmetries of the Fokker-Planck equation, using drif…

  204. arXiv cs.LG TIER_1 English(EN) · Sanjay Shakkottai ·

    基于扩散的后验采样:偏差与稳定性的Feynman-Kac分析

    Diffusion-based posterior samplers use pretrained diffusion priors to sample from measurement- or reward-conditioned posteriors, and are widely used for inverse problems. Yet their theoretical behavior remains poorly understood: even with exact prior scores, their outputs are bia…

  205. arXiv cs.LG TIER_1 English(EN) · Adrien Jacquet Cr\'etides, Mouad Abrini, Hamed Rahimi, Mohamed Chetouani ·

    Encoding Predictability and Legibility for Style-Conditioned Diffusion Policy

    arXiv:2603.16368v2 Announce Type: replace-cross Abstract: Striking a balance between efficiency and transparent motion is a core challenge in human-robot collaboration, as highly expressive movements often incur unnecessary time and energy costs. In collaborative environments, le…

  206. arXiv cs.LG TIER_1 English(EN) · Michael Rottoli, Subhankar Roy, Stefano Paraboschi ·

    Predict-then-Diffuse:面向计算预算受限的Diffusion LLM推理的自适应响应长度

    arXiv:2605.04215v1 Announce Type: new Abstract: Diffusion-based Large Language Models (D-LLMs) represent a promising frontier in generative AI, offering fully parallel token generation that can lead to significant throughput advantages and superior GPU utilization over traditiona…

  207. arXiv cs.LG TIER_1 English(EN) · Arthur Gretton, Li Kevin Wenliang, Alexandre Galashov, James Thornton, Valentin De Bortoli, Arnaud Doucet ·

    漂移模型上的 Wasserstein梯度流解释

    arXiv:2605.05118v1 Announce Type: new Abstract: Recently, Deng et al. (2026) proposed Generative Modeling via Drifting (GMD), a novel framework for generative tasks. This note presents an analysis of GMD through the lens of Wasserstein Gradient Flows (WGF), i.e., the path of stee…

  208. arXiv cs.LG TIER_1 English(EN) · Christopher Nemeth ·

    通过结构化随机扩散实现超图生成

    arXiv:2605.05024v1 Announce Type: cross Abstract: Hypergraphs model higher-order interactions, but realistic hypergraph generation remains difficult because incidence, hyperedge-size heterogeneity, and overlap structure are not faithfully captured by pairwise reductions. We propo…

  209. arXiv cs.LG TIER_1 Deutsch(DE) · Xiaoyu Wu, Yifei Wang, Tsu-Jui Fu, Liang-Chieh Chen, Zhe Gan, Chen Wei ·

    驯服扩散Transformer中的异常Token

    arXiv:2605.05206v1 Announce Type: cross Abstract: We study outlier tokens in Diffusion Transformers (DiTs) for image generation. Prior work has shown that Vision Transformers (ViTs) can produce a small number of high-norm tokens that attract disproportionate attention while carry…

  210. arXiv cs.LG TIER_1 English(EN) · Kaiwen Zheng, Yuji Wang, Qianli Ma, Huayu Chen, Jintao Zhang, Yogesh Balaji, Jianfei Chen, Ming-Yu Liu, Jun Zhu, Qinsheng Zhang ·

    通过分数正则化连续时间一致性实现大规模扩散蒸馏

    arXiv:2510.08431v3 Announce Type: replace-cross Abstract: Although continuous-time consistency models (e.g., sCM, MeanFlow) are theoretically principled and empirically powerful for fast academic-scale diffusion, its applicability to large-scale text-to-image and video tasks rema…

  211. arXiv cs.LG TIER_1 English(EN) · Riccardo de Lutio, Tobias Fischer, Yen-Yu Chang, Yuxuan Zhang, Jay Zhangjie Wu, Xuanchi Ren, Tianchang Shen, Katarina Tothova, Zan Gojcic, Haithem Turki ·

    ArtiFixer:利用自回归扩散模型增强和扩展三维重建

    arXiv:2603.00492v2 Announce Type: replace-cross Abstract: Per-scene optimization methods such as 3D Gaussian Splatting provide state-of-the-art novel view synthesis quality but extrapolate poorly to under-observed areas. Methods that leverage generative priors to correct artifact…

  212. arXiv cs.LG TIER_1 English(EN) · Andreas Makris, Paul Fearnhead, Chris Nemeth ·

    Tempered Guided Diffusion

    arXiv:2605.03712v1 Announce Type: cross Abstract: Training-free conditional diffusion provides a flexible alternative to task-specific conditional model training, but existing samplers often allocate computation inefficiently: independent guided trajectories can vary widely in qu…

  213. arXiv cs.LG TIER_1 English(EN) · Francesca Romana Crucinio ·

    关于 Kullback--Leibler 散度通过梯度流进行采样的独特性质的说明

    arXiv:2507.04330v2 Announce Type: replace-cross Abstract: We consider the problem of sampling from a probability distribution $\pi$ which admits a density w.r.t. a dominating measure. It is well known that this can be written as an optimisation problem over the space of probabili…

  214. arXiv cs.LG TIER_1 English(EN) · Alexandre Alouadi, Pierre Henry-Labord\`ere, Gr\'egoire Loeper, Othmane Mazhar, Huy\^en Pham, Nizar Touzi ·

    LightSBB-M:连接Schr\"odinger与Bass,用于生成式扩散建模

    arXiv:2601.19312v2 Announce Type: replace Abstract: The Schrodinger Bridge and Bass (SBB) formulation, which jointly controls drift and volatility, is an established extension of the classical Schrodinger Bridge (SB). Building on this framework, we introduce LightSBB-M, an algori…

  215. arXiv cs.LG TIER_1 English(EN) · Eitan Kosman, Gabriele Serussi, Chaim Basking ·

    结构化扩散桥:用于去噪扩散桥的归纳偏置

    arXiv:2605.02973v1 Announce Type: new Abstract: Modality translation is inherently under-constrained, as multiple cross-modal mappings may yield the same marginals. Recent work has shown that diffusion bridges are effective for this task. However, most existing approaches rely on…

  216. arXiv cs.LG TIER_1 English(EN) · James Rowbottom, Elizabeth L. Baker, Nick Huang, Ben Adcock, Carola-Bibiane Sch\"onlieb, Alexander Denker ·

    GRIFDIR:函数空间上不规则域上的图分辨率不变有限元扩散模型

    arXiv:2605.03497v1 Announce Type: new Abstract: Score-based diffusion models in infinite-dimensional function spaces provide a mathematically principled framework for modelling function-valued data, offering key advantages such as resolution invariance and the ability to handle i…

  217. arXiv cs.LG TIER_1 English(EN) · Aaron Havens, Brian Karrer, Neta Shaul ·

    Flow Sampling: 通过去噪条件过程学习从非归一化密度采样

    arXiv:2605.03984v1 Announce Type: new Abstract: Sampling from unnormalized densities is analogous to the generative modeling problem, but the target distribution is defined by a known energy function instead of data samples. Because evaluating the energy function is often costly,…

  218. arXiv cs.LG TIER_1 English(EN) · Francisco M. Castro-Mac\'ias, Pablo Morales-\'Alvarez, Saifuddin Syed, Daniel Hern\'andez-Lobato, Rafael Molina, Jos\'e Miguel Hern\'andez-Lobato ·

    条件扩散采样

    arXiv:2605.04013v1 Announce Type: cross Abstract: Sampling from unnormalized multimodal distributions with limited density evaluations remains a fundamental challenge in machine learning and natural sciences. Successful approaches construct a bridge between a tractable reference …

  219. arXiv cs.LG TIER_1 English(EN) · José Miguel Hernández-Lobato ·

    条件扩散采样

    Sampling from unnormalized multimodal distributions with limited density evaluations remains a fundamental challenge in machine learning and natural sciences. Successful approaches construct a bridge between a tractable reference and the target distribution. Parallel Tempering (P…

  220. arXiv cs.AI TIER_1 English(EN) · Neta Shaul ·

    Flow Sampling: 通过去噪条件过程学习从非归一化密度采样

    Sampling from unnormalized densities is analogous to the generative modeling problem, but the target distribution is defined by a known energy function instead of data samples. Because evaluating the energy function is often costly, a primary challenge is to learn an efficient sa…

  221. arXiv cs.AI TIER_1 English(EN) · Min Zhang ·

    DMGD:扩散模型中具有语义分布匹配的无训练数据集蒸馏

    Dataset distillation enables efficient training by distilling the information of large-scale datasets into significantly smaller synthetic datasets. Diffusion based paradigms have emerged in recent years, offering novel perspectives for dataset distillation. However, they typical…

  222. arXiv cs.LG TIER_1 English(EN) · Chris Nemeth ·

    Tempered Guided Diffusion

    Training-free conditional diffusion provides a flexible alternative to task-specific conditional model training, but existing samplers often allocate computation inefficiently: independent guided trajectories can vary widely in quality, and additional function evaluations along a…

  223. arXiv cs.LG TIER_1 English(EN) · Alexander Denker ·

    GRIFDIR: 函数空间中基于图的、与分辨率无关的FEM扩散模型在不规则域上的应用

    Score-based diffusion models in infinite-dimensional function spaces provide a mathematically principled framework for modelling function-valued data, offering key advantages such as resolution invariance and the ability to handle irregular discretisations. However, practical imp…

  224. arXiv cs.LG TIER_1 English(EN) · Phil Sidney Ostheimer, Mayank Nagda, Andriy Balinskyy, Gabriel Vicente Rodrigues, Jean Radig, Carl Herrmann, Stephan Mandt, Marius Kloft, Sophie Fellenz ·

    Diffusion模型跳过零值以生成稀疏数据

    arXiv:2605.01817v1 Announce Type: new Abstract: Diffusion models (DMs) excel on dense continuous data, but are not designed for sparse continuous data. They do not model exact zeros that represent the deliberate absence of a signal. As a result, they erase sparsity patterns and p…

  225. arXiv cs.LG TIER_1 English(EN) · Tongzhen Dang, Weiyang Ding, Michael K. Ng ·

    具有 $\omega$-参数化核的复杂扩散图揭示固有的谐波表示

    arXiv:2605.01691v1 Announce Type: new Abstract: In this paper, we propose Complex Diffusion Maps (CDM), a novel diffusion mapping framework that aims to reveal the dominant complex harmonics of high-dimensional data. Inspired by the local Gaussian kernel relevant to the heat equa…

  226. arXiv cs.LG TIER_1 Deutsch(DE) · Hasan Amin, Yuan Gao, Yaser Souri, Subhojit Som, Ming Yin, Rajiv Khanna, Xia Song ·

    Consistent Diffusion Language Models

    arXiv:2605.00161v1 Announce Type: new Abstract: Diffusion language models (DLMs) are an attractive alternative to autoregressive models because they promise sublinear-time, parallel generation, yet practical gains remain elusive as high-quality samples still demand hundreds of re…

  227. arXiv cs.LG TIER_1 English(EN) · Carles Domingo-Enrich, Yuanqi Du, Michael S. Albergo ·

    扩散模型和流模型微调与采样统一视角

    arXiv:2605.00229v1 Announce Type: cross Abstract: We study the problem of training diffusion and flow generative models to sample from target distributions defined by an exponential tilting of a base density; a formulation that subsumes both sampling from unnormalized densities a…

  228. arXiv cs.LG TIER_1 English(EN) · Zihan Zhou, Chenguang Wang, Hongyi Ye, Yongtao Guan, Tianshu Yu ·

    数据不全,动力十足:一种扩散方法

    arXiv:2509.20098v2 Announce Type: replace Abstract: Learning physical dynamics from data is a fundamental challenge in machine learning and scientific modeling. Real-world observational data are inherently incomplete and irregularly sampled, posing significant challenges for exis…

  229. arXiv cs.AI TIER_1 English(EN) · Yonggan Fu, Lexington Whalen, Zhifan Ye, Xin Dong, Shizhe Diao, Jingyu Liu, Chengyue Wu, Hao Zhang, Enze Xie, Song Han, Maksim Khadkevich, Jan Kautz, Yingyan Celine Lin, Pavlo Molchanov ·

    Efficient-DLM:从自回归到扩散语言模型,以及速度上的超越

    arXiv:2512.14067v2 Announce Type: replace-cross Abstract: Diffusion language models (dLMs) have emerged as a promising paradigm that enables parallel, non-autoregressive generation, but their learning efficiency lags behind that of autoregressive (AR) language models when trained…

  230. arXiv cs.AI TIER_1 English(EN) · Michael Cardei, Huu Binh Ta, Ferdinando Fioretto ·

    面向掩码扩散模型的简单自适应条件化

    arXiv:2604.26985v1 Announce Type: cross Abstract: Masked diffusion models (MDMs) generate discrete sequences by iterative denoising under an absorbing masking process. In standard masked diffusion, if a token remains masked after a reverse update, the model discards its clean-sta…

  231. arXiv cs.AI TIER_1 English(EN) · Gabe Guo, Thanawat Sornwanee, Lutong Hao, Elon Litman, Stefano Ermon, Jose Blanchet ·

    ABC:连续时空中的非马尔可夫扩散桥的任意子集自回归

    arXiv:2604.27443v1 Announce Type: cross Abstract: Generating continuous-time, continuous-space stochastic processes (e.g., videos, weather forecasts) conditioned on partial observations (e.g., first and last frames) is a fundamental challenge. Existing approaches, (e.g., diffusio…

  232. arXiv cs.LG TIER_1 English(EN) · Hyukjun Lim, Soojung Yang, Lucas Pin\`ede, Miguel Steiner, Yuanqi Du, Rafael G\'omez-Bombarelli ·

    基于引导扩散的先验采样过渡态

    arXiv:2603.25980v2 Announce Type: replace-cross Abstract: Transition states, the first-order saddle points on the potential energy surfaces, govern the kinetics and mechanisms of chemical reactions and conformational changes. Locating them is challenging because transition pathwa…

  233. arXiv cs.CL TIER_1 English(EN) · Yihong Dong, Zhaoyu Ma, Xue Jiang, Zhiyuan Fan, Jiaru Qian, Yongmin Li, Jianha Xiao, Zhi Jin, Rongyu Cao, Binhua Li, Fei Huang, Yongbin Li, Ge Li ·

    Saber:一种用于扩散语言模型的自适应加速和回溯增强重掩码的高效采样方法

    arXiv:2510.18165v3 Announce Type: replace-cross Abstract: Diffusion language models (DLMs) are emerging as a compelling alternative to the dominant autoregressive paradigm, offering inherent advantages in parallel generation and bidirectional context modeling. However, for the ta…

  234. arXiv cs.LG TIER_1 English(EN) · Zixuan Zhang, Kaixuan Huang, Tuo Zhao, Mengdi Wang, Minshuo Chen ·

    流形数据的扩散模型:得分分解、曲率与统计复杂度

    arXiv:2603.20645v2 Announce Type: replace Abstract: Diffusion models have become a leading framework in generative modeling, yet their theoretical understanding -- especially for high-dimensional data concentrated on low-dimensional structures -- remains incomplete. This paper in…

  235. arXiv cs.LG TIER_1 English(EN) · Yuxiang Wang, Yu Xiang, Baojian Zhou, Qifang Zhao, Keyue Jiang, Yanghua Xiao, Xiaoxiao Xu ·

    通过块局部性训练掩码扩散语言模型

    arXiv:2604.24832v1 Announce Type: new Abstract: Masked diffusion language models (MDMs) have recently emerged as a promising alternative to standard autoregressive large language models (AR-LLMs), yet their optimization can be substantially less stable. We study blockwise MDMs an…

  236. Hugging Face Daily Papers TIER_1 English(EN) ·

    从不相交的噪声数据流形探索扩散生成模型中的时间条件

    Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning leads to significant performance degradation. However,…

  237. arXiv cs.LG TIER_1 English(EN) · Enshu Liu, Xuefei Ning, Yu Wang, Zinan Lin ·

    NI 采样:通过 Token 顺序优化加速离散扩散采样

    arXiv:2604.18471v2 Announce Type: replace Abstract: Discrete diffusion language models (dLLMs) have recently emerged as a promising alternative to traditional autoregressive approaches, offering the flexibility to generate tokens in arbitrary orders and the potential of parallel …

  238. arXiv cs.LG TIER_1 English(EN) · Yiming Zhang, Sitong Liu, Ke Li, Zhihong Wu, Alex Cloninger, Melvin Leok ·

    GeoEdit:用于扩散模型上流形编辑的快速、无训练的局部帧

    arXiv:2604.24238v1 Announce Type: new Abstract: Diffusion models are a leading paradigm for data generation, but training-free editing typically re-runs the full denoising trajectory for every edit strength, making iterative refinement expensive. To address this issue, we instead…

  239. arXiv cs.LG TIER_1 English(EN) · Dake Bu, Wei Huang, Andi Han, Hau-San Wong, Qingfu Zhang, Taiji Suzuki, Atsushi Nitanda ·

    DPRM:一种用于扩散语言模型的插入式Doob h变换诱导的Token排序模块

    arXiv:2604.24357v1 Announce Type: new Abstract: Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice: which tokens should be revealed, retained, revised or verified at each step? Existing systems mainly use ran…

  240. arXiv cs.LG TIER_1 English(EN) · Aditi De ·

    用于热力学扩散训练的对称平衡传播

    arXiv:2604.23806v1 Announce Type: new Abstract: The reverse process in score-based diffusion models is formally equivalent to overdamped Langevin dynamics in a time-dependent energy landscape. In our prior work we showed that a bilinearly-coupled analog substrate can physically r…

  241. arXiv cs.LG TIER_1 English(EN) · Dong Liu, Haisheng Wang, Yanxuan Yu ·

    利用误差反馈事件驱动缓存加速频域扩散模型

    arXiv:2604.22901v1 Announce Type: new Abstract: Diffusion models achieve remarkable success in time series generation. However, slow inference limits their practical deployment. We propose E$^2$-CRF (Error-Feedback Event-Driven Cumulative Residual Feature caching) to accelerate f…

  242. arXiv cs.LG TIER_1 English(EN) · Weiguo Gao, Ming Li ·

    迈向通过算子合并进行扩散轨迹蒸馏的理论洞见

    arXiv:2505.16024v2 Announce Type: replace Abstract: Diffusion trajectory distillation accelerates sampling by training a student model to approximate the multi-step denoising trajectories of a pretrained teacher model using far fewer steps. Despite strong empirical results, the t…

  243. arXiv cs.LG TIER_1 English(EN) · Zicheng Lyu, Zengfeng Huang ·

    径向负载--各向同性扩散采样器中 Wasserstein 传播的储备证书

    arXiv:2603.19670v3 Announce Type: replace Abstract: Nonasymptotic diffusion analyses often decompose sampling error into score estimation, continuous reverse-time propagation, discretization, and terminal conversion. We isolate the propagation module on certified scalar-isotropic…

  244. arXiv cs.AI TIER_1 English(EN) · Atsushi Nitanda ·

    DPRM:一种用于扩散语言模型的插入式Doob h变换诱导的Token排序模块

    Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice: which tokens should be revealed, retained, revised or verified at each step? Existing systems mainly use random masking or confidence-driven ordering. Rando…

  245. arXiv cs.LG TIER_1 English(EN) · Melvin Leok ·

    GeoEdit:用于扩散模型上流形上快速、无需训练的编辑的局部框架

    Diffusion models are a leading paradigm for data generation, but training-free editing typically re-runs the full denoising trajectory for every edit strength, making iterative refinement expensive. To address this issue, we instead edit near the data manifold, where small local …

  246. arXiv cs.LG TIER_1 English(EN) · Luca Ambrogioni ·

    训练过的扩散模型中,非平衡相变如何播下模式形成的种子

    arXiv:2603.20092v4 Announce Type: replace Abstract: Diffusion models generate structure by progressively transforming noise into data, yet the mechanisms underlying this transition remain poorly understood. In this work, we show that pattern formation in trained diffusion models …

  247. arXiv stat.ML TIER_1 English(EN) · Ziao Wang, Lei Ying ·

    扩散网络对齐:一种高效算法与显式概率界限

    arXiv:2606.12879v1 Announce Type: cross Abstract: This paper studies a variation of the classic network alignment problem, named diffusion-network alignment. The goal is to align the vertices of a rooted diffusion tree to the vertices of a network, where the diffusion tree could …

  248. arXiv cs.CV TIER_1 English(EN) · Lidia Troeshestova, Alexander Ustyuzhanin, Sergey Kastryulin ·

    DuET: Diffusion图像编辑的双专家轨迹

    arXiv:2606.13303v1 Announce Type: new Abstract: Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the re…

  249. arXiv cs.CV TIER_1 English(EN) · Sergey Kastryulin ·

    DuET: Diffusion图像编辑的双专家轨迹

    Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the result appears, especially when the target scene d…

  250. arXiv stat.ML TIER_1 English(EN) · Pierre-Alexandre Mattei ·

    利用 Jeffrey Guidance 实现对扩散模型的更通用控制

    A key strength of diffusion models lies in their flexibility, since their outputs can be controlled at sampling time through guidance. However, beyond simple cases such as conditional sampling, the target distribution is often left implicit, defined only through a sampling rule o…

  251. arXiv cs.CV TIER_1 English(EN) · Benedetta Tondi ·

    图像扩散模型的有效、鲁棒且抗共谋指纹识别

    Model fingerprinting, embedding user-specific identifiers (fingerprints) into generated outputs, has recently emerged as a popular solution to protect the intellectual property rights (IPR) of generative text-to-image (T2I) models and prevent unauthorized redistribution. In this …

  252. arXiv stat.ML TIER_1 English(EN) · Lei Ying ·

    Diffusion-Network Alignment: An Efficient Algorithm and Explicit Probability Bounds

    This paper studies a variation of the classic network alignment problem, named diffusion-network alignment. The goal is to align the vertices of a rooted diffusion tree to the vertices of a network, where the diffusion tree could be from a communication trace or contact tracing, …

  253. arXiv stat.ML TIER_1 English(EN) · Lorenzo Bardone, Claudia Merger, Sebastian Goldt ·

    关于扩散模型学习数据统计的理论,从易到难

    arXiv:2603.12901v2 Announce Type: replace Abstract: While diffusion models have emerged as a powerful class of generative models, their learning dynamics remain poorly understood. We address this issue first by empirically showing that standard diffusion models trained on natural…

  254. arXiv cs.CV TIER_1 English(EN) · Chunlin Qiu, Ang Li, Tianxiao Huang, Ruilin Gan, Yunjie Ge, Shenyi Zhang, Huayi Duan, Lingchen Zhao, Chao Shen, Qian Wang ·

    VOID:在潜在扩散模型中击败未经授权的模仿

    arXiv:2606.12263v1 Announce Type: new Abstract: While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing defenses inject deceptive perturbations to steer the generated images towar…

  255. arXiv cs.CV TIER_1 English(EN) · Ruitong Sun, Tianze Yang, Wei Niu, Jin Sun ·

    RSTR:在Diffusion Transformers中减少时空冗余

    arXiv:2512.14096v2 Announce Type: replace Abstract: Diffusion Transformers (DiTs) have achieved remarkable success in image generation, yet their deployment is hindered by high computational costs. We identify two sources of redundancy. First, temporal redundancy: Classifier-Free…

  256. arXiv cs.CV TIER_1 English(EN) · Qian Wang ·

    VOID:在潜在扩散模型中击败未经授权的模仿

    While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing defenses inject deceptive perturbations to steer the generated images toward irrelevant targets. However, this approach hin…

  257. arXiv stat.ML TIER_1 English(EN) · Dennis Elbr\"achter, Giovanni S. Alberti, Matteo Santacesaria ·

    MAD: Manifold Attracted Diffusion

    arXiv:2509.24710v2 Announce Type: replace Abstract: Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an…

  258. arXiv cs.CV TIER_1 English(EN) · Zhengxuan Wei, Yi Dong, Zonghui Li, Xianhui Lin, Xing Liu, Hong Gu, Shaofeng Zhang, Wenbin Li, Qi Fan ·

    SSR-Merge:用于扩散模型无训练LoRA合并的子空间信号路由

    arXiv:2606.10617v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) merging can efficiently combine diverse generative capabilities from multiple trained LoRAs for a diffusion model. However, existing LoRA merging techniques often suffer from severe parameter interference,…

  259. arXiv stat.ML TIER_1 English(EN) · Zahra Kadkhodaie, Aram-Alexandre Pooladian, Sinho Chewi, Eero Simoncelli ·

    盲去噪扩散模型与维度带来的恩惠

    arXiv:2602.09639v2 Announce Type: replace-cross Abstract: Denoising diffusion models (DDMs) are state-of-the-art methods for learning densities from data across numerous domains, yet many aspects of the training and sampling pipeline remain poorly understood. In particular, noise…

  260. arXiv cs.CV TIER_1 English(EN) · Qi Fan ·

    SSR-Merge:扩散模型中用于无训练LoRA合并的子空间信号路由

    Low-Rank Adaptation (LoRA) merging can efficiently combine diverse generative capabilities from multiple trained LoRAs for a diffusion model. However, existing LoRA merging techniques often suffer from severe parameter interference, causing destructive collisions in the shared pa…

  261. arXiv stat.ML TIER_1 English(EN) · Al Zadid Sultan Bin Habib, Md Younus Ahamed, Prashnna Gyawali, Gianfranco Doretto, Donald A. Adjeroh ·

    BSTabDiff: 面向高维表格数据生成的块状子单元扩散先验

    arXiv:2606.09257v1 Announce Type: cross Abstract: High-Dimensional Low-Sample Size (HDLSS) tabular domains (e.g., omics) are characterized by $n \ll m$, where $n$ = number of samples, and $m$ = number of features. Such domains often exhibit strong local correlation groups, sparse…

  262. arXiv cs.CV TIER_1 English(EN) · Lianyu Pang, Tianlin Pan, Cheng Da, Changqian Yu, Huan Yang, Kun Gai, Song Guo, Wenhan Luo ·

    MaskAlign:用于高效扩散训练的令牌子集表示对齐

    arXiv:2606.08788v1 Announce Type: new Abstract: Representation alignment with pretrained vision models has recently shown strong potential for accelerating diffusion transformer training. By aligning intermediate diffusion features with clean-image representations from self-super…

  263. arXiv stat.ML TIER_1 English(EN) · Yilin Zheng, Haowei Wang, Szu Hui Ng, Enlu Zhou ·

    通过训练感知条件扩散模型改进贝叶斯优化

    arXiv:2606.08438v1 Announce Type: new Abstract: Bayesian optimization (BO) is a widely used approach for black-box optimization that uses a Gaussian process (GP) as a surrogate and guides sequential evaluations via an acquisition function, with the ultimate goal of locating the g…

  264. arXiv cs.CV TIER_1 English(EN) · Kaizhen Zhu, Mokai Pan, Zhechuan Yu, Jingya Wang, Jingyi Yu, Ye Shi ·

    扩散桥还是流匹配?一个统一的框架和比较分析

    arXiv:2509.24531v2 Announce Type: replace Abstract: Diffusion Bridge and Flow Matching have both demonstrated compelling empirical performance in transformation between arbitrary distributions. However, there remains confusion about which approach is generally preferable, and the…

  265. arXiv stat.ML TIER_1 English(EN) · Donald A. Adjeroh ·

    BSTabDiff:用于高维表格数据生成的高维表格数据生成块-子单元扩散先验

    High-Dimensional Low-Sample Size (HDLSS) tabular domains (e.g., omics) are characterized by $n \ll m$, where $n$ = number of samples, and $m$ = number of features. Such domains often exhibit strong local correlation groups, sparse cross-group dependencies, heavy-tailed non-Gaussi…

  266. arXiv stat.ML TIER_1 English(EN) · Enlu Zhou ·

    通过训练感知条件扩散模型改进贝叶斯优化

    Bayesian optimization (BO) is a widely used approach for black-box optimization that uses a Gaussian process (GP) as a surrogate and guides sequential evaluations via an acquisition function, with the ultimate goal of locating the global optimum $\mathbf{x}^{\star}$. To align wit…

  267. arXiv stat.ML TIER_1 English(EN) · Hongfan Gao, Wangmeng Shen, Bin Yang, Jilin Hu ·

    HyFAD:混合时频扩散模型,结合频域感知嵌入用于时间序列插补

    arXiv:2606.05239v1 Announce Type: new Abstract: Diffusion models have demonstrated strong performance in time series modeling due to their ability to progressively capture complex data distributions through iterative denoising. However, existing approaches struggle with frequency…

  268. arXiv cs.CV TIER_1 English(EN) · Weiyan Chen, Weijian Deng, Yao Xiao, Weijie Tu, ZiYi Dong, Ibrahim Radwan, Liang Lin, Pengxu Wei ·

    当偏好标签不足时:从真实数据中对齐扩散模型

    arXiv:2605.19839v2 Announce Type: replace Abstract: Preference alignment aims to guide generative models by learning from comparisons between preferred and non-preferred samples. In practice, most existing approaches rely on preference pairs constructed from model-generated image…

  269. arXiv cs.CV TIER_1 English(EN) · Noam Issachar, Dani Lischinski, Raanan Fattal ·

    Complexity-Balanced Diffusion Splitting

    arXiv:2606.06477v1 Announce Type: new Abstract: Standard continuous-time generative models rely on monolithic architectures that must navigate vastly different signal regimes, from isotropic noise to intricate data distributions. While scaling model capacity improves performance,…

  270. arXiv cs.CV TIER_1 English(EN) · Micha\"el Soumm, Alexandre Fournier Montgieux, Yunlong He, Pietro Gori, Alasdair Newson ·

    Diff-CA:使用扩散模型分离通用和显著因素

    arXiv:2606.06120v1 Announce Type: new Abstract: Contrastive Analysis aims to separate factors that are common between two data distributions from those that are salient to only one of them. Existing contrastive methods are based on generative models (e.g., VAEs or GANs) that ofte…

  271. arXiv cs.CV TIER_1 English(EN) · Xiao Cui, Yulei Qin, Mo Zhu, Wengang Zhou, Hongsheng Li, Houqiang Li ·

    面向扩散模型训练的几何感知数据集浓缩

    arXiv:2606.05883v1 Announce Type: new Abstract: Dataset condensation aims to construct compact datasets from real data via synthesis or selection. However, existing approaches are ill-suited for diffusion model training: synthetic data generation often yields low-fidelity samples…

  272. arXiv stat.ML TIER_1 English(EN) · Hongkai Zheng, Austin Wang, Zihui Wu, Zhengyu Huang, Ricardo Baptista, Yisong Yue ·

    Blade:一种使用扩散先验的无导数贝叶斯反演方法

    arXiv:2510.10968v3 Announce Type: replace-cross Abstract: Derivative-free Bayesian inversion arises in science and engineering applications, particularly when forward model is costly or infeasible to differentiate through. Existing derivative-free methods collapse the posterior t…

  273. arXiv stat.ML TIER_1 English(EN) · Na\"il B. Khelifa, Richard E. Turner, Ramji Venkataramanan ·

    扩散模型仅观察梯度:评分匹配误差的几何视角

    arXiv:2606.06179v1 Announce Type: new Abstract: Score-based diffusion models are typically trained by minimizing the $L^2$ score matching error, and standard theoretical analyses rely on this quantity to bound the sampling discrepancy between the learned and target distributions.…

  274. arXiv cs.CV TIER_1 English(EN) · Raanan Fattal ·

    Complexity-Balanced Diffusion Splitting

    Standard continuous-time generative models rely on monolithic architectures that must navigate vastly different signal regimes, from isotropic noise to intricate data distributions. While scaling model capacity improves performance, deploying a massive network uniformly across th…

  275. arXiv stat.ML TIER_1 English(EN) · Ramji Venkataramanan ·

    扩散模型仅观察梯度:评分匹配误差的几何视角

    Score-based diffusion models are typically trained by minimizing the $L^2$ score matching error, and standard theoretical analyses rely on this quantity to bound the sampling discrepancy between the learned and target distributions. We show the $L^2$ score error is not the right …

  276. arXiv cs.CV TIER_1 English(EN) · Alasdair Newson ·

    Diff-CA:使用扩散模型分离常见和显著因素

    Contrastive Analysis aims to separate factors that are common between two data distributions from those that are salient to only one of them. Existing contrastive methods are based on generative models (e.g., VAEs or GANs) that often suffer from limited reconstruction and image q…

  277. arXiv cs.CV TIER_1 English(EN) · Houqiang Li ·

    面向扩散模型训练的几何感知数据集浓缩

    Dataset condensation aims to construct compact datasets from real data via synthesis or selection. However, existing approaches are ill-suited for diffusion model training: synthetic data generation often yields low-fidelity samples unsuitable for authentic modeling, while real s…

  278. arXiv stat.ML TIER_1 English(EN) · Riccardo Saporiti, Fabio Nobile ·

    面向具有不可达边界的扩散贝叶斯推断的神经伽辽金归一化流

    arXiv:2606.04324v1 Announce Type: cross Abstract: One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observ…

  279. arXiv stat.ML TIER_1 English(EN) · Jilin Hu ·

    HyFAD:混合时频扩散模型,结合频域感知嵌入用于时间序列插补

    Diffusion models have demonstrated strong performance in time series modeling due to their ability to progressively capture complex data distributions through iterative denoising. However, existing approaches struggle with frequency-sensitive denoising, high-frequency reconstruct…

  280. arXiv cs.CV TIER_1 English(EN) · Yan Zeng, Masanori Suganuma, Takayuki Okatani ·

    颠倒去噪扩散隐式模型生成过程:实证评估与一种新方法

    arXiv:2606.03111v1 Announce Type: new Abstract: This paper studies the problem of inverting the DDIM image generation process to recover latent variables, particularly the initial noise map, from a generated image. Existing methods often struggle with accuracy in this task. We pr…

  281. arXiv cs.CV TIER_1 English(EN) · Tianxiong Zhong, Xingye Tian, Xuebo Wang, Xin Tao, Pengfei Wan ·

    在正确的空间中扩散:潜在扩散性的系统研究

    arXiv:2606.03578v1 Announce Type: new Abstract: Latent diffusion models leverage visual tokenizers to compress images into latent spaces for efficient generative modeling. However, better reconstruction quality of a tokenizer does not necessarily translate into better generation …

  282. arXiv cs.CV TIER_1 English(EN) · Prithviraj Verma, Pawan Kumar, Chandan Deshani, Prasun Chandra Tripathi ·

    一种基于注意力机制的扩散加权成像去噪模型

    arXiv:2606.03903v1 Announce Type: new Abstract: Diffusion-weighted imaging (DWI) is used for whole-body cancer screening, but it typically requires a long acquisition time. When the scan time is reduced, the image quality often suffers, leading to increased noise in the scans. Ma…

  283. arXiv stat.ML TIER_1 English(EN) · Jing Jia, Wei Yuan, Sifan Liu, Liyue Shen, Guanyang Wang ·

    弱扩散先验仍可实现强大的逆问题性能

    arXiv:2601.22443v2 Announce Type: replace-cross Abstract: Can a diffusion model trained on bedrooms recover human faces? Diffusion models are widely used as priors for inverse problems, but standard approaches usually assume a high-fidelity model trained on data that closely matc…

  284. arXiv stat.ML TIER_1 English(EN) · Jungkyu Kim, Taeyoung Park, Kibok Lee ·

    AugMask:通过随机增强和掩码在不完整表格数据上训练扩散模型

    arXiv:2606.03347v1 Announce Type: cross Abstract: Score-based diffusion models have emerged as prominent deep generative models; however, their application to tabular data remains challenging because their backbones assume fully specified inputs, whereas real-world tabular data o…

  285. arXiv cs.CV TIER_1 English(EN) · Yan Zeng, Masanori Suganuma, Takayuki Okatani ·

    一种改进的个性化扩散模型的方法

    arXiv:2407.05312v2 Announce Type: replace Abstract: Diffusion models have demonstrated impressive image generation capabilities. Personalized approaches, such as textual inversion and Dreambooth, enhance model individualization using specific images. These methods enable generati…

  286. arXiv cs.CV TIER_1 English(EN) · Katarzyna Zaleska, {\L}ukasz Popek, Monika Wysocza\'nska, Kamil Deja ·

    注意力机制,可否请您做出决策?在扩散模型中本地化生成选择

    arXiv:2604.06052v2 Announce Type: replace Abstract: Text-to-image diffusion models exhibit remarkable generative capabilities, yet their internal operations remain opaque, particularly when handling prompts that are not fully descriptive. In such scenarios, models must make impli…

  287. arXiv stat.ML TIER_1 English(EN) · Weiguo Gao, Ming Li, Lei Shi, Hanfei Zhou ·

    扩散模型中流动蒸馏的定量近似框架

    arXiv:2606.03820v1 Announce Type: new Abstract: We develop a quantitative approximation framework for diffusion distillation, viewing few-step sampling as error propagation under compositions of learned flow maps. Focusing on trajectory distillation for the probability-flow ODE, …

  288. arXiv stat.ML TIER_1 English(EN) · Fabio Nobile ·

    用于具有不可达边界的扩散贝叶斯推断的神经伽辽金归一化流

    One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observation times, which is needed to derive the likelih…

  289. arXiv cs.CV TIER_1 English(EN) · Prasun Chandra Tripathi ·

    一种基于注意力机制的扩散加权成像去噪模型

    Diffusion-weighted imaging (DWI) is used for whole-body cancer screening, but it typically requires a long acquisition time. When the scan time is reduced, the image quality often suffers, leading to increased noise in the scans. Magnitude reconstruction in DWI introduces signal-…

  290. arXiv stat.ML TIER_1 English(EN) · Hanfei Zhou ·

    扩散模型中流动蒸馏的定量近似框架

    We develop a quantitative approximation framework for diffusion distillation, viewing few-step sampling as error propagation under compositions of learned flow maps. Focusing on trajectory distillation for the probability-flow ODE, we show that local approximation errors can be s…

  291. arXiv cs.CV TIER_1 English(EN) · Pengfei Wan ·

    在正确的空间中扩散:潜在扩散性的系统研究

    Latent diffusion models leverage visual tokenizers to compress images into latent spaces for efficient generative modeling. However, better reconstruction quality of a tokenizer does not necessarily translate into better generation quality, suggesting that latent representations …

  292. arXiv stat.ML TIER_1 English(EN) · Kibok Lee ·

    AugMask:通过随机增强和掩码在不完整表格数据上训练扩散模型

    Score-based diffusion models have emerged as prominent deep generative models; however, their application to tabular data remains challenging because their backbones assume fully specified inputs, whereas real-world tabular data often contain missing values. We propose AugMask, a…

  293. arXiv cs.CV TIER_1 English(EN) · Yongsen Cheng, Kai Liu, Kaiwen Tao, Junxian Li, Zhixin Wang, Zhikai Chen, Renjing Pei, Yulun Zhang ·

    PermuQuant:通过重新排序扩散模型的通道来降低每组量化误差

    arXiv:2605.09503v2 Announce Type: replace Abstract: Large-scale visual generative models have achieved remarkable performance. However, their high computational and memory costs make deployment challenging in resource-constrained scenarios, such as interactive applications and pe…

  294. arXiv cs.CV TIER_1 English(EN) · Ziyue Lin, Jiahe Hou, Hongyu Xia, Xinrui Xie, Feifei Wang, Yuyin Zhou, Wei Wang, Jiawei Liu, Liangqiong Qu ·

    用于统一和数据高效图像到图像翻译的解耦残差去噪扩散模型

    arXiv:2606.01048v1 Announce Type: new Abstract: We propose Decoupled Residual Denoising Diffusion models (DRDD) for unified and data-efficient image-to-image (I2I) translation. While diffusion models have advanced I2I translation in terms of quality and diversity, we uncover a pr…

  295. arXiv stat.ML TIER_1 English(EN) · Keito Wakatsuki, Hideaki Shimazaki ·

    重尾扩散模型中的自调节退火

    arXiv:2606.01645v1 Announce Type: new Abstract: Diffusion models have emerged as a leading framework for deep generative modeling. While the standard Gaussian formulation is theoretically convenient, its suitability for heavy-tailed datasets remains unclear. To address this, heav…

  296. arXiv stat.ML TIER_1 English(EN) · Ioar Casado-Telletxea, Omar Rivasplata ·

    基于扩散模型的漂移估计器的误差界限

    arXiv:2606.02115v1 Announce Type: new Abstract: Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drif…

  297. arXiv stat.ML TIER_1 English(EN) · Florian Handke, Dejan Stan\v{c}evi\'c, Felix Koulischer, Thomas Demeester, Luca Ambrogioni ·

    扩散模型中类别物种形成的熵特征

    arXiv:2602.09651v2 Announce Type: replace Abstract: Diffusion models do not recover semantic structure uniformly over time. Instead, samples transition from semantic ambiguity to class commitment within a narrow regime. Recent theoretical work attributes this transition to dynami…

  298. arXiv stat.ML TIER_1 English(EN) · Kijung Jeon, Michael Muehlebach, Molei Tao ·

    通过Landing在非凸等式和不等式约束下实现高效的扩散模型

    arXiv:2604.17838v2 Announce Type: replace-cross Abstract: Generative modeling within constrained sets is essential for scientific and engineering applications involving physical, geometric, or safety requirements (e.g., molecular generation, robotics). We present a unified framew…

  299. arXiv cs.CV TIER_1 English(EN) · Tao Wu, Senmao Li, Yaxing Wang, Shiqi Yang, Kai Wang, Joost van de Weijer ·

    面向单步扩散模型无需训练的图像反演

    arXiv:2606.01380v1 Announce Type: new Abstract: In this work, we introduce a novel training-free inversion (TFinv) framework for one-step diffusion models,addressing key challenges in real image inversion and editing. We first identify two critical factors hamperingreal-image inv…

  300. arXiv cs.CV TIER_1 English(EN) · Xueji Fang, Liyuan Ma, Jianhao Zeng, Jinjin Cao, Mingyuan Zhou, Guo-Jun Qi ·

    FocusDiT:在扩散 Transformer 中掩码查询以进行细粒度图像生成

    arXiv:2606.02090v1 Announce Type: new Abstract: Diffusion transformer (DiT) has been widely adopted in the generative diffusion field, advancing the denoising of query tokens through attention and Feed-Forward (\text{FFN}) layers. FFN actually acts as the key-value vocabulary for…

  301. arXiv stat.ML TIER_1 English(EN) · Omar Rivasplata ·

    基于扩散模型的漂移估计器的误差界限

    Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drift when the diffusion parameter is known, using d…

  302. arXiv cs.CV TIER_1 English(EN) · Guo-Jun Qi ·

    FocusDiT:在扩散 Transformer 中掩码查询以实现细粒度图像生成

    Diffusion transformer (DiT) has been widely adopted in the generative diffusion field, advancing the denoising of query tokens through attention and Feed-Forward (\text{FFN}) layers. FFN actually acts as the key-value vocabulary for decoding visual contents where the value embeds…

  303. arXiv cs.CV TIER_1 English(EN) · Shreyansh Modi, Akshat Tomar, Aarush Aggarwal ·

    无条件扩散模型中低级感知编辑的指南

    arXiv:2605.31162v1 Announce Type: new Abstract: Unconditional diffusion models offer powerful generative priors, yet steering them toward aesthetically enhanced outputs remains largely unexplored. We show that h-space patching, the dominant paradigm for training-free diffusion ed…

  304. arXiv stat.ML TIER_1 English(EN) · Shervin Khalafi, Alejandro Ribeiro, Dongsheng Ding ·

    扩散模型中的遗忘:基于KL散度和似然约束的统一框架

    arXiv:2605.30825v1 Announce Type: cross Abstract: Unlearning in diffusion models aims to remove undesirable data or concepts while preserving the utility of pretrained models -- two fundamentally conflicting objectives. We propose a principled constrained optimization framework t…

  305. arXiv stat.ML TIER_1 English(EN) · Nail B. Khelifa, Richard E. Turner, Ramji Venkataramanan ·

    量化扩散模型中的误差传播和模型坍塌

    arXiv:2602.16601v2 Announce Type: replace Abstract: Machine learning models are increasingly trained or fine-tuned on synthetic data. Recursively training on such data has been observed to significantly degrade performance in a wide range of tasks, often characterized by a progre…

  306. arXiv cs.CV TIER_1 English(EN) · Nathan Kessler, Robin Magnet, Jean Feydy ·

    Sinkhorn Normalization of Diffusion Kernels

    arXiv:2507.06161v2 Announce Type: replace Abstract: Smoothing a signal based on local neighborhoods is a core operation in machine learning and geometry processing. On well-structured domains such as vector spaces and manifolds, the Laplace operator derived from differential geom…

  307. arXiv stat.ML TIER_1 English(EN) · Hideaki Shimazaki ·

    重尾扩散模型中的自调节退火

    Diffusion models have emerged as a leading framework for deep generative modeling. While the standard Gaussian formulation is theoretically convenient, its suitability for heavy-tailed datasets remains unclear. To address this, heavy-tailed diffusion models (HTDMs) extend the sta…

  308. arXiv cs.CV TIER_1 English(EN) · Aarush Aggarwal ·

    无条件扩散模型中低层感知编辑的指南

    Unconditional diffusion models offer powerful generative priors, yet steering them toward aesthetically enhanced outputs remains largely unexplored. We show that h-space patching, the dominant paradigm for training-free diffusion editing, systematically fails for global, low-leve…

  309. arXiv stat.ML TIER_1 English(EN) · Dongsheng Ding ·

    扩散模型中的遗忘:基于KL散度和似然约束的统一框架

    Unlearning in diffusion models aims to remove undesirable data or concepts while preserving the utility of pretrained models -- two fundamentally conflicting objectives. We propose a principled constrained optimization framework that formulates unlearning as minimizing the deviat…

  310. arXiv stat.ML TIER_1 English(EN) · Tassilo Schwarz, Cai Dieball, Constantin Kogler, Renaud Lambiotte, Arnaud Doucet, Alja\v{z} Godec, George Deligiannidis ·

    通过戴森扩散实现置换不变谱学习

    arXiv:2510.08535v2 Announce Type: replace Abstract: Diffusion models are central to generative modeling and have been adapted to graphs by diffusing adjacency matrix representations. The challenge of having up to $n!$ such representations for graphs with $n$ nodes is only partial…

  311. arXiv cs.CV TIER_1 English(EN) · Yurong Gao, Zicheng Zhang, Congying Han, Tiande Guo, Xinmin Qiu ·

    通过噪声对齐解决扩散桥中的端点欠拟合问题

    arXiv:2605.28962v1 Announce Type: new Abstract: Diffusion bridge models offer a powerful framework for connecting two data distributions, such as in image restoration and translation. Many existing methods learn this bridge by mimicking the score-matching formulation of standard …

  312. arXiv stat.ML TIER_1 English(EN) · Jingda Wu, Changxiao Cai ·

    扩散模型在学习低维多模态分布方面具有统计最优性

    arXiv:2605.30153v1 Announce Type: new Abstract: Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of …

  313. arXiv cs.CV TIER_1 English(EN) · Hadar Davidson, Noam Issachar, Sagie Benaim ·

    彩色噪声扩散采样

    arXiv:2605.30332v1 Announce Type: new Abstract: Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency global structures early and high-frequency fine details later. Conventio…

  314. arXiv stat.ML TIER_1 Italiano(IT) · Jennifer Rosina Andersson, Zheng Zhao ·

    扩散可微重采样

    arXiv:2512.10401v3 Announce Type: replace Abstract: This paper is concerned with differentiable resampling in the context of sequential Monte Carlo (e.g., particle filtering). Drawing on reparametrisation, we propose a new resampling method that is informative and instantly diffe…

  315. arXiv cs.CV TIER_1 English(EN) · Sagie Benaim ·

    彩色噪声扩散采样

    Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency global structures early and high-frequency fine details later. Conventional stochastic differential equation (SDE) solve…

  316. arXiv stat.ML TIER_1 English(EN) · Changxiao Cai ·

    扩散模型在学习低维多模态分布方面具有统计最优性

    Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their statistical efficiency remains limited. Ex…

  317. arXiv cs.CV TIER_1 English(EN) · Viktoriia Mishkurova ·

    面向神经退行性疾病进展预测的治疗条件扩散模型

    Forecasting the progression of neurodegenerative diseases, such as Parkinson's disease, is essential for effective long-term planning and personalized therapeutic intervention. Existing systems typically produce scalar clinical scores that ignore the rich structure of longitudina…

  318. arXiv cs.CV TIER_1 English(EN) · Dacheng Tao ·

    Cert-LAS:面向文本到图像扩散模型的认证层自适应平滑方法,实现模型所有权认证

    Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingly critical. We find that existing backdoor-based d…

  319. arXiv stat.ML TIER_1 English(EN) · Gabriel Peyr\'e ·

    Optimal and Diffusion Transports in Machine Learning

    arXiv:2512.06797v2 Announce Type: replace-cross Abstract: Several problems in machine learning are naturally expressed as the design and analysis of time-evolving probability distributions. This includes sampling via diffusion methods, optimizing the weights of neural networks, a…

  320. arXiv cs.CV TIER_1 Deutsch(DE) · Sean Man, Gilad Deutch, Roy Ganz, Roi Ronen, Shahar Tsiper, Shai Mazor, Niv Nayman ·

    DODO:离散 OCR 扩散模型

    arXiv:2602.16872v2 Announce Type: replace Abstract: Optical Character Recognition (OCR) is a fundamental task for digitizing information, serving as a critical bridge between visual data and textual understanding. While modern Vision-Language Models (VLM) have achieved high accur…

  321. arXiv cs.CV TIER_1 English(EN) · Shangwen Zhu, Han Zhang, Zhantao Yang, Qianyu Peng, Zhao Pu, Huangji Wang, Fan Cheng ·

    利用局部转移相干性加速扩散采样

    arXiv:2503.09675v3 Announce Type: replace Abstract: Text-based diffusion models have made significant breakthroughs in generating high-quality images and videos from textual descriptions. However, the lengthy sampling time of the denoising process remains a significant bottleneck…

  322. arXiv cs.CV TIER_1 English(EN) · Hongki Lim ·

    具有去噪器-拉回曲率引导和流形对齐阻尼的几何校正扩散后验采样

    Diffusion posterior sampling conditions diffusion priors on measurements, but data-consistency updates are typically scaled by hand-tuned guidance weights and can destabilize sampling under stiff, operator-dependent curvature. We replace scalar guidance with a per-noise-level dam…

  323. arXiv stat.ML TIER_1 English(EN) · Yuchen Liang, Ness Shroff, Yingbin Liang ·

    从分数到Gibbs校正器:加速统一速率离散扩散模型

    arXiv:2605.27352v1 Announce Type: cross Abstract: Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration met…

  324. arXiv stat.ML TIER_1 English(EN) · Hyunmo Kang, Noam Itzhak Levi, Corinna Elena Wegner, Daniel J. Korchinski, Matthieu Wyart ·

    使用前向后向扩散步链进行采样

    arXiv:2605.27006v1 Announce Type: cross Abstract: Sampling from learned high-dimensional distributions is a foundational computational problem. We introduce U-turn chains: Markov chains obtained by iterating short forward-backward steps of a diffusion model, in which each step pr…

  325. arXiv cs.CV TIER_1 English(EN) · Junseo Bang, Dong Ju Mun, Hoigi Seo, Seongmin Hong, Se Young Chun ·

    用于逆问题的三元动力学感知扩散后验采样:优化引导和随机性调度

    arXiv:2605.26470v1 Announce Type: new Abstract: Generative posterior sampling using diffusion models has emerged as a dominant paradigm for solving inverse problems in imaging, which usually consists of three main components: data consistency (DC) guidance, classifier-free guidan…

  326. arXiv cs.CV TIER_1 (AF) · Felix Krause, Stefan Andreas Baumann, Johannes Schusterbauer, Olga Grebenkova, Ming Gui, Vincent Tao Hu, Bj\"orn Ommer ·

    引导稀疏Token的扩散模型

    arXiv:2601.01608v2 Announce Type: replace Abstract: Diffusion models deliver high quality in image synthesis but remain expensive during training and inference. Recent works have leveraged the inherent redundancy in visual content to make training more affordable by training only…

  327. arXiv stat.ML TIER_1 English(EN) · Yingbin Liang ·

    从分数到Gibbs校正器:加速统一速率离散扩散模型

    Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration methods either rely on training additional quantities…

  328. arXiv stat.ML TIER_1 English(EN) · Matthieu Wyart ·

    使用前向后向扩散步链进行采样

    Sampling from learned high-dimensional distributions is a foundational computational problem. We introduce U-turn chains: Markov chains obtained by iterating short forward-backward steps of a diffusion model, in which each step proposes a move that remains on the learned data man…

  329. arXiv cs.CV TIER_1 English(EN) · Jiacheng Yang, Jun Wu, Yaoyao Ding, Zhiying Xu, Yida Wang, Gennady Pekhimenko ·

    SwiftFusion: 用于GPU上扩散Transformer分布式推理的可扩展序列并行化

    arXiv:2601.20273v2 Announce Type: replace-cross Abstract: Diffusion Transformers (DiTs) have gained increasing adoption in high-quality image and video generation. As demand for higher-resolution images and longer videos increases, single-GPU inference becomes inefficient due to …

  330. arXiv cs.CV TIER_1 English(EN) · Weimin Bai, Yuxuan Gu, Yifei Wang, Weijian Luo, He Sun ·

    通过原则性后验匹配实现无偏扩散变分反演

    arXiv:2605.25042v1 Announce Type: new Abstract: Existing score-based methods for inverse problems often resort to approximate minimization of the KL divergence between the inversion distribution and the Bayesian posterior. Such an approximation leads to severe mode collapse and u…

  331. arXiv cs.CV TIER_1 English(EN) · Mingyu Liang, Dingkun Xu, Jingwei Xu ·

    面向缓存加速扩散模型的轨迹一致性校准

    arXiv:2605.24870v1 Announce Type: new Abstract: Diffusion Transformers require repeated denoiser evaluations during iterative sampling, making inference computationally expensive. Cache-based acceleration reduces this cost by reusing intermediate representations across denoising …

  332. arXiv cs.CV TIER_1 English(EN) · Agata \.Zywot, Iason Skylitsis, Thijmen Nijdam, Zoe Tzifa-Kratira, Derck Prinzhorn, Konrad Szewczyk, Aritra Bhowmik ·

    在推理时将图像引导注入文本条件扩散模型

    arXiv:2605.25191v1 Announce Type: new Abstract: Text-to-image diffusion models like Stable Diffusion generate high-quality images from text, but lack a way to inject visual guidance (e.g. sketches, styles) at inference without retraining. Existing methods either require computati…

  333. arXiv cs.CV TIER_1 English(EN) · Bingtian Qiao, Yue Shi, Yingjie Zhou, Yong Guo, Guangtao Zhai, Jiezhang Cao ·

    具有紧凑令牌压缩和线性注意力的单步高效扩散恢复模型

    arXiv:2605.23451v1 Announce Type: new Abstract: Real-world image super-resolution aims to recover high-quality images from complex and unknown real-world degradations. However, existing generative Real-ISR methods largely inherit the dense latent representations and quadratic-cos…

  334. arXiv stat.ML TIER_1 English(EN) · Xin Guo ·

    多目标学习用于扩散模型:半监督学习下的统计理论

    Diffusion models are increasingly used as powerful conditional generators, yet real deployments often involve multiple target distributions arising from different tasks, e.g., diverse prompt domains in text-to-image generation, or multiple environments in robotics with diffusion …

  335. arXiv stat.ML TIER_1 English(EN) · Pan Xu ·

    通过信任区域迭代扭曲序列蒙特卡洛在推理时对扩散模型进行对齐

    We study inference-time alignment for diffusion-based generative models, aiming to steer a base model toward high-reward outputs without updating its weights. Recent Sequential Monte Carlo (SMC)-based steering methods approximate reward-tilted target distributions in a principled…

  336. arXiv cs.CV TIER_1 English(EN) · Jiezhang Cao ·

    具有紧凑令牌压缩和线性注意力的单步高效扩散恢复模型

    Real-world image super-resolution aims to recover high-quality images from complex and unknown real-world degradations. However, existing generative Real-ISR methods largely inherit the dense latent representations and quadratic-cost global modeling paradigm developed for high-re…

  337. arXiv cs.CV TIER_1 English(EN) · Hangyeol Lee, Hyojeong Lee, Joo-Young Kim ·

    通过输出相似度感知重新思考扩散模型的Token缩减

    arXiv:2605.22011v1 Announce Type: new Abstract: Diffusion Transformers (DiTs) achieve superior image generation quality but suffer from quadratic computational complexity relative to token count. While various token reduction (TR) methods have been proposed to mitigate this cost,…

  338. arXiv cs.CV TIER_1 English(EN) · Le Zhang, Ning Mang, Aishwarya Agrawal ·

    RiT:Vanilla Diffusion Transformers在表示空间中已足够

    arXiv:2605.21981v1 Announce Type: new Abstract: Flow matching with $x$-prediction -- regressing the clean data point rather than the ambient velocity -- is known to exploit low-dimensional manifold structure effectively in pixel space \cite{li2025back}. We ask whether a pretraine…

  339. arXiv stat.ML TIER_1 English(EN) · Jonathan Lorraine ·

    方差缩减用于具有扩散教师的期望

    Pretrained diffusion models serve as frozen teachers feeding downstream pipelines such as text-to-3D, single-step distillation, and data attribution. The teacher gradients these pipelines consume are Monte Carlo (MC) expectations over noise levels and Gaussian noise samples; thei…

  340. arXiv stat.ML TIER_1 English(EN) · Benjamin Sterling, M\'onica F. Bugallo, Tom Tirer ·

    使用高阶 Langevin 动力学减少扩散模型记忆

    arXiv:2605.19170v1 Announce Type: new Abstract: Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training…

  341. arXiv stat.ML TIER_1 English(EN) · Wenpin Tang, Nizar Touzi, Zikun Zhang, Xun Yu Zhou ·

    Tweedie公式与高斯以外的扩散生成模型

    arXiv:2605.19391v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, …

  342. arXiv stat.ML TIER_1 English(EN) · Xun Yu Zhou ·

    Tweedie 公式与高斯以外的扩散生成模型

    Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, transforming it into a simple prior, and then us…

  343. arXiv stat.ML TIER_1 English(EN) · Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley Osher, Markos A. Katsoulakis ·

    离散扩散模型的无量纲收敛性:伴随方程诱导出正确的空间

    arXiv:2605.17232v1 Announce Type: cross Abstract: Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses d…

  344. arXiv stat.ML TIER_1 English(EN) · Chenyang Wang, Weizhong Wang, Yinuo Ren, Jose Blanchet, Yiping Lu ·

    基于路径测度的序列蒙特卡洛方法,用于扩散模型的简单近似和无导数推理时尺度调整

    arXiv:2605.17850v1 Announce Type: new Abstract: iffusion-based generative models increasingly rely on inference-time guidance, adding a drift term or reweighting mixture of experts, to improve sample quality on task-specific objectives. However, most existing techniques require r…

  345. arXiv stat.ML TIER_1 English(EN) · Tom Tirer ·

    使用高阶Langevin动力学减少扩散模型记忆

    Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training samples-known as "memorization"-potentially vio…

  346. arXiv cs.CV TIER_1 English(EN) · Yiping Lu ·

    基于路径测度的序列蒙特卡洛方法,用于扩散模型的简单近似和无导数推理时尺度调整

    iffusion-based generative models increasingly rely on inference-time guidance, adding a drift term or reweighting mixture of experts, to improve sample quality on task-specific objectives. However, most existing techniques require repeated score or gradient evaluations, introduci…

  347. arXiv stat.ML TIER_1 English(EN) · Markos A. Katsoulakis ·

    离散扩散模型的无量纲收敛性:伴随方程诱导正确空间

    Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses diverge under singular priors such as the masked di…

  348. arXiv cs.CV TIER_1 English(EN) · Yimao Cai ·

    少做多成:扩散模型强化学习微调是否需要每一步优化?

    Despite strong image-generation performance, diffusion models' reconstruction objectives limit alignment with human preferences. RL enables such alignment through explicit rewards. However, most studies apply RL to the full denoising trajectory, making it computationally costly a…

  349. arXiv stat.ML TIER_1 English(EN) · Yifeng Yu, Lu Yu ·

    关于扩散模型中潜在重用的局限性

    arXiv:2605.13448v1 Announce Type: new Abstract: Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a sour…

  350. arXiv stat.ML TIER_1 English(EN) · Ye He, Yitong Qiu, Molei Tao ·

    扩散模型的泛化能力可通过对数据依赖的脊流形进行归纳偏置来表征

    arXiv:2602.06021v2 Announce Type: replace Abstract: We study a data-dependent notion of diffusion-model generalization: when a model does not memorize the training set, where do its generated samples go relative to the geometry induced by the data? To answer this, we introduce a …

  351. arXiv stat.ML TIER_1 English(EN) · Yuzhen Zhao, Jiarong Fan, Yating Liu ·

    使用神经网络对扩散过程中的漂移函数进行插件分类

    arXiv:2602.02791v2 Announce Type: replace Abstract: We study supervised multiclass classification for diffusion processes, where each class is characterized by a distinct drift function and trajectories are observed at discrete times. We first derive a multidimensional Bayes rule…

  352. arXiv stat.ML TIER_1 Deutsch(DE) · Dario Shariatian, Alain Durmus, Umut Simsekli, Stefano Peluchetti ·

    潜变量增强离散扩散模型

    arXiv:2510.18114v3 Announce Type: replace-cross Abstract: Discrete diffusion models have emerged as a powerful class of models and a promising route to fast language generation, but practical implementations typically rely on factored reverse transitions ignoring cross-token depe…

  353. arXiv cs.CV TIER_1 English(EN) · Zhi Wang ·

    极端快速动作扩散的测试时稀疏性

    Action diffusion excels at high-fidelity action generation but incurs heavy computational costs owing to its iterative denoising nature. Despite current technologies showing promise in accelerating diffusion transformers by reusing the cached features, they struggle to adapt to p…

  354. arXiv stat.ML TIER_1 English(EN) · Tim Salimans ·

    Covariance-aware sampling for Diffusion Models

    We present a covariance-aware sampler that improves the quality of pixel-space Diffusion Model (DM) sampling in the few-step regime. We hypothesize that in the few-step regime samplers fail because they rely solely on the predicted mean of the reverse distribution, while our solu…

  355. arXiv stat.ML TIER_1 English(EN) · Guillaume Coqueret, Martial Laguerre ·

    具有后验漂移的过参数化模型

    arXiv:2506.23619v2 Announce Type: replace-cross Abstract: This paper investigates the impact of posterior drift on out-of-sample forecasting accuracy in overparametrized machine learning models. We document the loss in performance when the loadings of the data generating process …

  356. arXiv stat.ML TIER_1 English(EN) · Hao Chen, Renzheng Zhang, Scott S. Howard ·

    DAPS++:通过解耦后验退火重新思考扩散逆问题

    arXiv:2511.17038v3 Announce Type: replace-cross Abstract: From a Bayesian perspective, score-based diffusion solves inverse problems through joint inference, embedding the likelihood with the prior to guide the sampling process. However, this formulation fails to explain its prac…

  357. arXiv stat.ML TIER_1 English(EN) · Jing Jia, Liyue Shen, Guanyang Wang ·

    伴侣控制:扩散模型中的联合初始噪声设计

    arXiv:2605.11311v1 Announce Type: cross Abstract: Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specif…

  358. arXiv cs.CV TIER_1 English(EN) · Edmond S. L. Ho ·

    基于连续隐式表示的扩散生成运动插值

    Recent advances in generative models have yielded impressive progress on motion in-betweening, allowing for more complex, varied, and realistic motion transitions. However, recent methods still exhibit noticeable limitations in preserving keyframe information and ensuring motion …

  359. arXiv cs.CV TIER_1 English(EN) · Dong-Jun Han ·

    用于概念分离扩散遗忘的解耦稀疏表示

    Unlearning specific concepts in text-to-image diffusion models has become increasingly important for preventing undesirable content generation. Among prior approaches, sparse autoencoder (SAE)-based methods have attracted attention due to their ability to suppress target concepts…

  360. arXiv cs.CV TIER_1 English(EN) · Konstantin Kulikov ·

    面向下游视觉任务的扩散模型少样本合成数据生成

    Class imbalance is a persistent challenge in visual recognition, particularly in safety-critical domains where collecting positive examples is expensive and rare events are inherently underrepresented. We propose a lightweight synthetic data augmentation pipeline that fine-tunes …

  361. arXiv stat.ML TIER_1 English(EN) · Guanyang Wang ·

    伴侣控制:扩散模型中的联合初始噪声设计

    Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specify a coupling of the initial noises: each noise rem…

  362. arXiv cs.CV TIER_1 English(EN) · Wang Chen ·

    Filtering Memorization from Parameter-Space in Diffusion Models

    Low-Rank Adaptation (LoRA) has become a widely used mechanism for customizing diffusion models, enabling users to inject new visual concepts or styles through lightweight parameter updates. However, LoRAs can memorize training images, causing generated outputs to reproduce copyri…

  363. arXiv stat.ML TIER_1 English(EN) · Enrico Ventura, Beatrice Achilli, Luca Ambrogioni, Carlo Lucibello ·

    高维引导扩散模型中失真的出现

    arXiv:2602.00716v4 Announce Type: replace Abstract: Classifier-free guidance (CFG) is the de facto standard for conditional sampling in diffusion models, yet it often reduces sample diversity. Using tools from statistical physics, we analyze the emergence of generative distortion…

  364. arXiv stat.ML TIER_1 English(EN) · Dongqing Li, Geoff K. Nicholls, Shiyi Sun, You Luo ·

    用于潜在偏序推理的可微分贝叶斯松弛

    arXiv:2605.06976v1 Announce Type: new Abstract: Many ranking and agent trace datasets are recorded as linear orders even though their latent structure is only partially ordered. This is especially common in agent and workflow traces, where observed order may reflect arbitrary lin…

  365. arXiv stat.ML TIER_1 English(EN) · Qiao Wang ·

    期望最大化作为谱约束的松弛流

    arXiv:2605.07818v1 Announce Type: new Abstract: The expectation--maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local con…

  366. arXiv stat.ML TIER_1 English(EN) · Simon Bienewald, Lukas Trottner ·

    球形首次命中扩散模型的统计收敛性

    arXiv:2605.07625v1 Announce Type: cross Abstract: Denoising diffusion models have evolved into a state-of-the-art method for tasks in various fields, such as denoising and generation of images, text generation, or generation of synthetic data for training of other machine learnin…

  367. arXiv stat.ML TIER_1 English(EN) · James Matthew Young, Paula Cordero-Encinar, Sebastian Reich, Andrew Duncan, O. Deniz Akyildiz ·

    Diffusion Path Samplers via Sequential Monte Carlo

    arXiv:2601.21951v2 Announce Type: replace Abstract: We develop diffusion-based samplers for target distributions known up to a normalising constant. To this end, we rely on the well-known diffusion path that smoothly interpolates between a simple base distribution and the target,…

  368. arXiv stat.ML TIER_1 English(EN) · Arnaud Doucet ·

    Metropolis-Adjusted Diffusion Models

    Sampling from score-based diffusion models incurs bias due to both time discretisation and the approximation of the score function. A common strategy for reducing this bias is to apply corrector steps based on the unadjusted Langevin algorithm (ULA) at each noise level within a p…

  369. arXiv cs.CV TIER_1 English(EN) · Kaushik Roy ·

    HEART:通过Kent表示遍历在扩散模型中实现超球嵌入对齐

    Text-to-image diffusion models can generate visually stunning images, yet, controlling what appears and how it appears, remains surprisingly difficult, especially when operating solely within the constraints of the text-conditioning space. For example, changing a subject or adjus…

  370. arXiv cs.CV TIER_1 English(EN) · Yali Wang ·

    对于有利于扩散的潜在流形,什么最重要?面向潜在扩散的先验对齐自编码器

    Tokenizers are a crucial component of latent diffusion models, as they define the latent space in which diffusion models operate. However, existing tokenizers are primarily designed to improve reconstruction fidelity or inherit pretrained representations, leaving unclear what kin…

  371. arXiv stat.ML TIER_1 English(EN) · Qiao Wang ·

    期望最大化作为谱约束的松弛流

    The expectation--maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local convergence is governed by the spectrum of the line…

  372. arXiv cs.CV TIER_1 English(EN) · Baoru Huang ·

    SARA: 视频扩散模型的语义自适应关系对齐

    Recent video diffusion models (VDMs) synthesize visually convincing clips, yet still drop entities, mis-bind attributes, and weaken the interactions specified in the prompt. Representation-alignment objectives such as VideoREPA and MoAlign improve fine-grained text following by d…

  373. arXiv stat.ML TIER_1 English(EN) · Lukas Trottner ·

    球形首次命中扩散模型的统计收敛性

    Denoising diffusion models have evolved into a state-of-the-art method for tasks in various fields, such as denoising and generation of images, text generation, or generation of synthetic data for training of other machine learning models. First hitting diffusion models (FHDM) ar…

  374. arXiv stat.ML TIER_1 English(EN) · You Luo ·

    用于潜在偏序推理的可微分贝叶斯松弛

    Many ranking and agent trace datasets are recorded as linear orders even though their latent structure is only partially ordered. This is especially common in agent and workflow traces, where observed order may reflect arbitrary linearization rather than true prerequisites. We in…

  375. arXiv cs.CV TIER_1 Français(FR) · Yan Zeng ·

    连续潜在扩散语言模型

    Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve generation efficiency, scalable representation learn…

  376. arXiv cs.CV TIER_1 English(EN) · Chunhua Shen ·

    MARBLE:扩散强化学习的多方面奖励平衡

    Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task, and multiple evaluation criteria need to be optimized simultaneously. Existing practice d…

  377. arXiv stat.ML TIER_1 English(EN) · Stefano Sarao Mannelli ·

    数据结构与不平衡在扩散模型学习动态中的相互作用

    Real-world datasets are inherently heterogeneous, yet how per-class structural differences and sampling imbalance shape the training dynamics of diffusion models-and potentially exacerbate disparities-remains poorly understood. While models typically transition from an initial ph…

  378. arXiv stat.ML TIER_1 English(EN) · Carola-Bibiane Schönlieb ·

    双Lipschitz归一化流的表达能力:基于分数的扩散视角

    Many normalizing flow architectures impose regularity constraints, yet their distributional approximation properties are not fully characterized. We study the expressivity of bi-Lipschitz normalizing flows through the lens of score-based diffusion models. For the probability flow…

  379. arXiv cs.CV TIER_1 English(EN) · Bartlomiej Sobieski, Matthew Tivnan, Dawid P{\l}udowski, Micha{\l} Jan W{\l}odarczyk, Pengfei Jin, Przemyslaw Biecek, Quanzheng Li ·

    局部内禀维度揭示扩散模型中的幻觉

    arXiv:2605.05026v1 Announce Type: new Abstract: Diffusion models are prone to generating structural hallucinations - samples that match the statistical properties of the training data yet defy underlying structural rules, resulting in anomalies like hands with more than five fing…

  380. arXiv cs.CV TIER_1 English(EN) · Yiran Qiao, Yiren Lu, Yunlai Zhou, Disheng Liu, Linlin Hou, Rui Yang, Yu Yin, Jing Ma ·

    结构化三维潜在表示出奇地强大:释放具有二维扩散的通用风格

    arXiv:2605.04412v1 Announce Type: new Abstract: 3D asset generation plays a pivotal role in fields such as gaming and virtual reality, enabling the rapid synthesis of high-fidelity 3D objects from a single or multiple images. Building on this capability, enabling style-controllab…

  381. arXiv cs.CV TIER_1 Deutsch(DE) · Chen Wei ·

    驯服扩散 Transformer 中的异常 Token

    We study outlier tokens in Diffusion Transformers (DiTs) for image generation. Prior work has shown that Vision Transformers (ViTs) can produce a small number of high-norm tokens that attract disproportionate attention while carrying limited local information, but their role in g…

  382. arXiv stat.ML TIER_1 English(EN) · Arnaud Doucet ·

    漂移模型上的 Wasserstein梯度流解释

    Recently, Deng et al. (2026) proposed Generative Modeling via Drifting (GMD), a novel framework for generative tasks. This note presents an analysis of GMD through the lens of Wasserstein Gradient Flows (WGF), i.e., the path of steepest descent for a functional in the space of pr…

  383. arXiv cs.CV TIER_1 English(EN) · Quanzheng Li ·

    局部内在维度揭示扩散模型中的幻觉

    Diffusion models are prone to generating structural hallucinations - samples that match the statistical properties of the training data yet defy underlying structural rules, resulting in anomalies like hands with more than five fingers. Recent research studied this failure mode f…

  384. arXiv stat.ML TIER_1 English(EN) · Christopher Nemeth ·

    通过结构化随机扩散实现超图生成

    Hypergraphs model higher-order interactions, but realistic hypergraph generation remains difficult because incidence, hyperedge-size heterogeneity, and overlap structure are not faithfully captured by pairwise reductions. We propose \HEDGE, a generative model defined directly on …

  385. arXiv cs.CV TIER_1 English(EN) · Qichao Wang, Yunhong Lu, Hengyuan Cao, Junyi Zhang, Min Zhang ·

    DMGD:扩散模型中的无训练数据集蒸馏与语义分布匹配

    arXiv:2605.03877v1 Announce Type: new Abstract: Dataset distillation enables efficient training by distilling the information of large-scale datasets into significantly smaller synthetic datasets. Diffusion based paradigms have emerged in recent years, offering novel perspectives…

  386. arXiv cs.CV TIER_1 English(EN) · Ruibin Min, Yexin Liu, Aimin Pan, Changsheng Lu, Jiafei Wu, Kelu Yao, Xiaogang Xu, Harry Yang ·

    AHPA:用于扩散 Transformer 的自适应分层先验对齐

    arXiv:2605.03317v1 Announce Type: new Abstract: Representation alignment has recently emerged as an effective paradigm for accelerating Diffusion Transformer training. Despite their success, existing alignment methods typically impose a fixed supervision target or a fixed alignme…

  387. arXiv stat.ML TIER_1 English(EN) · Kanishka Reddy ·

    前馈表示的扩散算子几何

    arXiv:2605.01107v1 Announce Type: cross Abstract: Neural networks transform data through learned representations whose geometry affects separation, contraction, and generalization. Recent work studies this geometry using discrete curvature on neighborhood graphs, suggesting Ricci…

  388. arXiv cs.CV TIER_1 English(EN) · Fangzheng Wu, Brian Summa ·

    SteeringDiffusion:用于扩散模型的瓶颈激活控制接口

    arXiv:2605.01653v1 Announce Type: new Abstract: We introduce SteeringDiffusion, a bottlenecked activation-level control interface for diffusion models that exposes a smooth, monotonic, and runtime-adjustable control surface over the content--style trade-off. Our method keeps the …

  389. arXiv cs.CV TIER_1 English(EN) · An Huang, Junggab Son, Zuobin Xiong ·

    Watch Your Step: Diffusion Models 中的信息注入 via Shadow Timestep Embedding

    arXiv:2605.00935v1 Announce Type: cross Abstract: Diffusion models have become the foundation of modern generative systems, with most research focusing primarily on improving generation efficiency and output quality. The timestep embedding component is a crucial part of the diffu…

  390. arXiv cs.CV TIER_1 English(EN) · Xun Su, Hiroyuki Kasai ·

    Noise is All You Need: 使用扩散模型的噪声组合采样解决线性逆问题

    arXiv:2510.23633v2 Announce Type: replace-cross Abstract: Pretrained diffusion models have demonstrated strong capabilities in zero-shot inverse problem solving by incorporating observation information into the generation process of the diffusion models. However, this presents an…

  391. arXiv cs.CV TIER_1 English(EN) · Harry Yang ·

    AHPA:用于扩散 Transformer 的自适应分层先验对齐

    Representation alignment has recently emerged as an effective paradigm for accelerating Diffusion Transformer training. Despite their success, existing alignment methods typically impose a fixed supervision target or a fixed alignment granularity throughout the entire denoising t…

  392. arXiv cs.CV TIER_1 English(EN) · Song Yan, Chenfeng Wang, Wei Zhai, Xinliang Bi, Jian Yang, Yancheng Cai, Yusen Zhang, Yunwei Lan, Tao Zhang, GuanYe Xiong, Min Li, Zheng-Jun Zha ·

    随机性的确定性:扩散模型中的潜在空间退化

    arXiv:2511.07756v4 Announce Type: replace Abstract: Diffusion models initialize generation from an isotropic Gaussian latent, yet changing only the random seed can substantially alter prompt faithfulness, composition, and visual quality. We explain this gap by distinguishing the …

  393. arXiv cs.CV TIER_1 English(EN) · Saeed Mohseni-Sehdeh, Walid Saad, Kei Sakaguchi, Tao Yu ·

    用于通过分段引导的后验采样解决逆问题的扩散模型

    arXiv:2507.18654v2 Announce Type: replace-cross Abstract: Diffusion models are powerful tools for sampling from high-dimensional distributions by progressively transforming pure noise into structured data through a denoising process. When equipped with a guidance mechanism, these…

  394. arXiv cs.CV TIER_1 English(EN) · Anne Harrington, A. Sophia Koepke, Shyamgopal Karthik, Trevor Darrell, Alexei A. Efros ·

    亡羊补牢,为时未晚:已训练扩散模型崩溃恢复中的噪声优化

    arXiv:2601.00090v2 Announce Type: replace Abstract: Contemporary text-to-image models exhibit a surprising degree of mode collapse, as can be seen when sampling several images given the same text prompt. Previous work has attempted to address this issue by steering the model usin…

  395. arXiv cs.CV TIER_1 Italiano(IT) · Simeon Allmendinger, Domenique Zipperling, Lukas Struppek, Niklas K\"uhl ·

    CollaFuse: 协作式扩散模型

    arXiv:2406.14429v3 Announce Type: replace-cross Abstract: In the landscape of generative artificial intelligence, diffusion-based models have emerged as a promising method for generating synthetic images. However, the application of diffusion models poses numerous challenges, par…

  396. arXiv stat.ML TIER_1 English(EN) · Kanishka Reddy ·

    前馈表示的扩散算子几何

    Neural networks transform data through learned representations whose geometry affects separation, contraction, and generalization. Recent work studies this geometry using discrete curvature on neighborhood graphs, suggesting Ricci-flow-like behavior across layers. We develop a sm…

  397. arXiv stat.ML TIER_1 English(EN) · Michael S. Albergo ·

    扩散模型和流模型微调与采样统一视角

    We study the problem of training diffusion and flow generative models to sample from target distributions defined by an exponential tilting of a base density; a formulation that subsumes both sampling from unnormalized densities and reward fine-tuning of pre-trained models. This …

  398. arXiv cs.CV TIER_1 English(EN) · Zhirong Shen, Rui Huang, Jiacheng Liu, Chang Zou, Peiliang Cai, Shikang Zheng, Zhengyi Shi, Liang Feng, Linfeng Zhang ·

    超越固定公式:数据驱动的线性预测器,用于高效的扩散模型

    arXiv:2604.26365v1 Announce Type: new Abstract: To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping.…

  399. arXiv cs.CV TIER_1 Italiano(IT) · Haosen Li, Wenshuo Chen, Lei Wang, Shaofeng Liang, Bowen Tian, Soning Lai, Yutao Yue ·

    Delta Score很重要!扩散模型中的空间自适应多重引导

    arXiv:2604.26503v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in synthesizing complex static and temporal visuals, a breakthrough largely driven by Classifier-Free Guidance (CFG). However, despite its pivotal role in aligning generated content …

  400. arXiv cs.CV TIER_1 English(EN) · Miguel Espinosa, Eva Gmelich Meijling, Valerio Marsocci, Elliot J. Crowley, Mikolaj Czerkawski ·

    COP-GEN:用于哥白尼地球观测数据的潜在扩散Transformer

    arXiv:2603.03239v2 Announce Type: replace Abstract: Earth observation applications increasingly rely on data from multiple sensors, including optical, radar, elevation, and land-cover. Relationships between modalities are fundamental for data integration but are inherently non-in…

  401. arXiv cs.CV TIER_1 English(EN) · Yang Yang, Feifan Meng, Han Fang, Weiming Zhang ·

    ACPO:基于无参考质量引导的扩散模型锚点约束感知优化

    arXiv:2604.26348v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effecti…

  402. arXiv cs.CV TIER_1 English(EN) · Rui Xu, Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Shiqing Xin, Changhe Tu, Taku Komura, Wenping Wang ·

    ComboStoc:用于扩散生成模型组合随机性

    arXiv:2405.13729v3 Announce Type: replace-cross Abstract: In this paper, we study an under-explored but important factor of diffusion generative models, i.e., the combinatorial complexity. Data samples are generally high-dimensional, and for various structured generation tasks, a…

  403. arXiv cs.CV TIER_1 Italiano(IT) · Yutao Yue ·

    Delta Score很重要!扩散模型中的空间自适应多重引导

    Diffusion models have achieved remarkable success in synthesizing complex static and temporal visuals, a breakthrough largely driven by Classifier-Free Guidance (CFG). However, despite its pivotal role in aligning generated content with textual prompts, standard CFG relies on a g…

  404. arXiv cs.CV TIER_1 English(EN) · Linfeng Zhang ·

    超越固定公式:数据驱动的线性预测器助力高效扩散模型

    To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping. We propose L2P (Learnable Linear Predictor), a …

  405. arXiv cs.CV TIER_1 English(EN) · Weiming Zhang ·

    ACPO:用于具有无参考质量指导的扩散模型的锚点约束感知优化

    Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effective for fidelity, may insufficient in terms of su…

  406. arXiv cs.CV TIER_1 English(EN) · Liuzhuozheng Li, Zhiyuan Zhan, Shuhong Liu, Dengyang Jiang, Zanyi Wang, Guang Dai, Jingdong Wang, Mengmeng Wang ·

    从不相交的噪声数据流形探索扩散生成模型中的时间条件

    arXiv:2604.25289v1 Announce Type: cross Abstract: Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning lead…

  407. arXiv cs.CV TIER_1 English(EN) · Nishit Anand, Manan Suri, Christopher Metzler, Dinesh Manocha, Ramani Duraiswami ·

    Diffusion模型中的学习光照控制

    arXiv:2604.24877v1 Announce Type: new Abstract: Controlling illumination in images is essential for photography and visual content creation. While closed-source models have demonstrated impressive illumination control, open-source alternatives either require heavy control inputs …

  408. arXiv cs.CV TIER_1 English(EN) · Mengmeng Wang ·

    从不相交的噪声数据流形探索扩散生成模型中的时间条件

    Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning leads to significant performance degradation. However,…

  409. arXiv cs.CV TIER_1 English(EN) · Zhongjie Duan, Hong Zhang, Yingda Chen ·

    Diffusion Templates:可控Diffusion的统一插件框架

    arXiv:2604.24351v1 Announce Type: cross Abstract: Controllable diffusion methods have substantially expanded the practical utility of diffusion models, but they are typically developed as isolated, backbone-specific systems with incompatible training pipelines, parameter formats,…

  410. arXiv cs.CV TIER_1 English(EN) · Haosen Li, Wenshuo Chen, Shaofeng Liang, Lei Wang, Kaishen Yuan, Yutao Yue ·

    $Z^2$-Sampling:扩散模型中语义对齐的零成本之字形轨迹

    arXiv:2604.23536v1 Announce Type: new Abstract: Diffusion models have achieved unprecedented success in text-aligned generation, largely driven by Classifier-Free Guidance (CFG). However, standard CFG operates strictly on instantaneous gradients, omitting the intrinsic curvature …

  411. arXiv cs.CV TIER_1 English(EN) · Buddhi Wijenayake, Nichula Wasalathilake, Roshan Godaliyadda, Vijitha Herath, Parakrama Ekanayake, Vishal M. Patel ·

    通过提示控制的扩散增强来减轻长尾偏差

    arXiv:2602.04749v3 Announce Type: replace Abstract: Long-tailed class imbalance remains a fundamental obstacle in semantic segmentation of high-resolution remote-sensing imagery, where dominant classes shape learned representations and rare classes are systematically under-segmen…

  412. arXiv stat.ML TIER_1 English(EN) · Fan Chen, Sinho Chewi, Constantinos Daskalakis, Alexander Rakhlin ·

    高精度采样用于扩散模型和对数凹分布

    arXiv:2602.01338v2 Announce Type: replace-cross Abstract: We present algorithms for diffusion model sampling which obtain $\delta$-error in $\mathrm{polylog}(1/\delta)$ steps, given access to $\widetilde O(\delta)$-accurate score estimates in $L^2$. This is an exponential improve…

  413. arXiv stat.ML TIER_1 English(EN) · Bingqing Jiang, Difan Zou ·

    On the Memorization of Consistency Distillation for Diffusion Models

    arXiv:2604.23552v1 Announce Type: cross Abstract: Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is sha…

  414. arXiv cs.CV TIER_1 English(EN) · Ramani Duraiswami ·

    Learning Illumination Control in Diffusion Models

    Controlling illumination in images is essential for photography and visual content creation. While closed-source models have demonstrated impressive illumination control, open-source alternatives either require heavy control inputs like depth maps or do not release their data and…

  415. arXiv cs.CV TIER_1 English(EN) · Yingda Chen ·

    Diffusion Templates:可控Diffusion的统一插件框架

    Controllable diffusion methods have substantially expanded the practical utility of diffusion models, but they are typically developed as isolated, backbone-specific systems with incompatible training pipelines, parameter formats, and runtime hooks. This fragmentation makes it di…

  416. arXiv cs.CV TIER_1 English(EN) · Mingxing Rao, Bowen Qu, Daniel Moyer ·

    基于分数的扩散模型成员推断

    arXiv:2509.25003v2 Announce Type: replace-cross Abstract: Membership inference attacks (MIAs) against Diffusion Models (DMs) raise pressing privacy concerns by revealing whether a sample was part of the training set. While existing methods typically rely on measuring reconstructi…

  417. arXiv cs.CV TIER_1 English(EN) · Jincheng Ying, Yitao Chen, Li Wenlin, Minghui Xu, Yinhao Xiao ·

    通过嵌入损失实现高效扩散蒸馏

    arXiv:2604.22379v1 Announce Type: new Abstract: Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limitin…

  418. arXiv stat.ML TIER_1 English(EN) · Difan Zou ·

    On the Memorization of Consistency Distillation for Diffusion Models

    Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is shaped by training dynamics, with generalization and …

  419. arXiv cs.CV TIER_1 English(EN) · Yinhao Xiao ·

    高效扩散蒸馏通过嵌入损失

    Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limiting accessibility for resource-constrained researc…

  420. arXiv stat.ML TIER_1 English(EN) · Chang Liu ·

    商空间扩散模型

    Diffusion-based generative models have reformed generative AI, and have enabled new capabilities in the science domain, for example, generating 3D structures of molecules. Due to the intrinsic problem structure of certain tasks, there is often a symmetry in the system, which iden…

  421. arXiv stat.ML TIER_1 English(EN) · Houman Owhadi ·

    面向核化扩散图的自适应核选择

    Selecting an appropriate kernel is a central challenge in kernel-based spectral methods. In \emph{Kernelized Diffusion Maps} (KDM), the kernel determines the accuracy of the RKHS estimator of a diffusion-type operator and hence the quality and stability of the recovered eigenfunc…

  422. arXiv stat.ML TIER_1 English(EN) · Molei Tao ·

    通过Landing在非凸等式和不等式约束下实现高效扩散模型

    Generative modeling within constrained sets is essential for scientific and engineering applications involving physical, geometric, or safety requirements (e.g., molecular generation, robotics). We present a unified framework for constrained diffusion models on generic nonconvex …

  423. Smol AINews TIER_1 English(EN) ·

    Stable Diffusion 的最后辉煌?

    **Stability AI** launched **Stable Diffusion 3 Medium** with models ranging from **450M to 8B parameters**, featuring the MMDiT architecture and T5 text encoder for image text rendering. The community has shown mixed reactions following the departure of key researchers like Emad …

  424. Smol AINews TIER_1 English(EN) ·

    Stable Diffusion 3 — Rombach & Esser 再创佳绩!

    **Over 2500 new community members joined following Soumith Chintala's shoutout, highlighting growing interest in SOTA LLM-based summarization. The major highlight is the detailed paper release of **Stable Diffusion 3 (SD3)**, showcasing advanced text-in-image control and complex …

  425. 雷峰网 (Leiphone) TIER_1 中文(ZH) ·

    ICRA 2026 | EndoDDC:扩散模型赋能稀疏到稠密深度重建

    <section style="background-color: rgb(246, 249, 251); font-size: 15px; line-height: 2; letter-spacing: 1px; font-style: normal; font-weight: 400; text-align: justify; color: rgb(62, 62, 62);"><section style="text-align: center; margin-top: 10px; margin-bottom: 10px; line-height: …

  426. Together AI blog TIER_1 English(EN) ·

    Chipmunk:具有动态列稀疏增量的无训练加速扩散 Transformer

  427. Hacker News — AI stories ≥50 points TIER_1 English(EN) · benanne ·

    学习扩散模型的积分

  428. Practical AI TIER_1 English(EN) · Practical AI LLC ·

    Stable Diffusion

    <p>The new stable diffusion model is everywhere! Of course you can use this model to quickly and easily create amazing, dream-like images to post on twitter, reddit, discord, etc., but this technology is also poised to be used in very pragmatic ways across industry. In this episo…

  429. MarkTechPost TIER_1 English(EN) · Asif Razzaq ·

    Zyphra发布ZAYA1-8B-Diffusion-Preview:首个由自回归LLM转换而来的MoE扩散模型,速度提升高达7.7倍

    <p>Zyphra's latest release shows that an autoregressive MoE model can be converted into a discrete diffusion model with no systematic loss in evaluation performance. ZAYA1-8B-Diffusion-Preview achieves up to 7.7x inference speedup over autoregression by shifting decoding from mem…

  430. r/LocalLLaMA TIER_1 English(EN) · /u/qubridInc ·

    DiffusionGemma 在实际工作负载下的表现与基准演示差异明显

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1u30dhr/diffusiongemma_under_real_workloads_feels_very/"> <img alt="DiffusionGemma under real workloads feels very different from benchmark demos" src="https://preview.redd.it/zrnom6hrwn6h1.jpeg?width=640&amp;…

  431. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    高效且无需训练的单图像扩散模型 https://arxiv.org/abs/2606.04299 # HackerNews # Tech # AI

    Efficient and Training-Free Single-Image Diffusion Models https://arxiv.org/abs/2606.04299 # HackerNews # Tech # AI

  432. r/StableDiffusion TIER_2 Italiano(IT) · /u/recoilme ·

    Simple diffusion, SDXS-2B (新模型)

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1u1vbqa/simple_diffusion_sdxs2b_new_model/"> <img alt="Simple diffusion, sdxs-2b (new model)" src="https://external-preview.redd.it/IB3-LJ2W1KFbr3Sea_ZVMjEY0ZRSPzewEj2EDnJBvys.png?width=140&amp;height=75&…

  433. r/StableDiffusion TIER_2 English(EN) · /u/OptimisticPrompt ·

    在 iPhone 17 上本地运行 Stable Diffusion 1.5 生成图像 - 每张仅需 3 秒

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1tvkyzu/benchmarking_local_stable_diffusion_15/"> <img alt="Benchmarking local Stable Diffusion 1.5 generations on iPhone 17 - only 3 seconds per image" src="https://preview.redd.it/9crj8fwtl15h1.png?widt…

  434. r/StableDiffusion TIER_2 English(EN) · /u/Ok-Cartographer-5471 ·

    关于构建二维图像到三维图像扩散模型的指南 [D]

    <!-- SC_OFF --><div class="md"><p>I’m building a pipeline to turn 4-side product photos into professional studio images. I’m currently using SAM 2 for segmentation and an Inpainting pipeline to generate the studio background, but the model keeps hallucinating or degrading the pro…

  435. r/StableDiffusion TIER_2 English(EN) · /u/Elegant-Capital-9133 ·

    2026年Stable Diffusion模型推荐以获得更快、更清晰的输出?

    <!-- SC_OFF --><div class="md"><p>I’ve been switching between a few models lately but I still can’t find something that feels both fast and consistently clean in results. Some models look great but slow everything down, while others are fast but lose detail pretty quickly. Even w…

  436. r/StableDiffusion TIER_2 English(EN) · /u/AgeNo5351 ·

    彩色噪声扩散采样——即插即用,推理时采样器。

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1tray25/colored_noise_diffusion_sampling_plugandplay/"> <img alt="Colored Noise Diffusion Sampling - plug-and-play, inference-time sampler." src="https://preview.redd.it/pnumz6jlc44h1.png?width=140&amp;he…