PulseAugur
EN
LIVE 16:33:32

Apple researchers identify local scores for diffusion model generalization

Apple's research paper explores the mechanisms behind compositional generalization in conditional diffusion models, particularly focusing on how these models handle generating images with more objects than trained on. The study identifies 'local conditional scores' as a key factor enabling this ability, demonstrating that models succeeding at length generalization exhibit these scores, while those that fail do not. The research also proposes a method to enforce these local scores, which successfully enabled length generalization in a previously underperforming model. AI

IMPACT Research into diffusion model generalization could lead to more robust and controllable image generation systems.

RANK_REASON The cluster contains academic papers detailing research into diffusion models and their generalization capabilities.

Read on Hugging Face Blog →

AI-generated summary · Google Gemini · from 494 sources. How we write summaries →

Apple researchers identify local scores for diffusion model generalization

COVERAGE [494]

  1. Apple Machine Learning Research TIER_1 English(EN) ·

    Local Mechanisms of Compositional Generalization in Conditional Diffusion

    Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization,…

  2. Hugging Face Blog TIER_1 English(EN) ·

    Finetune Stable Diffusion Models with DDPO via TRL

  3. Hugging Face Blog TIER_1 English(EN) ·

    Fine-tuning Stable Diffusion models on Intel CPUs

  4. Hugging Face Blog TIER_1 English(EN) ·

    Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

  5. Hugging Face Blog TIER_1 English(EN) ·

    Instruction-tuning Stable Diffusion with InstructPix2Pix

  6. Hugging Face Blog TIER_1 English(EN) ·

    Accelerating Stable Diffusion Inference on Intel CPUs

  7. Hugging Face Blog TIER_1 English(EN) ·

    Using LoRA for Efficient Stable Diffusion Fine-Tuning

  8. Hugging Face Blog TIER_1 English(EN) ·

    Japanese Stable Diffusion

  9. Hugging Face Blog TIER_1 English(EN) ·

    The Annotated Diffusion Model

  10. arXiv cs.LG TIER_1 English(EN) · Pengcheng Wang, Qinghang Liu, Haotian Lin, Yiheng Li, Guojian Zhan, Masayoshi Tomizuka, Yixiao Wang ·

    DADP: Domain Adaptive Diffusion Policy

    arXiv:2602.04037v3 Announce Type: replace Abstract: Learning domain adaptive policies that can generalize to unseen transition dynamics, remains a fundamental challenge in learning-based control. Substantial progress has been made through domain representation learning to capture…

  11. arXiv cs.LG TIER_1 English(EN) · Calvin Luo, Chen Sun, Shuran Song ·

    DF-ExpEnse: Diffusion Filtered Exploration for Sample Efficient Finetuning

    arXiv:2606.19656v1 Announce Type: cross Abstract: A natural recipe for intelligent robotic decision-making is initializing from pretrained generative control policies, which have summarized offline experience, and adapting them to self-collected online experience. We present DF-E…

  12. arXiv cs.LG TIER_1 English(EN) · Jos\'e A. Ch\'avez ·

    On the Redundancy of Timestep Embeddings in Diffusion Models

    arXiv:2606.20416v1 Announce Type: new Abstract: Diffusion models rely heavily on explicit timestep embeddings to modulate the denoising process across various noise scales. In this work, we challenge the necessity of these temporal signals by analyzing their impact on U-Net and D…

  13. arXiv cs.LG TIER_1 English(EN) · Xinhe Mu, Zaijiu Shang, Zhaoqi Zhou, Chuan Zhou, Qi Meng, Guiying Yan, Zhiming Ma ·

    Score Approximation for Diffusion Models on Arbitrary Low-Dimensional Structures

    arXiv:2606.19894v1 Announce Type: new Abstract: The remarkable success of score-based diffusion models has spurred significant efforts to establish their theoretical foundations. However, existing complexity bounds for score approximation rely heavily on restrictive assumptions l…

  14. arXiv cs.LG TIER_1 English(EN) · Jeeho Ryoo, Yongchan Jung, Muhammad Ali Khaliq, Weidong Zhang, Jiatong Han, Byeong Kil Lee ·

    Performance Analysis and Optimization of 3D Generative Diffusion Models across GPU Architectures

    arXiv:2606.19365v1 Announce Type: new Abstract: Diffusion models have become essential for high-fidelity 3D MRI synthesis, yet their deployment remains constrained by substantial GPU resource demands arising from hundreds of U-Net evaluations per sample and a highly heterogeneous…

  15. arXiv cs.AI TIER_1 English(EN) · Dong Hoon Lee, Seunghoon Hong ·

    Variable-Length Tokenization via Learnable Global Merging for Diffusion Transformers

    arXiv:2606.20076v1 Announce Type: cross Abstract: Latent Diffusion Models (LDMs) have become dominant in visual synthesis, but their quality-compute trade-off is largely constrained by the tokenizer's fixed compression ratio. Variable-length tokenizers (VLTs) promise adaptive com…

  16. arXiv cs.AI TIER_1 English(EN) · Zhengheng Li, Panrui Li, Xuyang Liu, Puzhi Xia ·

    Where to Place the Query? Unveiling and Mitigating Positional Bias in In-Context Learning for Diffusion LLMs via Decoding Dynamics

    arXiv:2606.19349v1 Announce Type: cross Abstract: While In-Context Learning (ICL) is extensively studied in Autoregressive (AR) LLMs, its mechanism within Diffusion Large Language Models (dLLMs) remains largely unexplored. Unlike AR models restricted by unidirectional causal mask…

  17. arXiv cs.AI TIER_1 English(EN) · Seunghoon Hong ·

    Variable-Length Tokenization via Learnable Global Merging for Diffusion Transformers

    Latent Diffusion Models (LDMs) have become dominant in visual synthesis, but their quality-compute trade-off is largely constrained by the tokenizer's fixed compression ratio. Variable-length tokenizers (VLTs) promise adaptive compression by varying token counts, allowing diffusi…

  18. arXiv cs.LG TIER_1 English(EN) · Zhiming Ma ·

    Score Approximation for Diffusion Models on Arbitrary Low-Dimensional Structures

    The remarkable success of score-based diffusion models has spurred significant efforts to establish their theoretical foundations. However, existing complexity bounds for score approximation rely heavily on restrictive assumptions like Lipschitz continuous densities or smooth man…

  19. arXiv cs.CL TIER_1 English(EN) · Nicolas Floquet, Joseph Le Roux, Nadi Tomeh ·

    Approximate Structured Diffusion for Sequence Labelling

    arXiv:2606.18856v1 Announce Type: new Abstract: Sequence labelling, a core task of Natural Language Processing (NLP), consists in assigning each token of an input sentence a label. From a Machine Learning point of view, sequence labelling is often cast as a Linear-Chain Condition…

  20. arXiv cs.LG TIER_1 English(EN) · Muge Zhang, Muhammad Ali Khaliq, Jamal Alsakran, Byeong Kil Lee, Jeeho Ryoo ·

    Structural MRI Synthesis for Alzheimer's Disease via Conditional Diffusion on Anatomical Masks

    arXiv:2606.18354v1 Announce Type: cross Abstract: Recent advances in generative machine learning models have significantly improved medical imaging, offering promising solutions for data augmentation, privacy preservation, and improved model generalization. However, synthesizing …

  21. arXiv cs.LG TIER_1 English(EN) · Simon Pedro Galeano Munoz, Mustapha Bounoua, Giulio Franzese, Pietro Michiardi, Maurizio Filippone ·

    DIPHINE: Diffusion-based $\Phi$-ID Neural Estimator

    arXiv:2606.18997v1 Announce Type: new Abstract: Uncovering the true informational architecture of real-world complex systems requires disentangling how their components uniquely store, redundantly share, and synergistically integrate information over time. Integrated Information …

  22. arXiv cs.CL TIER_1 English(EN) · Zirui Wu, Lin Zheng, Jiacheng Ye, Shansan Gong, Xueliang Zhao, Yansong Feng, Wei Bi, Lingpeng Kong ·

    DreamReasoner-8B: Block-Size Curriculum Learning for Diffusion Reasoning Models

    arXiv:2606.19257v1 Announce Type: new Abstract: Block diffusion language models accelerate decoding through parallel block-wise denoising, yet whether they can be reliably scaled for long chain-of-thought (CoT) reasoning remains unresolved. To this end, we develop DreamReasoner-8…

  23. arXiv cs.CL TIER_1 English(EN) · Lingpeng Kong ·

    DreamReasoner-8B: Block-Size Curriculum Learning for Diffusion Reasoning Models

    Block diffusion language models accelerate decoding through parallel block-wise denoising, yet whether they can be reliably scaled for long chain-of-thought (CoT) reasoning remains unresolved. To this end, we develop DreamReasoner-8B, an open-source block diffusion reasoning mode…

  24. arXiv cs.LG TIER_1 English(EN) · Maurizio Filippone ·

    DIPHINE: Diffusion-based $Φ$-ID Neural Estimator

    Uncovering the true informational architecture of real-world complex systems requires disentangling how their components uniquely store, redundantly share, and synergistically integrate information over time. Integrated Information Decomposition ($Φ$ID) is a framework for decompo…

  25. arXiv cs.CL TIER_1 English(EN) · Nadi Tomeh ·

    Approximate Structured Diffusion for Sequence Labelling

    Sequence labelling, a core task of Natural Language Processing (NLP), consists in assigning each token of an input sentence a label. From a Machine Learning point of view, sequence labelling is often cast as a Linear-Chain Conditional Random Field (CRF) parametrised by a neural n…

  26. arXiv cs.LG TIER_1 English(EN) · Yair Schiff, Omer Belhasin, Roy Uziel, Guanghan Wang, Marianne Arriola, Gilad Turok, Ran Zilberstein, Michael Elad, Volodymyr Kuleshov ·

    Learn from Your Mistakes: Self-Correcting Masked Diffusion Models

    arXiv:2602.11590v3 Announce Type: replace Abstract: Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models, enabling parallel token generation while achieving competitive performance. Despite these advantages, MDMs face a fundamental limit…

  27. arXiv cs.LG TIER_1 English(EN) · Kaijian Wang, Yuanyuan Xu, Fanjiang Ye, Ye Cao, Jingwei Zuo, T. S. Eugene Ng, Yarong Mu, Yuke Wang ·

    AoiZora: Topology-Aware Auto-Parallel Optimization for Inference of Diffusion Transformers

    arXiv:2606.17566v1 Announce Type: cross Abstract: Video diffusion has quickly grown into a key generative serving workload, yet producing each clip demands many denoising iterations over large spatio-temporal latents, which puts low-latency inference out of reach on a single devi…

  28. arXiv cs.LG TIER_1 English(EN) · Jisung Hwang, Yunhong Min, Jaihoon Kim, I-Chao Shen, Minhyuk Sung ·

    NoiseTilt: Noise-Tilted Reverse Kernels for Diffusion Reward Alignment

    arXiv:2606.18066v1 Announce Type: new Abstract: We introduce the Noise-Tilted Reverse Kernel (NTRK), a reward-guided diffusion sampler that injects reward gradients through the noise term, leaving the pretrained reverse kernel unchanged and requiring only a single sample per step…

  29. arXiv cs.LG TIER_1 English(EN) · Alba Carballo-Castro, Julianna Piskorz, Paulius Rauba, Mihaela van der Schaar, Pascal Frossard ·

    Recursive Scaling in Masked Diffusion Models

    arXiv:2606.18022v1 Announce Type: new Abstract: Masked diffusion models (MDMs) have recently emerged as a promising paradigm for sequence generation. Scaling MDMs is conventionally achieved by increasing the parameter count or the number of denoising steps. We introduce Recursive…

  30. arXiv cs.LG TIER_1 English(EN) · Samar Hadou, Yigit Berkay Uslu, Alejandro Ribeiro ·

    Constrained Diffusion Models with Primal-Dual Inference

    arXiv:2606.17192v1 Announce Type: new Abstract: This paper develops constrained diffusion models with primal-dual inference (PDI) to sample from optimal distributions of entropy-regularized optimization problems with \emph{average} constraints. We formalize constrained sampling i…

  31. arXiv cs.AI TIER_1 English(EN) · Lifeng Chen, Jiner Wang, Zihao Pan, Beier Zhu, Xiaofeng Yang, Chi Zhang ·

    Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models

    arXiv:2507.17853v2 Announce Type: replace-cross Abstract: Recent advances in text-to-image (T2I) generation have led to impressive visual results. However, these models still face significant challenges when handling complex prompt, particularly those involving multiple subjects …

  32. arXiv cs.LG TIER_1 English(EN) · Minhyuk Sung ·

    NoiseTilt: Noise-Tilted Reverse Kernels for Diffusion Reward Alignment

    We introduce the Noise-Tilted Reverse Kernel (NTRK), a reward-guided diffusion sampler that injects reward gradients through the noise term, leaving the pretrained reverse kernel unchanged and requiring only a single sample per step. Reward-guided sampling at inference time has g…

  33. arXiv cs.LG TIER_1 English(EN) · Pascal Frossard ·

    Recursive Scaling in Masked Diffusion Models

    Masked diffusion models (MDMs) have recently emerged as a promising paradigm for sequence generation. Scaling MDMs is conventionally achieved by increasing the parameter count or the number of denoising steps. We introduce Recursive Masked Diffusion Models (R-MDMs), which add rec…

  34. arXiv cs.LG TIER_1 English(EN) · Yuke Wang ·

    AoiZora: Topology-Aware Auto-Parallel Optimization for Inference of Diffusion Transformers

    Video diffusion has quickly grown into a key generative serving workload, yet producing each clip demands many denoising iterations over large spatio-temporal latents, which puts low-latency inference out of reach on a single device. A denoising step is therefore typically distri…

  35. arXiv cs.AI TIER_1 English(EN) · Ziming Liu, Yifan Yang, Chengruidong Zhang, Yiqi Zhang, Lili Qiu, Yang You, Yuqing Yang ·

    Region-Adaptive Sampling for Diffusion Transformers

    arXiv:2502.10389v2 Announce Type: replace-cross Abstract: Diffusion models (DMs) have become the leading choice for generative tasks across diverse domains. However, their reliance on multiple sequential forward passes significantly limits real-time performance. Previous accelera…

  36. arXiv cs.LG TIER_1 English(EN) · Kyeongmin Yeo, Yunhong Min, Minhyuk Sung ·

    MUNI: Multimodal Unified Latent Diffusion for Coherent Any-to-Any Generation

    arXiv:2606.16408v1 Announce Type: new Abstract: We introduce MUNI, an end-to-end multimodal latent diffusion framework for any-to-any generation that unifies subset-conditioned cross-modal generation and unconditional joint sampling through a shared stochastic latent. Existing mu…

  37. arXiv cs.LG TIER_1 English(EN) · Maoliang Li, Haojing Chen, Jiayu Chen, Zihao Zheng, Xinhao Sun, Hailong Zou, Xiang Chen ·

    MoECa: Aligning Feature Reuse with Expert Decomposition in Diffusion Transformers

    arXiv:2606.15615v1 Announce Type: new Abstract: Diffusion Transformers with Mixture-of-Experts (DiT-MoE) improve model capacity under sparse activation, but diffusion inference is still bottlenecked by redundant computation across timesteps. Existing caching methods mainly operat…

  38. arXiv cs.LG TIER_1 English(EN) · Qizhen Ying, Yangchen Pan, Victor Adrian Prisacariu, Junfeng Wen ·

    Temporal Difference Learning for Diffusion Models

    arXiv:2606.15048v1 Announce Type: new Abstract: Diffusion models are typically trained with objectives that focus on local denoising targets at individual time steps (or adjacent pairs), which do not enforce consistency between predictions along the denoising trajectory. This lac…

  39. arXiv cs.AI TIER_1 English(EN) · Rutherford A. Patamia, Ming Liu, Wei Luo, Favour Ekong, Akan Cosgun ·

    Fast When, Careful Who: Dual-Process Multiparty Turn-Taking with Diffusion Augmentation

    arXiv:2606.16568v1 Announce Type: cross Abstract: Reliable turn-taking is essential for spoken dialogue systems. However, most existing methods are designed for two-speaker interaction and struggle with realistic multiparty audio containing overlap and rapid speaker changes. We s…

  40. Hugging Face Daily Papers TIER_1 English(EN) ·

    Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

    Spectral Forcing, a time-conditional 2D-DCT low-pass operator, improves diffusion model efficiency by explicitly separating signal from noise in pixel-space models.

  41. arXiv cs.AI TIER_1 English(EN) · Akan Cosgun ·

    Fast When, Careful Who: Dual-Process Multiparty Turn-Taking with Diffusion Augmentation

    Reliable turn-taking is essential for spoken dialogue systems. However, most existing methods are designed for two-speaker interaction and struggle with realistic multiparty audio containing overlap and rapid speaker changes. We study multiparty turn-taking on the VoxConverse dat…

  42. arXiv cs.AI TIER_1 English(EN) · Dvir Samuel, Issar Tzachor, Matan Levy, Michael Green, Gal Chechik, Rami Ben-Ari ·

    Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention

    arXiv:2602.01801v2 Announce Type: replace-cross Abstract: Autoregressive video diffusion models enable streaming generation, opening the door to long-form synthesis, video world models, and interactive neural game engines. However, their core attention layers become a major bottl…

  43. arXiv cs.AI TIER_1 English(EN) · Zheyuan Zhan, Hongchen Li, Can Wang, Yinfei Ma, Mingzhen Huang, Ruoshi Bai, Jiawei Chen, Siwei Lyu, Defang Chen ·

    Conditioning Matters: Stabilizing Inversion and Attention in Diffusion Image Editing

    arXiv:2606.14125v1 Announce Type: cross Abstract: Inversion-based image editing offers flexible and training-free control but still struggles with inversion accuracy and the trade-off between editing fidelity and background preservation. While recent methods improve inversion for…

  44. Hugging Face Daily Papers TIER_1 English(EN) ·

    UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

    UniDDT addresses key challenges in unified multimodal models by leveraging a Noisy ViT encoder and LLM for semantic encoding while using separate diffusion decoders to balance visual understanding and generation tasks.

  45. arXiv cs.AI TIER_1 English(EN) · Defang Chen ·

    Conditioning Matters: Stabilizing Inversion and Attention in Diffusion Image Editing

    Inversion-based image editing offers flexible and training-free control but still struggles with inversion accuracy and the trade-off between editing fidelity and background preservation. While recent methods improve inversion formulations or attention interactions, the role of t…

  46. arXiv cs.AI TIER_1 English(EN) · Rapha\"el Razafindralambo, R\'emy Sun, Fr\'ed\'eric Precioso, Jes Frellsen, Pierre-Alexandre Mattei ·

    Towards More General Control of Diffusion Models Using Jeffrey Guidance

    arXiv:2606.13240v1 Announce Type: cross Abstract: A key strength of diffusion models lies in their flexibility, since their outputs can be controlled at sampling time through guidance. However, beyond simple cases such as conditional sampling, the target distribution is often lef…

  47. arXiv cs.AI TIER_1 English(EN) · Jianwei Fei, Yunshu Dai, Zhihua Xia, Xiaochun Cao, Jiantao Zhou, Alessandro Piva, Benedetta Tondi ·

    Efficient, Robust, and Anti-Collusion Fingerprinting of Image Diffusion Models

    arXiv:2606.12977v1 Announce Type: cross Abstract: Model fingerprinting, embedding user-specific identifiers (fingerprints) into generated outputs, has recently emerged as a popular solution to protect the intellectual property rights (IPR) of generative text-to-image (T2I) models…

  48. Hugging Face Daily Papers TIER_1 English(EN) ·

    Uncertainty Estimation for Molecular Diffusion Models

    Diffusion models have seen wide adoption for 3D molecular generation, yet they offer no principled signal of when a generated molecule is likely to be of low quality. We propose a post-hoc method for estimating per-sample uncertainty in pretrained molecular diffusion models. Buil…

  49. arXiv cs.LG TIER_1 English(EN) · Metod Jazbec ·

    Uncertainty Estimation for Molecular Diffusion Models

    Diffusion models have seen wide adoption for 3D molecular generation, yet they offer no principled signal of when a generated molecule is likely to be of low quality. We propose a post-hoc method for estimating per-sample uncertainty in pretrained molecular diffusion models. Buil…

  50. arXiv cs.LG TIER_1 English(EN) · Muneendra Ojha ·

    Enhanced Low-Density Region Exploration in Classifier-Guided Diffusion Models Through Modified Reverse Diffusion Sampling

    Diffusion models have emerged as state-of-the-art generative models for high-fidelity image synthesis, particularly in their classifier-free guided and classifier-guided forms. However, standard classifier guidance concentrates probability mass around high-density class mean, lea…

  51. Hugging Face Daily Papers TIER_1 English(EN) ·

    DuET: Dual Expert Trajectories for Diffusion Image Editing

    Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the result appears, especially when the target scene d…

  52. arXiv cs.CL TIER_1 English(EN) · Lexington Whalen, Yuki Ito, Ryo Sakamoto ·

    Teaching Diffusion to Speculate Left-to-Right

    arXiv:2606.11552v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decodi…

  53. arXiv cs.LG TIER_1 English(EN) · Zhongxin Yang, Yuanwei Bin, Xiang I. A. Yang, Shiyi Chen ·

    Least-Action-Guided Diffusion for Physical Extrapolation

    arXiv:2606.11277v1 Announce Type: new Abstract: Reliable extrapolation remains a central challenge for generative models in computational physics, because models trained over finite ranges of time, parameters, or geometries may produce physically inconsistent predictions outside …

  54. arXiv cs.LG TIER_1 English(EN) · Yi Hu, Leying Yi, Emily Davis, Finn Carter ·

    On the Controllability-Fidelity Frontier in Diffusion Editing

    arXiv:2606.09901v1 Announce Type: cross Abstract: Diffusion-based generative models enable powerful image editing capabilities, but achieving precise control while maintaining fidelity and safety remains challenging. We present a comprehensive theoretical and empirical study of c…

  55. arXiv cs.LG TIER_1 English(EN) · Peng Wang, Huijie Zhang, Zekai Zhang, Siyi Chen, Yi Ma, Qing Qu ·

    Breaking the Curse of Dimensionality: Diffusion Models Efficiently Learn Low-Dimensional Distributions

    arXiv:2409.02426v5 Announce Type: replace Abstract: Despite their empirical success across a wide range of generative tasks, the fundamental principles underlying the ability of diffusion models to learn data distributions are poorly understood. In this work, we develop a new mat…

  56. arXiv cs.LG TIER_1 English(EN) · Huijie Zhang, Jinfan Zhou, Yifu Lu, Minzhe Guo, Peng Wang, Liyue Shen, Qing Qu ·

    The Emergence of Reproducibility and Generalizability in Diffusion Models

    arXiv:2310.05264v5 Announce Type: replace Abstract: In this work, we investigate an intriguing and prevalent phenomenon of diffusion models which we term as "consistent model reproducibility": given the same starting noise input and a deterministic sampler, different diffusion mo…

  57. arXiv cs.AI TIER_1 English(EN) · Matina Mahdizadeh Sani, Nima Jamali, Mohammad Jalali, Farzan Farnia ·

    MMD Guidance: Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance

    arXiv:2601.08379v2 Announce Type: replace-cross Abstract: Pre-trained diffusion models have emerged as powerful generative priors for both unconditional and conditional sample generation, yet their outputs often deviate from the characteristics of user-specific target data. Such …

  58. arXiv cs.CL TIER_1 English(EN) · Ryo Sakamoto ·

    Teaching Diffusion to Speculate Left-to-Right

    Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decoding addresses this bottleneck by employing a ligh…

  59. Hugging Face Daily Papers TIER_1 English(EN) ·

    Teaching Diffusion to Speculate Left-to-Right

    Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decoding addresses this bottleneck by employing a ligh…

  60. arXiv cs.LG TIER_1 English(EN) · Emma Finn, Binxu Wang, T. Anderson Keller, Demba E. Ba ·

    Where the Score Lives: A Wavelet View of Diffusion

    arXiv:2606.08309v1 Announce Type: new Abstract: Score-based generative models have had remarkable success over the last decade in generating a diverse set of visually plausible images. A variety of architectures including CNNs, U-Nets, and Transformers have been used as the score…

  61. arXiv cs.AI TIER_1 English(EN) · Danqi Zhuang, Jisui Huang, Xiaoyue Xi, Andrew Kiggins, Xiaojie Wang, Ke Chen, Yue Wu ·

    PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws

    arXiv:2606.09816v1 Announce Type: cross Abstract: Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explic…

  62. arXiv cs.LG TIER_1 English(EN) · Meher Chaitanya, Sebastian Dalleiger, Luana Ruiz ·

    Thresholded Local Hyper-Flow Diffusion

    arXiv:2606.09340v1 Announce Type: new Abstract: Local Hyper-Flow Diffusion (HFD) gives an edge-size-independent Cheeger-type guarantee for seeded clustering in general submodular hypergraphs, but existing HFD solvers do not keep intermediate computation local at every iteration. …

  63. arXiv cs.LG TIER_1 English(EN) · Shigui Li, Delu Zeng ·

    Mitigating the Contractivity Trap in Diffusion ODEs via Stein Stabilization

    arXiv:2606.07835v1 Announce Type: new Abstract: A fundamental tension exists in the large-step inference of diffusion models via their deterministic probability flow ordinary differential equation (PF-ODE) trajectories, which we identify as the contractivity trap: efficient infer…

  64. arXiv cs.AI TIER_1 English(EN) · Runze Li, Hanchen Wang, Wenjie Zhang, Binghao Li, Yu Zhang, Xuemin Lin, Ying Zhang ·

    FADTI: Fourier and Attention Driven Diffusion for Multivariate Time Series Imputation

    arXiv:2512.15116v2 Announce Type: replace-cross Abstract: Multivariate time series imputation is fundamental in applications such as healthcare, traffic forecasting, and biological modeling, where sensor failures and irregular sampling lead to pervasive missing values. However, e…

  65. arXiv cs.LG TIER_1 English(EN) · Xiao Li, Yixuan Jia, Zekai Zhang, Xiang Li, Lianghe Shi, Jinxin Zhou, Zhihui Zhu, Liyue Shen, Qing Qu ·

    Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

    arXiv:2606.09718v1 Announce Type: new Abstract: Diffusion models have demonstrated remarkable generative capabilities and have also emerged as powerful self-supervised representation learners, yet the connection between these two abilities remains less explored. Drawing inspirati…

  66. arXiv cs.AI TIER_1 English(EN) · Yue Wu ·

    PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws

    Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explicit structure for data concentrated near low-dimens…

  67. Hugging Face Daily Papers TIER_1 English(EN) ·

    PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws

    Standard diffusion models typically use a single time-homogeneous Gaussian terminal distribution as the reference law for generation. While this choice is analytically convenient and empirically powerful, it provides little explicit structure for data concentrated near low-dimens…

  68. arXiv cs.LG TIER_1 English(EN) · Qing Qu ·

    Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

    Diffusion models have demonstrated remarkable generative capabilities and have also emerged as powerful self-supervised representation learners, yet the connection between these two abilities remains less explored. Drawing inspiration from self-supervised learning (SSL), we intro…

  69. arXiv cs.LG TIER_1 English(EN) · Luana Ruiz ·

    Thresholded Local Hyper-Flow Diffusion

    Local Hyper-Flow Diffusion (HFD) gives an edge-size-independent Cheeger-type guarantee for seeded clustering in general submodular hypergraphs, but existing HFD solvers do not keep intermediate computation local at every iteration. We introduce Thresholded Local HFD (TL-HFD), a f…

  70. arXiv cs.LG TIER_1 English(EN) · Zekai Zhang, Xiao Li, Xiang Li, Lianghe Shi, Meng Wu, Molei Tao, Qing Qu ·

    Generalization of Diffusion Models Arises with a Balanced Representation Space

    arXiv:2512.20963v3 Announce Type: replace Abstract: Diffusion models excel at generating high-quality, diverse samples, yet they risk memorizing training data when overfit to the training objective. We analyze the distinctions between memorization and generalization in diffusion …

  71. arXiv cs.LG TIER_1 English(EN) · Lee Hyoseok, Sohwi Lim, Eunju Cha, Tae-Hyun Oh ·

    Measurement-Consistent Langevin Corrector for Stabilizing Latent Diffusion Inverse Problem Solvers

    arXiv:2601.04791v4 Announce Type: replace-cross Abstract: While latent diffusion models (LDMs) have emerged as powerful priors for inverse problems, existing LDM-based solvers frequently suffer from instability. In this work, we first identify the instability as a discrepancy bet…

  72. arXiv cs.AI TIER_1 English(EN) · Renzo G. Soatto, Anders Hoel, Greycen Ren, Shorna Alam, Stephen Bates, Nikolaos P. Daskalakis, Caroline Uhler, Maria Skoularidou ·

    CountsDiff: A Diffusion Model on the Natural Numbers for Generation and Imputation of Count-Based Data

    arXiv:2604.03779v2 Announce Type: replace-cross Abstract: Diffusion models have excelled at generative tasks for both continuous and token-based domains, but their application to discrete ordinal data remains underdeveloped. We present CountsDiff, a diffusion framework designed t…

  73. Hugging Face Daily Papers TIER_1 English(EN) ·

    MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

    Token-subset representation alignment method called MaskAlign improves diffusion transformer training by reducing reliance on complete token sets and maintaining stable alignment behavior under perturbations.

  74. arXiv cs.LG TIER_1 English(EN) · Demba E. Ba ·

    Where the Score Lives: A Wavelet View of Diffusion

    Score-based generative models have had remarkable success over the last decade in generating a diverse set of visually plausible images. A variety of architectures including CNNs, U-Nets, and Transformers have been used as the score-approximation network in such diffusion modelin…

  75. arXiv cs.LG TIER_1 English(EN) · Aditya Shankar, Yuandou Wang, Rihan Hai, Lydia Y. Chen ·

    Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

    arXiv:2602.07875v3 Announce Type: replace Abstract: Generating tabular data under conditions is critical to applications requiring precise control over the generative process. Existing methods rely on training-time strategies that do not generalise to unseen constraints during in…

  76. arXiv cs.LG TIER_1 English(EN) · Haoyang Cao, Minshuo Chen, Yinbin Han, Renyuan Xu ·

    Diffusion Models for Adaptive Sequential Data Generation

    arXiv:2606.06007v1 Announce Type: new Abstract: Generating realistic synthetic sequential data is critical in real-world applications across operations research, finance, healthcare, energy systems, and scientific computing, where time-indexed observations are used for prediction…

  77. arXiv cs.LG TIER_1 English(EN) · Parsa Esmati, Somjit Nath, Katja Hofmann, Derek Nowrouzezahrai, Samira Ebrahimi Kahou, Majid Mirmehdi ·

    The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

    arXiv:2606.05328v1 Announce Type: cross Abstract: Modern video diffusion models generate increasingly realistic and temporally coherent videos, motivating their use as candidate world simulators. Yet it remains unclear whether these models internally encode physical structure, or…

  78. arXiv cs.LG TIER_1 English(EN) · Hongkun Dou, Zike Chen, Fengji Li, Hongjue Li, Yue Deng ·

    Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction

    arXiv:2606.06303v1 Announce Type: new Abstract: Controllable generation with discrete diffusion models is often hindered by high computational overhead or the need for retraining. In this paper, we present \underline{\textbf{G}}radient-\underline{\textbf{I}}nformed \underline{\te…

  79. arXiv cs.AI TIER_1 English(EN) · Yue Deng ·

    Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction

    Controllable generation with discrete diffusion models is often hindered by high computational overhead or the need for retraining. In this paper, we present \underline{\textbf{G}}radient-\underline{\textbf{I}}nformed \underline{\textbf{L}}ogit \underline{\textbf{C}}orrection (\t…

  80. Hugging Face Daily Papers TIER_1 English(EN) ·

    Diffusion Models for Adaptive Sequential Data Generation

    Generating realistic synthetic sequential data is critical in real-world applications across operations research, finance, healthcare, energy systems, and scientific computing, where time-indexed observations are used for prediction, simulation, risk assessment, and data-driven d…

  81. arXiv cs.LG TIER_1 English(EN) · Haojun Qiu, Kiriakos N. Kutulakos, David B. Lindell ·

    Efficient and Training-Free Single-Image Diffusion Models

    arXiv:2606.04299v1 Announce Type: cross Abstract: We consider the problem of generating images whose internal structure -- defined by the distribution of patches across multiple scales -- matches that of a single reference image. Recent approaches address this problem by training…

  82. arXiv cs.LG TIER_1 English(EN) · Ram\'on Nartallo-Kaluarachchi, Renaud Lambiotte, Alain Goriely ·

    Drift-Diffusion Matching: Embedding dynamics in latent manifolds of asymmetric neural networks

    arXiv:2602.14885v2 Announce Type: replace-cross Abstract: Recurrent neural networks (RNNs) provide a theoretical framework for understanding computation in biological neural circuits, yet classical results, such as Hopfield's model of associative memory, rely on symmetric connect…

  83. arXiv cs.LG TIER_1 English(EN) · Lixing Zhang, Yidong Ouyang, Weifu Li, Shixiang Zhu, Guang Cheng, Liyan Xie ·

    Learning What Not to Impute: An Uncertainty-Aware Diffusion Framework for Meaningful Missingness

    arXiv:2606.05073v1 Announce Type: new Abstract: Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise fr…

  84. arXiv cs.CL TIER_1 English(EN) · Xinrui Song, Zhuoran Wang, Mingju Gao, Hao Tang ·

    SemBlock: Semantic Boundary Dynamic Blocks for Diffusion LLMs

    arXiv:2606.04964v1 Announce Type: new Abstract: Diffusion language models (DLMs) generate text through iterative denoising, and blockwise decoding improves their practicality by committing tokens in local blocks. However, existing blockwise methods typically rely on fixed block s…

  85. arXiv cs.AI TIER_1 English(EN) · Omer Dahary, Benaya Koren, Daniel Garibi, Daniel Cohen-Or ·

    On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

    arXiv:2603.28762v2 Announce Type: replace-cross Abstract: Modern Text-to-Image (T2I) diffusion models have achieved remarkable semantic alignment, yet they often suffer from a significant lack of variety, converging on a narrow set of visual solutions for any given prompt. This t…

  86. Hugging Face Daily Papers TIER_1 English(EN) ·

    Complexity-Balanced Diffusion Splitting

    Complexity-Balanced Splitting (CBS) allocates generative capacity across specialized sub-networks by partitioning the diffusion timeline based on local complexity measures, improving synthesis quality without increasing inference cost.

  87. Hugging Face Daily Papers TIER_1 English(EN) ·

    Learning What Not to Impute: An Uncertainty-Aware Diffusion Framework for Meaningful Missingness

    Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise from two distinct sources: some entries are meanin…

  88. arXiv cs.LG TIER_1 English(EN) · Liyan Xie ·

    Learning What Not to Impute: An Uncertainty-Aware Diffusion Framework for Meaningful Missingness

    Missing value imputation is a fundamental task in machine learning, with most existing methods assuming that all missing entries correspond to unobserved regular values. In many real-world datasets, however, missingness may arise from two distinct sources: some entries are meanin…

  89. arXiv cs.CL TIER_1 English(EN) · Hao Tang ·

    SemBlock: Semantic Boundary Dynamic Blocks for Diffusion LLMs

    Diffusion language models (DLMs) generate text through iterative denoising, and blockwise decoding improves their practicality by committing tokens in local blocks. However, existing blockwise methods typically rely on fixed block sizes or delimiter-based runtime signals, which d…

  90. arXiv cs.AI TIER_1 English(EN) · Davide Gallon, Philippe von Wurstemberger, Patrick Cheridito, Arnulf Jentzen ·

    Physics-informed diffusion models in spectral space

    arXiv:2602.09708v2 Announce Type: replace-cross Abstract: We propose physics-informed spectral diffusion (PISD), a methodology that combines generative latent diffusion models with physics-informed machine learning to generate solutions of partial differential equations (PDEs) co…

  91. arXiv cs.AI TIER_1 English(EN) · Shaokun Lan, Haoran Dou, Jinghan Huang, Arezoo Zakeri, Fengming Lin, Zherui Zhou, Jinming Duan, Alejandro F. Frangi ·

    Conditional Latent Diffusion Model with Fourier-based Motion Modelling for Virtual Population Synthesis

    arXiv:2606.03827v1 Announce Type: cross Abstract: In-silico trials of medical devices require the generation of virtual populations of anatomies. In cardiovascular applications, virtual anatomy is typically represented as a 3D+t mesh sampled from a generative model. However, most…

  92. arXiv cs.AI TIER_1 English(EN) · Siva Rajesh Kasa, Yasong Dai, Sumit Negi, Hongdong Li ·

    Fast-dLLM++: Fr\'{e}chet Profile Decoding for Faster Diffusion LLM Inference

    arXiv:2606.02955v1 Announce Type: cross Abstract: Diffusion large language models promise parallel token generation, yet inference remains bottlenecked by deciding which masked tokens can be safely committed together. Fast-dLLM addressed this with KV caching and confidence-guided…

  93. arXiv cs.LG TIER_1 English(EN) · Niccol\`o Perrone, Fanny Lehmann, Stefania Fresca, Filippo Gatti ·

    Correcting Neural Operator Spectral Bias via Diffusion Posterior Sampling with Sparse Observations

    arXiv:2606.03936v1 Announce Type: new Abstract: Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale s…

  94. arXiv cs.LG TIER_1 English(EN) · Kexiang Mao ·

    Flicker-DDPM: Accelerating Denoising Diffusion via 1/f Colored Noise Injection

    arXiv:2606.03393v1 Announce Type: new Abstract: We propose a novel diffusion model, Flicker-DDPM, which incorporates flicker (1/f) noise inspired by self-organized criticality (SOC), a widely observed phenomenon in natural systems. Unlike denoising diffusion probabilistic models …

  95. arXiv cs.LG TIER_1 English(EN) · Zerui Tao, Qibin Zhao ·

    Bayesian Tensor Decomposition with Diffusion Model Prior

    arXiv:2606.03212v1 Announce Type: new Abstract: Low-rank tensor decomposition (TD) is usually effective on clean, fully observed data, but it often degrades under severe missingness or noise. Low-rankness is itself a useful but limited structural prior, and additional handcrafted…

  96. arXiv cs.AI TIER_1 English(EN) · Yifu Luo, Yongzhe Chang, Xueqian Wang ·

    Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning

    arXiv:2509.19305v2 Announce Type: replace-cross Abstract: Diffusion probability models have shown significant promise in offline reinforcement learning by directly modeling trajectory sequences. However, existing approaches primarily focus on time-domain features while overlookin…

  97. arXiv cs.AI TIER_1 English(EN) · David Turtora Zagardo ·

    Geometry-Aware Tabular Diffusion

    arXiv:2606.02607v1 Announce Type: cross Abstract: Tabular synthesis is critical for privacy-preserving sharing and augmentation, yet diffusion models rely on implicit mechanisms to capture inter-column relationships. We introduce Geometry-Aware Tabular Diffusion (GATD), which aug…

  98. Hugging Face Daily Papers TIER_1 English(EN) ·

    Neural Galerkin Normalizing Flows for Bayesian Inference of Diffusions with Inaccessible Boundaries

    One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observation times, which is needed to derive the likelih…

  99. arXiv cs.LG TIER_1 English(EN) · Filippo Gatti ·

    Correcting Neural Operator Spectral Bias via Diffusion Posterior Sampling with Sparse Observations

    Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale structure matters. Sparse sensor measurements of …

  100. Hugging Face Daily Papers TIER_1 English(EN) ·

    Correcting Neural Operator Spectral Bias via Diffusion Posterior Sampling with Sparse Observations

    Neural operator surrogates (NO) approximate PDE solutions orders of magnitude faster than numerical solvers, but suffer from spectral bias: high-frequency content is systematically attenuated, limiting reliability where fine-scale structure matters. Sparse sensor measurements of …

  101. arXiv cs.AI TIER_1 English(EN) · Alejandro F. Frangi ·

    Conditional Latent Diffusion Model with Fourier-based Motion Modelling for Virtual Population Synthesis

    In-silico trials of medical devices require the generation of virtual populations of anatomies. In cardiovascular applications, virtual anatomy is typically represented as a 3D+t mesh sampled from a generative model. However, most existing mesh generators focus on static anatomy,…

  102. arXiv cs.LG TIER_1 English(EN) · Simon De Reuver, Tamas Kristof Toth, Teddy Lazebnik ·

    Data Enrichment for Symbolic Regression Using Diffusion Models

    arXiv:2606.00988v1 Announce Type: new Abstract: Symbolic regression (SR) offers a route to scientific discovery by converting observations into interpretable governing equations. However, despite its promise, its reliability degrades sharply when spatiotemporal measurements are s…

  103. arXiv cs.LG TIER_1 English(EN) · Guanyu Zhou, Yao Liu, Yanglei Gan, Yuxiang Cai, Peng He, Run Lin, Yuxiang Liu, Qiao Liu ·

    GLIDE: Graph-guided Leap Inference for Diffusion Estimation of Spatio-Temporal Point Processes

    arXiv:2606.01273v1 Announce Type: new Abstract: Spatio-temporal point processes (STPPs) provide a principled framework for modeling asynchronous events in continuous time and space. Recent diffusion-based approaches offer a flexible alternative to deterministic prediction by mode…

  104. arXiv cs.AI TIER_1 English(EN) · Longxuan Yu, Yunshu Wu, Yu Fu, Siheng Xiong, Rob Brekelmans, Hui Liu, Yue Dong, Greg Ver Steeg ·

    DSL-LLaDA: Scaling Continuous Denoising to 8B Masked Diffusion LMs

    arXiv:2606.01024v1 Announce Type: cross Abstract: Discrete Masked diffusion language models generate text by iterative parallel decoding, but few-step decoding suffers from a tradeoff between length and quality: with a fixed step budget, standard methods can generate a short, hig…

  105. arXiv cs.LG TIER_1 English(EN) · Pengfei Jin, Yiqi Tian, Kailong Fan, Bingjie Qi, Quanzheng Li ·

    Hallucination-Aware Diffusion Sampling for Inverse Problems via Robust Prior Updates

    arXiv:2606.02331v1 Announce Type: cross Abstract: Diffusion-based inverse problem solvers can produce realistic reconstructions, but realism alone does not ensure that the recovered details are supported by the measurement. We study this failure as measurement-conditioned halluci…

  106. arXiv cs.AI TIER_1 English(EN) · Ziseok Lee, Minyeong Hwang, Wooyeol Lee, Sanghyun Jo, Jihyung Ko, Young Bin Park, Jae-Mun Choi, Eunho Yang, Kyungsu Kim ·

    On the Collapse of Generative Paths: A Criterion and Correction for Diffusion Steering

    arXiv:2512.10339v2 Announce Type: replace Abstract: Inference-time steering adapts pretrained diffusion and flow models to new tasks without retraining, often utilizing ratio-of-densities constructions that reweight time-indexed marginals with fixed exponents. We identify Margina…

  107. arXiv cs.LG TIER_1 English(EN) · Hamza Cherkaoui, H\'el\`ene Halconruy, Antonio Ocello ·

    Do Heavy Tails Help Diffusion? On the Subtle Trade-off Between Initialization and Training

    arXiv:2605.13175v2 Announce Type: replace Abstract: Recent works have proposed incorporating heavy-tailed (HT) noise into diffusion- and flow-based generative models, with the goals of better recovering the tails of target distributions and improving generative diversity. This mo…

  108. arXiv cs.AI TIER_1 English(EN) · Renhao Zhang, Haotian Fu, Mingxi Jia, George Konidaris, Yilun Du, Bruno Castro da Silva ·

    From Noise to Control: Parameterized Diffusion Policies

    arXiv:2606.00336v1 Announce Type: new Abstract: We propose Parameterized Diffusion Policy (PDP), a framework for learning diffusion policies conditioned on low-dimensional, continuous parameters embedded in a learned behavior manifold. By constructing this manifold so that distan…

  109. arXiv cs.LG TIER_1 English(EN) · Daniela Breitman, Andrei Mesinger, Steven G. Murray, Ivan Nikolic, Roberto Trotta ·

    21cmEMUv3: a hybrid diffusion-LSTM emulator of 21cmFAST summary observables

    arXiv:2606.00219v1 Announce Type: cross Abstract: We are witnessing a surge in observations of the cosmic dawn (CD) and epoch of reionisation (EoR), driving an increasing demand for fast and robust theoretical interpretation frameworks. In response, machine learning (ML), and emu…

  110. arXiv cs.AI TIER_1 English(EN) · Abdullah Al Shafi, Kazi Saeed Alam, Sk Imran Hossain, Engelbert Mephu Nguifo ·

    DASH: Dual-Branch Score Distillation for Guidance-Calibrated Compact Diffusion Models

    arXiv:2606.00798v1 Announce Type: cross Abstract: Parameter compression of class-conditional diffusion models reveals an underexplored limitation in output-level distillation: the unconditional score branch remains unsupervised, leaving the classifier-free guidance gap underdeter…

  111. arXiv cs.AI TIER_1 English(EN) · Zihan Li, Jialan Zheng, Ziyu Li, Xun Yuan, Kasidit Anmahapong, Ziang Wang, Mingxuan Liu, Hongjia Yang, Yifei Chen, Zhuhao Wang, Yuhang He, Fang Chen, Rui Li, Huaiqiang Sun, Yi Liao, Congyu Liao, Yang Yang, Haibo Qu, Xue Zhang, Hongen Liao, Qiyuan Tian ·

    A physics-informed foundation model for quantitative diffusion MRI

    arXiv:2606.00156v1 Announce Type: cross Abstract: Understanding the human brain requires access to its microscopic tissue architecture. Diffusion magnetic resonance imaging (MRI) provides the only noninvasive window into whole-brain microstructure in vivo, yet reliable quantitati…

  112. arXiv cs.AI TIER_1 English(EN) · Duoduo Xue, Zhiyu Zhu, Junhui Hou ·

    Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry

    arXiv:2606.00094v1 Announce Type: cross Abstract: Image generative models aim to sample data points from the underlying data manifold, a task that requires learning and decoding a dense, low-dimensional, and compact parameterization space. To achieve this, we propose the Data Man…

  113. arXiv cs.AI TIER_1 English(EN) · Naoki Murata, Yuhta Takida, Chieh-Hsin Lai, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, Yuki Mitsufuji ·

    GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

    arXiv:2601.22651v2 Announce Type: replace-cross Abstract: Training-data attribution for vision generative models aims to identify which training data influenced a given output. While most methods score individual examples, practitioners often need group-level answers (e.g., artis…

  114. arXiv cs.AI TIER_1 English(EN) · Yeongmin Kim, Donghyeok Shin, Byeonghu Na, Minsang Park, Richard Lee Kim, Il-Chul Moon ·

    Lookahead Sample Reward Guidance for Test-Time Scaling of Diffusion Models

    arXiv:2602.03211v2 Announce Type: replace-cross Abstract: Diffusion models have demonstrated strong generative performance; however, generated samples often fail to fully align with human intent. This paper studies an efficient test-time scaling method for sampling from regions w…

  115. arXiv cs.AI TIER_1 English(EN) · Sajad Ebrahimi, Bhaskar Mitra, Negar Arabzadeh, Ye Yuan, Haolun Wu, Fattane Zarrinkalam, Ebrahim Bagheri ·

    From Noise to Order: Learning to Rank via Denoising Diffusion

    arXiv:2602.11453v2 Announce Type: replace-cross Abstract: In information retrieval (IR), learning-to-rank (LTR) methods have traditionally limited themselves to discriminative machine learning approaches that model the probability of the document being relevant to the query given…

  116. arXiv cs.AI TIER_1 English(EN) · Zhiying Jiang, Raihan Seraj, Marcos Villagra, Bidhan Roy ·

    Heterogeneous Decentralized Diffusion Models

    arXiv:2603.06741v2 Announce Type: replace-cross Abstract: Training frontier-scale diffusion models often requires substantial computational resources concentrated in tightly-coupled clusters, limiting participation to well-resourced institutions. While Decentralized Diffusion Mod…

  117. arXiv cs.LG TIER_1 English(EN) · Jama Hussein Mohamud, Mohsin Hasan, Mirco Ravanelli, Yoshua Bengio ·

    Adaptive Order Policies for Masked Diffusion

    arXiv:2606.00295v1 Announce Type: new Abstract: Masked diffusion models have seen great success in capturing data distributions over discrete sequences in domains such as text and proteins. These models generate data by iteratively unmasking tokens starting from a fully masked se…

  118. arXiv cs.LG TIER_1 English(EN) · Constant Bourdrez, Alexandre V\'erine, Olivier Capp\'e ·

    Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning

    arXiv:2602.08689v2 Announce Type: replace Abstract: Diffusion models generate samples through an iterative denoising process guided by a pretrained neural network. Once the denoiser is fixed, the sampling algorithm itself (noise schedules, guidance scales, stochasticity profiles)…

  119. arXiv cs.LG TIER_1 English(EN) · Anjian Li, Bartolomeo Stellato, Ryne Beeson ·

    GLENS: Global Search via Learning from Solver Iterates with Diffusion Models

    arXiv:2606.00366v1 Announce Type: new Abstract: We consider the problem of generating a large collection of initial guesses for local minima of multimodal non-convex continuous optimization problems. The goal is for these initial guesses to be high-quality (i.e., a numerical solv…

  120. arXiv cs.LG TIER_1 English(EN) · Quanzheng Li ·

    Hallucination-Aware Diffusion Sampling for Inverse Problems via Robust Prior Updates

    Diffusion-based inverse problem solvers can produce realistic reconstructions, but realism alone does not ensure that the recovered details are supported by the measurement. We study this failure as measurement-conditioned hallucination: visually meaningful content that is either…

  121. Hugging Face Daily Papers TIER_1 English(EN) ·

    Error Bounds for a Diffusion Model-Based Drift Estimator

    Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drift when the diffusion parameter is known, using d…

  122. arXiv cs.AI TIER_1 English(EN) · Jinwoo Kim, S\'ekou-Oumar Kaba, Jiyun Park, Seunghoon Hong, Siamak Ravanbakhsh ·

    Inverting Data Transformations via Diffusion Sampling

    arXiv:2602.08267v2 Announce Type: replace-cross Abstract: We study the problem of transformation inversion on general Lie groups: a datum is transformed by an unknown group element, and the goal is to recover an inverse transformation that maps it back to the original data distri…

  123. arXiv cs.LG TIER_1 English(EN) · Victor M. Yeom-Song, Severi Rissanen, Arno Solin, Samuel Kaski, Mingfei Sun ·

    Softly Constrained Denoisers for Diffusion Models Applied to Partial Differential Equations

    arXiv:2512.14980v4 Announce Type: replace Abstract: Diffusion models have become a powerful generative prior for solutions of partial differential equations (PDEs). Existing approaches enforce physical constraints either by adding the PDE residuals as loss regularizers or through…

  124. arXiv cs.AI TIER_1 English(EN) · Enrico Cassano, Riccardo Renzulli, Marco Nurisso, Mirko Zaffaroni, Alan Perotti, Marco Grangetto ·

    SAEmnesia: Erasing Concepts in Diffusion Models with Supervised Sparse Autoencoders

    arXiv:2509.21379v3 Announce Type: replace-cross Abstract: Concept unlearning in diffusion models is hampered by feature splitting, where concepts are distributed across many latent features, making their removal challenging and computationally expensive. We introduce SAEmnesia, a…

  125. arXiv cs.CL TIER_1 English(EN) · Zekai Li, Ji Liu, Yiqing Huang, Ziqiong Liu, Dong Li, Emad Barsoum ·

    Efficient Diffusion LLMs via Temporal-Spatial Parallel Decoding and Confidence Extrapolation

    arXiv:2605.30753v1 Announce Type: new Abstract: Diffusion-based large language models (dLLMs) support parallel text generation via iterative denoising, yet inference remains latency-heavy because many steps are spent on redundant refinement and repeated remasking of tokens whose …

  126. arXiv cs.LG TIER_1 English(EN) · Alireza Kheirandish, Jihoon Hong, Sara Fridovich-Keil ·

    KLIP: localized distribution shift detection via KL-divergence with diffusion priors in Inverse Problems

    arXiv:2605.31596v1 Announce Type: cross Abstract: Diffusion models have shown promising performance as data-driven priors for computational imaging, as well as some capacity to detect out-of-distribution (OOD) images. However, existing approaches to OOD detection often require so…

  127. Hugging Face Daily Papers TIER_1 English(EN) ·

    Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

    Decoupled Residual Denoising Diffusion models (DRDD) improve unified image-to-image translation by separating noise diffusion for domain harmonization from residual diffusion for semantic mapping, enhancing data efficiency and performance.

  128. arXiv cs.LG TIER_1 English(EN) · Sara Fridovich-Keil ·

    KLIP: localized distribution shift detection via KL-divergence with diffusion priors in Inverse Problems

    Diffusion models have shown promising performance as data-driven priors for computational imaging, as well as some capacity to detect out-of-distribution (OOD) images. However, existing approaches to OOD detection often require some knowledge of the shifted distribution, fail to …

  129. arXiv cs.AI TIER_1 English(EN) · Xincheng Wang, Hanchi Sun, Wenjun Sun, Kejun Xue, Wangqiu Zhou, Jianbo Zhang, Wei Sun, Dandan Zhu, Xiongkuo Min, Jun Jia, Zhijun Fang ·

    Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach

    arXiv:2511.19316v2 Announce Type: replace-cross Abstract: Recent fine-tuning techniques for diffusion models enable them to reproduce specific image sets, such as particular faces or artistic styles, but also introduce copyright and security risks. Dataset watermarking has been p…

  130. arXiv cs.AI TIER_1 English(EN) · Dueun Kim, Albert No ·

    The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models

    arXiv:2605.29123v1 Announce Type: new Abstract: Masked diffusion language models (MDMs) uniquely support any-order generation, with confidence-based decoding currently serving as the de facto standard inference policy. To optimize for this, recent training schemes attempt to alig…

  131. arXiv cs.AI TIER_1 English(EN) · Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang ·

    NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

    arXiv:2605.29716v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising non-autoregressive generative paradigm. Given the prohibitive computational cost of full fine-tuning, Parameter-Efficient Fine-Tuning (PEFT) has become the standard…

  132. arXiv cs.AI TIER_1 English(EN) · Yuhao Sun, Lingyun Yu, Haoxiang Xu, Fengyuan Miao, Zhuoer Xu, Hongtao Xie ·

    Orthogonal Concept Erasure for Diffusion Models

    arXiv:2605.28902v1 Announce Type: new Abstract: Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations. While training-based methods are effective, their high computa…

  133. arXiv cs.LG TIER_1 English(EN) · Itamar Levine, Yair Weiss ·

    Diffusion Models, Denoiser Architecture and Creativity

    arXiv:2605.16415v2 Announce Type: replace-cross Abstract: The creativity of diffusion models refers to their ability to generate highly realistic images that are different from their training data. Creativity is somewhat surprising since it is known that if the denoiser used in t…

  134. arXiv cs.AI TIER_1 English(EN) · Antoni Kowalczuk, Dominik Hintersdorf, Lukas Struppek, Kristian Kersting, Adam Dziedzic, Franziska Boenisch ·

    Finding DoRI: Discovery of Retained Images in Diffusion Models

    arXiv:2507.16880v3 Announce Type: replace-cross Abstract: Text-to-image diffusion models (DMs) have achieved remarkable success in image generation. However, concerns about data privacy and intellectual property remain due to their potential to inadvertently memorize and replicat…

  135. arXiv cs.LG TIER_1 English(EN) · Leyi Qi, Yiming Li, Siyuan Liang, Zhengzhong Tu, Dacheng Tao ·

    Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing

    arXiv:2605.29809v1 Announce Type: cross Abstract: Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingl…

  136. arXiv cs.LG TIER_1 English(EN) · Benjamin A. Burns, Sara Fridovich-Keil ·

    When, why, and how do diffusion posterior samplers fail? A finite-sample lens

    arXiv:2605.30330v1 Announce Type: new Abstract: Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any mea…

  137. arXiv cs.LG TIER_1 English(EN) · Danylo Boiko, Viktoriia Mishkurova ·

    Treatment-Conditioned Diffusion for Forecasting Neurodegenerative Disease Progression

    arXiv:2605.29932v1 Announce Type: new Abstract: Forecasting the progression of neurodegenerative diseases, such as Parkinson's disease, is essential for effective long-term planning and personalized therapeutic intervention. Existing systems typically produce scalar clinical scor…

  138. arXiv cs.LG TIER_1 English(EN) · Gabriel Moreira, Manuel Marques, Jo\~ao Paulo Costeira, Chenyan Xiong ·

    Spectral Guidance for Flexible and Efficient Control of Diffusion Models

    arXiv:2605.28900v1 Announce Type: new Abstract: We introduce Spectral Guidance, a framework for controlling diffusion models by leveraging the intrinsic geometry of the generative process. As data is progressively corrupted by noise, only a small number of features remain informa…

  139. arXiv cs.LG TIER_1 English(EN) · Sara Fridovich-Keil ·

    When, why, and how do diffusion posterior samplers fail? A finite-sample lens

    Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any measurement model at inference time but must use an…

  140. arXiv cs.AI TIER_1 English(EN) · Seunghyeok Shin, Minwoo Kim, Dabin Kim, Hongki Lim ·

    Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping

    arXiv:2605.27990v1 Announce Type: cross Abstract: Diffusion posterior sampling conditions diffusion priors on measurements, but data-consistency updates are typically scaled by hand-tuned guidance weights and can destabilize sampling under stiff, operator-dependent curvature. We …

  141. arXiv cs.LG TIER_1 English(EN) · Abduragim Shtanchaev, Albina Ilina, Yazid Janati, Arip Asadulaev, Martin Takac, Eric Moulines ·

    Sparse Scheduled Diffusion Guidance for Inverse Problems

    arXiv:2603.07860v2 Announce Type: replace Abstract: Pretrained diffusion models are effective priors for Bayesian inverse problems, but posterior sampling with these priors is often costly because data-consistency guidance is applied throughout the full reverse trajectory. Existi…

  142. arXiv cs.LG TIER_1 English(EN) · Andrew Millard, Fredrik Lindsten, Zheng Zhao ·

    Particle-Guided Diffusion Models for Partial Differential Equations

    arXiv:2601.23262v2 Announce Type: replace Abstract: We introduce a guided stochastic sampling method that augments sampling from diffusion models with physics-based guidance derived from partial differential equation (PDE) residuals and observational constraints, ensuring generat…

  143. arXiv cs.LG TIER_1 English(EN) · Nithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsen ·

    Representation-Conditioned Diffusion Models for Guided Training Data Generation

    arXiv:2605.27495v1 Announce Type: cross Abstract: Data availability remains a critical bottleneck in many deep learning applications. Large-scale datasets are often expensive to collect, curate and annotate, which can limit the scalability and applicability of supervised learning…

  144. arXiv cs.LG TIER_1 English(EN) · Zhengyang Liang, Qihang Zhang, Ceyuan Yang ·

    Explicit Critic Guidance for Aligning Diffusion Models

    arXiv:2605.27736v1 Announce Type: new Abstract: Online reinforcement learning is becoming increasingly important for aligning diffusion models with non-differentiable objectives. However, existing methods still face limitations in assigning fine-grained credit along denoising tra…

  145. arXiv cs.AI TIER_1 English(EN) · Peiliang Cai, Jiacheng Liu, Haowen Xu, Xinyu Wang, Chang Zou, Linfeng Zhang ·

    LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration

    arXiv:2602.20497v3 Announce Type: replace-cross Abstract: Diffusion models have achieved remarkable success in image and video generation tasks. However, the high computational demands of Diffusion Transformers (DiTs) pose a significant challenge to their practical deployment. Wh…

  146. arXiv cs.AI TIER_1 English(EN) · Gabriel Raya, Bac Nguyen, Georgios Batzolis, Yuhta Takida, Dejan Stancevic, Naoki Murata, Chieh-Hsin Lai, Yuki Mitsufuji, Luca Ambrogioni ·

    Noise Scheduling as Information-Guided Allocation in Diffusion Training

    arXiv:2602.18647v2 Announce Type: replace-cross Abstract: We introduce InfoNoise, an online adaptive noise schedule for diffusion training that reallocates optimization effort toward noise levels where denoising is most informative. Together with loss weighting, a noise schedule …

  147. arXiv cs.AI TIER_1 English(EN) · Chieh-Hsin Lai, Yang Song, Dongjun Kim, Yuki Mitsufuji, Stefano Ermon ·

    The Principles of Diffusion Models

    arXiv:2510.21890v2 Announce Type: replace-cross Abstract: This book presents the core principles that have guided the development of diffusion models, tracing their origins and showing how diverse formulations arise from shared mathematical ideas. Diffusion modeling starts by def…

  148. arXiv cs.AI TIER_1 English(EN) · Calvin Yeung, Prathyush Poduval, Ali Zakeri, Zhuowen Zou, Mohsen Imani ·

    Residualized Temporal Sparse Autoencoders for Interpreting Diffusion Models

    arXiv:2605.27813v1 Announce Type: cross Abstract: Text-to-image diffusion models generate images through an iterative denoising process, so internal neural layers produce trajectories of activations rather than single static representations. Sparse autoencoders (SAEs) have recent…

  149. arXiv cs.AI TIER_1 English(EN) · Hyunmin Cho, Woo Kyoung Han, Kyong Hwan Jin ·

    Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective

    arXiv:2605.27476v1 Announce Type: cross Abstract: We characterize the pre-softmax attention matrix $\mathbf{QK^\top}$ in transformers as an associative memory matrix encoding pairwise associations between input features. By decomposing this matrix into its symmetric and skew-symm…

  150. Hugging Face Daily Papers TIER_1 English(EN) ·

    Colored Noise Diffusion Sampling

    Diffusion models exhibit spectral bias in image synthesis, and a new sampling method called Colored Noise Sampling addresses this by dynamically allocating energy based on frequency-dependent schedules, leading to improved image quality metrics.

  151. arXiv cs.LG TIER_1 English(EN) · Loukas Sfountouris, Giannis Daras, Paris Giampouras ·

    Align & Invert: Solving Inverse Problems with Diffusion and Flow-based Models via Representation Alignment

    arXiv:2511.16870v3 Announce Type: replace-cross Abstract: Enforcing alignment between the internal representations of diffusion or flow-based generative models and those of pretrained self-supervised encoders has recently been shown to provide a powerful inductive bias, improving…

  152. arXiv cs.LG TIER_1 English(EN) · Funing Fu, Tenghui Wang, Junyong Cen, Qichao Zhu, Guanyu Zhou ·

    JLT: Clean-Latent Prediction in Latent Diffusion Transformers

    arXiv:2605.27102v1 Announce Type: cross Abstract: Flow matching with clean-data prediction has shown that regressing the clean point can exploit low-dimensional structure more effectively than predicting an ambient noised quantity. We ask whether this principle remains useful aft…

  153. arXiv cs.LG TIER_1 English(EN) · Nithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsen ·

    Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

    arXiv:2605.27343v1 Announce Type: cross Abstract: Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such…

  154. arXiv cs.LG TIER_1 English(EN) · Nicola Novello, Federico Fontana, Luigi Cinque, Deniz Gunduz, Andrea M. Tonello ·

    A Unified Framework for Diffusion Model Unlearning with f-Divergence

    arXiv:2509.21167v2 Announce Type: replace Abstract: Most existing methods for concept unlearning in text-to-image diffusion models minimize a mean squared error (MSE) loss between the denoiser outputs conditioned on a target and an anchor concept, which is implicitly the KL diver…

  155. arXiv cs.AI TIER_1 English(EN) · William Yuan, Sungwon Jeong, Amirali Aghazadeh ·

    On the Error-Correcting Effects of Stochasticity in Discrete Diffusion

    arXiv:2605.26582v1 Announce Type: cross Abstract: Discrete diffusion models achieve strong performance in text and image generation, but their inference remains slow and must inherently balance sampling efficiency and sample quality. In this work, we present a systematic study of…

  156. arXiv cs.AI TIER_1 English(EN) · Junseo Bang, Joonhee Lee, Kyeonghyun Lee, Haechang Lee, Dong Un Kang, Se Young Chun ·

    Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution

    arXiv:2506.07813v2 Announce Type: replace-cross Abstract: Arbitrary-scale image super-resolution aims to upsample images to any desired resolution, offering greater flexibility than traditional fixed-scale super-resolution. Recent approaches based on regression-based or generativ…

  157. arXiv cs.AI TIER_1 English(EN) · Mahdi Farahbakhsh, Vishnu Teja Kunde, Dileep Kalathil, Krishna Narayanan, Jean-Francois Chamberland ·

    Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction

    arXiv:2510.03352v3 Announce Type: replace-cross Abstract: Diffusion models have been used as priors for solving inverse problems. However, existing approaches typically overlook side information that could significantly improve reconstruction quality, especially in severely ill-p…

  158. arXiv cs.LG TIER_1 English(EN) · Xing Cong, Hanlin Tang, Kan Liu, Lan Tao, Lin Qu, Chenhao Xie ·

    RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

    arXiv:2605.26632v1 Announce Type: new Abstract: Diffusion Transformers (DiT) achieve strong performance in image generation but incur substantial inference costs. While prior work has reduced this cost via quantization and distillation, semi-structured sparsity, which can nearly …

  159. arXiv cs.LG TIER_1 English(EN) · Gwangho Kim, Sungyoon Lee ·

    Localizing Memorized Regions in Diffusion Models via Coordinate-Wise Curvature Differences

    arXiv:2605.26756v1 Announce Type: new Abstract: Diffusion models can unintentionally memorize training samples, raising concerns about privacy and copyright. While recent methods can detect memorization, they often rely on global or model-specific signals and provide limited insi…

  160. arXiv cs.LG TIER_1 English(EN) · Gabriel Eilertsen ·

    Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

    Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text prompts or semantic maps, which require e…

  161. arXiv cs.LG TIER_1 English(EN) · Guanyu Zhou ·

    JLT: Clean-Latent Prediction in Latent Diffusion Transformers

    Flow matching with clean-data prediction has shown that regressing the clean point can exploit low-dimensional structure more effectively than predicting an ambient noised quantity. We ask whether this principle remains useful after images are mapped into a learned latent space, …

  162. Hugging Face Daily Papers TIER_1 English(EN) ·

    On the Error-Correcting Effects of Stochasticity in Discrete Diffusion

    Discrete diffusion models achieve strong performance in text and image generation, but their inference remains slow and must inherently balance sampling efficiency and sample quality. In this work, we present a systematic study of how the \emph{degree of stochasticity} in Markov …

  163. arXiv cs.AI TIER_1 English(EN) · Sol Park, Soobin Um ·

    Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion

    arXiv:2605.24631v1 Announce Type: cross Abstract: Minority sampling aims to generate low-density instances on a data manifold and is of central importance in applications such as medical diagnosis, anomaly detection, and creative AI. Existing approaches, however, define minority …

  164. arXiv cs.LG TIER_1 English(EN) · R\'emi Bourgerie, \v{S}ar\=unas Girdzijauskas, Viktoria Fodor ·

    Deep Neural Sheaf Diffusion

    arXiv:2605.19021v2 Announce Type: replace Abstract: Deep Graph Neural Networks (GNNs) are essential for capturing complex dependencies in graph-structured data. However, scaling GNNs to depth remains challenging, as stacking layers leads to representation collapse and diminishing…

  165. arXiv cs.LG TIER_1 English(EN) · Arran Carter, Sanghyeok Choi, Kirill Tamogashev, V\'ictor Elvira, Nikolay Malkin ·

    Discrete diffusion samplers and bridges: Off-policy algorithms and applications in latent spaces

    arXiv:2602.05961v2 Announce Type: replace Abstract: Sampling from a distribution $p(x) \propto e^{-\mathcal{E}(x)}$ known up to a normalising constant is an important and challenging problem in statistics. Recent years have seen the rise of a new family of amortised sampling algo…

  166. arXiv cs.AI TIER_1 English(EN) · Weixin Wang, Yu Yang, Wei Deng, Pan Xu ·

    Inference-Time Alignment of Diffusion Models via Trust-Region Iterative Twisted Sequential Monte Carlo

    arXiv:2605.25123v1 Announce Type: cross Abstract: We study inference-time alignment for diffusion-based generative models, aiming to steer a base model toward high-reward outputs without updating its weights. Recent Sequential Monte Carlo (SMC)-based steering methods approximate …

  167. arXiv cs.AI TIER_1 English(EN) · Ziheng Cheng, Yixiao Huang, Hanlin Zhu, Haoran Geng, Somayeh Sojoudi, Jitendra Malik, Pieter Abbeel, Xin Guo ·

    Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning

    arXiv:2605.25210v1 Announce Type: cross Abstract: Diffusion models are increasingly used as powerful conditional generators, yet real deployments often involve multiple target distributions arising from different tasks, e.g., diverse prompt domains in text-to-image generation, or…

  168. arXiv cs.AI TIER_1 English(EN) · Qingyuan Zeng, Pengxiang Cai, Zixin Guan, Ziyang Chen, Anglin Liu, Lang Qin, Xinyao Lai, Jintai Chen ·

    Don't Retrain, Just Reuse: Recovering Dual-Target Molecules from Single-Target Diffusion Models

    arXiv:2605.25681v1 Announce Type: cross Abstract: Designing a single molecule that modulates two targets is a promising strategy for polypharmacology, but it remains substantially harder than standard single-target generation because one candidate must satisfy two binding require…

  169. arXiv cs.AI TIER_1 English(EN) · Zixin Jessie Chen, Zhuo Chen, Archer Wang, Jeff Gore, William T. Freeman, Congyue Deng, Marin Solja\v{c}i\'c ·

    Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

    arXiv:2605.26032v1 Announce Type: cross Abstract: Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introd…

  170. arXiv cs.AI TIER_1 English(EN) · Matthew Niedoba, Berend Zwartsenberg, Frank Wood ·

    Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization

    arXiv:2605.24192v1 Announce Type: cross Abstract: The neural-network denoising functions which form the backbone of image diffusion models are remarkably consistent in their generalization behaviour across a wide variety of network architectures and training procedure hyperparame…

  171. arXiv cs.AI TIER_1 English(EN) · Shaorong Zhang, Rob Brekelmans, Greg Ver Steeg ·

    Local MAP Sampling for Diffusion Models

    arXiv:2510.07343v3 Announce Type: replace-cross Abstract: Diffusion Posterior Sampling (DPS) provides a principled Bayesian approach to inverse problems by sampling from $p(x_0 \mid y)$. While posterior sampling is valuable for capturing uncertainty and multi-modality, many class…

  172. arXiv cs.CL TIER_1 English(EN) · Ke Lin, Yiyang Luo, Zhaolong Su, Yunya Song, Anyi Rao ·

    Triplet-Block Diffusion RWKV

    arXiv:2605.25969v1 Announce Type: new Abstract: Causal Transformer language models suffer from strictly sequential decoding and a quadratic per-step attention cost. While linear-time causal models and discrete diffusion models each address these weaknesses, their integration rema…

  173. arXiv cs.LG TIER_1 English(EN) · Zichao Yue, Zhiru Zhang ·

    Revisiting Pre-Propagation GNNs: Robust Diffusion Operators and Hidden-State Re-Propagation

    arXiv:2605.25111v1 Announce Type: new Abstract: Pre-propagation graph neural networks (PPGNNs) decouple node feature propagation from transformation: graph diffusion is performed once as preprocessing, and training reduces to dense per-node transformations. This design enables mi…

  174. arXiv cs.LG TIER_1 English(EN) · Yanbo Xu, Yu Wu, Sungjae Park, Zhizhuo Zhou, Shubham Tulsiani ·

    Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models

    arXiv:2510.01184v2 Announce Type: replace Abstract: We present a mechanism to steer the sampling diversity of denoising diffusion and flow matching models, allowing users to sample from a sharper or broader distribution than the training distribution. We build on the observation …

  175. arXiv cs.LG TIER_1 English(EN) · Jaeyeon Kim, Seunggeun Kim, Taekyun Lee, David Z. Pan, Hyeji Kim, Sham Kakade, Sitan Chen ·

    Fine-Tuning Masked Diffusion for Provable Self-Correction

    arXiv:2510.01384v4 Announce Type: replace Abstract: A natural desideratum for generative models is self-correction--detecting and revising low-quality tokens at inference. While Masked Diffusion Models (MDMs) have emerged as a promising approach for generative modeling in discret…

  176. arXiv cs.LG TIER_1 English(EN) · Nishanth Shetty, Madhava Prasath, Chandra Sekhar Seelamantula ·

    Dale meets Langevin: A Multiplicative Denoising Diffusion Model

    arXiv:2510.02730v2 Announce Type: replace Abstract: Exponentiated gradient descent (EGD), a biologically motivated optimisation algorithm that respects Dale's law, produces log-normally distributed synaptic weights at convergence, in alignment with experimental observations in ne…

  177. arXiv cs.LG TIER_1 English(EN) · Sungwon Park, Anthony Zhou, Hongjoong Kim, Amir Barati Farimani ·

    Generative Neural Operators through Diffusion Last Layer

    arXiv:2602.04139v2 Announce Type: replace Abstract: Neural operators provide a powerful framework for learning discretization invariant mappings between function spaces, but standard deterministic models do not capture predictive uncertainty. We introduce diffusion last layer (DL…

  178. Hugging Face Daily Papers TIER_1 English(EN) ·

    Triadic Dynamics Aware Diffusion Posterior Sampling for Inverse Problems: Optimizing Guidance and Stochasticity Schedules

    Generative posterior sampling using diffusion models has emerged as a dominant paradigm for solving inverse problems in imaging, which usually consists of three main components: data consistency (DC) guidance, classifier-free guidance (CFG) and stochasticity. While prior arts hav…

  179. Hugging Face Daily Papers TIER_1 English(EN) ·

    Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective

    The symmetric and skew-symmetric components of transformer attention matrices are analyzed as governing energy landscape structure and circulation dynamics, respectively, with implications for generation trade-offs.

  180. Hugging Face Daily Papers TIER_1 English(EN) ·

    JLT: Clean-Latent Prediction in Latent Diffusion Transformers

    Latent diffusion models using clean-data prediction outperform velocity prediction in compressed representations, demonstrating that prediction targets are geometrically dependent rather than algebraically interchangeable.

  181. Hugging Face Daily Papers TIER_1 English(EN) ·

    RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

    Diffusion Transformers achieve strong image generation performance but face high inference costs; this work proposes RT-Lynx, which uses activation sparsification and optimized CUDA kernels to accelerate inference while maintaining generation quality.

  182. Hugging Face Daily Papers TIER_1 English(EN) ·

    Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

    Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introduce $\textbf{SKILD}$, a $\textbf{S}$cale-invariant…

  183. arXiv cs.AI TIER_1 English(EN) · Marin Soljačić ·

    Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

    Creating images from noise is image generation; reconstructing fine details from coarse inputs is super-resolution. Despite their practical differences, both can be understood as reversing information loss across scales. We introduce $\textbf{SKILD}$, a $\textbf{S}$cale-invariant…

  184. arXiv cs.CL TIER_1 English(EN) · Anyi Rao ·

    Triplet-Block Diffusion RWKV

    Causal Transformer language models suffer from strictly sequential decoding and a quadratic per-step attention cost. While linear-time causal models and discrete diffusion models each address these weaknesses, their integration remains inherently inconsistent: diffusion requires …

  185. arXiv cs.AI TIER_1 English(EN) · Jintai Chen ·

    Don't Retrain, Just Reuse: Recovering Dual-Target Molecules from Single-Target Diffusion Models

    Designing a single molecule that modulates two targets is a promising strategy for polypharmacology, but it remains substantially harder than standard single-target generation because one candidate must satisfy two binding requirements while preserving drug-likeness and synthesiz…

  186. arXiv cs.LG TIER_1 English(EN) · Jaihoon Kim, Taehoon Yoon, Prin Phunyaphibarn, Seungjun Kim, Morteza Mardani, Minhyuk Sung ·

    Contrastive Distribution Matching for Amortized Sequential Monte Carlo in Discrete Diffusion

    arXiv:2605.23346v1 Announce Type: new Abstract: Discrete diffusion models have emerged as powerful frameworks for generating structured categorical data. However, efficiently sampling from reward-tilted distributions remains a fundamental challenge. While Twisted Sequential Monte…

  187. arXiv cs.CL TIER_1 English(EN) · Kaisen Yang, Jayden Teoh, Kaicheng Yang, Yitong Zhang, Alex Lamb ·

    Improving Sampling for Masked Diffusion Models via Information Gain

    arXiv:2602.18176v3 Announce Type: replace Abstract: Masked Diffusion Models (MDMs) enable flexible decoding orders, yet existing samplers remain largely greedy, selecting locally certain tokens without accounting for their downstream effects. We show that this myopia can increase…

  188. arXiv cs.LG TIER_1 English(EN) · Benjamin Rozonoyer, Jacopo Minniti, Dhruvesh Patel, Neil Band, Avishek Joey Bose, Tim G. J. Rudner, Andrew McCallum ·

    Learned Relay Representations for Forward-Thinking Discrete Diffusion Models

    arXiv:2605.22967v1 Announce Type: new Abstract: When Masked Diffusion Models (MDMs) generate sequences through iterative refinement, the rich internal computation over masked positions is discarded, forcing every subsequent refinement step to recompute the valuable internal infor…

  189. arXiv cs.LG TIER_1 English(EN) · Egor Lifar, Semyon Savkin, Timur Garipov, Shangyuan Tong, Tommi Jaakkola ·

    Diffusion Domain Expansion: Learning to Coordinate Pre-trained Diffusion Models

    arXiv:2605.23275v1 Announce Type: new Abstract: In this paper, we propose Diffusion Domain Expansion (DDE), a method that efficiently extends pre-trained diffusion models to generate larger objects and handle more complex conditioning beyond their original capabilities. Our metho…

  190. Hugging Face Daily Papers TIER_1 English(EN) ·

    Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

    SKILD is a scale-invariant k-space image learning diffusion model that unifies image generation and continuous super-resolution through a single unconditional framework by leveraging scale invariance in image content and physics systems.

  191. Hugging Face Daily Papers TIER_1 English(EN) ·

    Triplet-Block Diffusion RWKV

    B³D-RWKV combines diffusion and RWKV architectures to achieve parallel, bidirectional processing with improved decoding speed while maintaining competitive accuracy.

  192. Hugging Face Daily Papers TIER_1 English(EN) ·

    Injecting Image Guidance into Text-Conditioned Diffusion Models at Inference

    Visual Concept Fusion enables dual text and image conditioning in diffusion models through feature alignment and fusion strategies without requiring retraining.

  193. arXiv cs.LG TIER_1 English(EN) · Mengni Jia, Mengyu Zhou, Yihao Liu, Xiaoxi Jiang, Guanjun Jiang ·

    Bringing Stability to Diffusion: Decomposing and Reducing Variance of Training Masked Diffusion Models

    arXiv:2511.18159v2 Announce Type: replace Abstract: Masked diffusion models (MDMs) are a promising alternative to autoregressive models (ARMs), but they suffer from inherently much higher training variance. High variance leads to noisier gradient estimates and unstable optimizati…

  194. arXiv cs.CL TIER_1 English(EN) · Chunsan Hong, Sanghyun Lee, Jong Chul Ye ·

    Unifying Masked Diffusion Models with Various Generation Orders and Beyond

    arXiv:2602.02112v2 Announce Type: replace-cross Abstract: Masked diffusion models (MDMs) are a potential alternative to autoregressive models (ARMs) for language generation, but generation quality depends critically on the generation order. Prior work either hard-codes an orderin…

  195. arXiv cs.LG TIER_1 English(EN) · Jack Kendall ·

    Equilibrium Propagation and Hamiltonian Inference in the Diffusive Fitzhugh-Nagumo Model

    arXiv:2605.21568v1 Announce Type: new Abstract: In this work, we extend the Equilibrium Propagation framework to skew-gradient systems and show an equivalence between deep Energy-Based Models and Hamiltonian neural networks. We focus on networks of diffusively coupled Fitzhugh-Na…

  196. arXiv cs.LG TIER_1 English(EN) · Seo Taek Kong, Weina Wang, R. Srikant ·

    Noise Schedule Design for Diffusion Models: An Optimal Control Perspective

    arXiv:2605.21911v1 Announce Type: new Abstract: We develop a principled framework for analyzing and designing noise schedules in diffusion models. We show that one can recast this design problem as an optimal control problem, whose state is the Fisher information of the diffusion…

  197. arXiv cs.LG TIER_1 English(EN) · Samuel Koovely, Alexandre Bovet ·

    Conditional Entropy of Heat Diffusion on Temporal Networks

    arXiv:2605.21514v1 Announce Type: cross Abstract: Many complex systems can be modeled by temporal networks, whose organization often evolves through distinct structural phases. Detecting the change points that delimit these phases is both important and challenging. In this work, …

  198. arXiv cs.AI TIER_1 English(EN) · Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau, Marylou Gabri\'e ·

    The critical slowing down in diffusion models

    arXiv:2605.12597v2 Announce Type: replace-cross Abstract: Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theor…

  199. arXiv cs.AI TIER_1 English(EN) · Chenyang An, Xiaoqian Xu ·

    Lower Bounds for Advection-Diffusion Equations: An Exploration with AI-Generated Proofs

    arXiv:2605.20623v1 Announce Type: cross Abstract: We establish explicit lower bounds for advection-diffusion equations in three settings: a polynomial $\dot H^{-1}$ bound for inviscid shears with $u\in L^\infty_t W^{1,1}_y$, a uniform positive lower bound on the mixing scale for …

  200. arXiv cs.AI TIER_1 English(EN) · Wei Huang, Andi Han, Mingyuan Bai, Huanjian Zhou, Qixin Zhang, Taiji Suzuki, Kenji Fukumizu ·

    Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine

    arXiv:2605.20235v1 Announce Type: cross Abstract: Diffusion models generate high-dimensional data with remarkable quality, yet how their training efficiently learns the score function, bypassing the curse of dimensionality when data is supported on low-dimensional manifolds, rema…

  201. arXiv cs.CL TIER_1 English(EN) · Jiayi Fu, Yuxia Wang ·

    A Tutorial on Diffusion Theory: From Differential Equations to Diffusion Models

    arXiv:2605.22586v1 Announce Type: cross Abstract: This tutorial develops diffusion models from the viewpoint of differential equations. We begin with the conditional Gaussian forward process and show that this path admits both an ordinary differential equation (ODE) representatio…

  202. Hugging Face Daily Papers TIER_1 English(EN) ·

    Contrastive Distribution Matching for Amortized Sequential Monte Carlo in Discrete Diffusion

    Contrastive Distribution Matching addresses efficient sampling from reward-tilted distributions in discrete diffusion models through learned twist functions that reduce computational overhead while maintaining accuracy across diverse applications.

  203. arXiv cs.CL TIER_1 English(EN) · Yuxia Wang ·

    A Tutorial on Diffusion Theory: From Differential Equations to Diffusion Models

    This tutorial develops diffusion models from the viewpoint of differential equations. We begin with the conditional Gaussian forward process and show that this path admits both an ordinary differential equation (ODE) representation and a stochastic differential equation (SDE) rep…

  204. Hugging Face Daily Papers TIER_1 English(EN) ·

    Rethinking Token Reduction for Diffusion Models via Output-Similarity-Awareness

    Diffusion Transformers (DiTs) achieve superior image generation quality but suffer from quadratic computational complexity relative to token count. While various token reduction (TR) methods have been proposed to mitigate this cost, they overlook the primary objective of generati…

  205. Hugging Face Daily Papers TIER_1 English(EN) ·

    Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation

    Discrete diffusion models are often trained through clean-data prediction, but the prediction can be used in different ways to define the reverse dynamics. In Masked Diffusion Models (MDM) these choices largely coincide, whereas in Uniform Diffusion Models (UDM) they do not. We s…

  206. arXiv cs.AI TIER_1 English(EN) · Xiaoqian Xu ·

    Lower Bounds for Advection-Diffusion Equations: An Exploration with AI-Generated Proofs

    We establish explicit lower bounds for advection-diffusion equations in three settings: a polynomial $\dot H^{-1}$ bound for inviscid shears with $u\in L^\infty_t W^{1,1}_y$, a uniform positive lower bound on the mixing scale for diffusive shears, and an exponential $L^2$ bound f…

  207. Hugging Face Daily Papers TIER_1 English(EN) ·

    On the Limits of Latent Reuse in Diffusion Models

    Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a source-target setting in which both datasets are app…

  208. arXiv cs.LG TIER_1 English(EN) · Lu Yu ·

    On the Limits of Latent Reuse in Diffusion Models

    Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a source-target setting in which both datasets are app…

  209. arXiv cs.CL TIER_1 English(EN) · Yo-Sub Han ·

    Adaptive Steering and Remasking for Safe Generation in Diffusion Language Models

    Diffusion Language Models (DLMs) provide a promising alternative to autoregressive language models by generating text through iterative denoising and bidirectional refinement. However, this iterative generation paradigm also introduces unique safety vulnerabilities when harmful t…

  210. arXiv cs.CL TIER_1 English(EN) · Jong Chul Ye ·

    Understanding and Accelerating the Training of Masked Diffusion Language Models

    Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models (ARMs) for language modeling. However, MDMs are known to learn substantially more slowly than ARMs, which may become problematic when scaling MDMs to larger models. Therefore, we ask t…

  211. arXiv cs.AI TIER_1 English(EN) · Shubhankar Mohapatra ·

    DriftXpress: Faster Drifting Models via Projected RKHS Fields

    Drifting Models have emerged as a new paradigm for one-step generative modeling, achieving strong image quality without iterative inference. The premise is to replace the iterative denoising process in diffusion models with a single evaluation of a generator. However, this create…

  212. arXiv cs.CL TIER_1 English(EN) · Haoliang Li ·

    Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models

    Diffusion Language Models (DLMs) have recently emerged as a promising alternative to autoregressive language models, offering stronger global awareness and highly parallel generation. However, post-training DLMs with standard Negative Evidence Lower Bound (NELBO)-based supervised…

  213. arXiv cs.LG TIER_1 English(EN) · Rui Yu ·

    Elucidating Representation Degradation Problem in Diffusion Model Training

    Diffusion models have achieved remarkable success, yet their training remains inefficient due to a severe optimization bottleneck, which we term Representation Degradation. As noise levels increase, the outputs of the trained model exhibit progressive structural distortion, which…

  214. Hugging Face Daily Papers TIER_1 Deutsch(DE) ·

    Kernel Gradient Drifting Models

    We propose kernel-gradient drifting, a one-step generative modeling framework that replaces the fixed Euclidean displacement direction in drifting models with directions induced by the kernel itself. Standard drifting is attractive because it enables fast, high-quality generation…

  215. arXiv cs.LG TIER_1 Deutsch(DE) · Floor Eijkelboom ·

    Kernel Gradient Drifting Models

    We propose kernel-gradient drifting, a one-step generative modeling framework that replaces the fixed Euclidean displacement direction in drifting models with directions induced by the kernel itself. Standard drifting is attractive because it enables fast, high-quality generation…

  216. arXiv cs.LG TIER_1 English(EN) · Aaron R. Dinner ·

    Composing diffusion priors with explicit physical context via generative Gibbs sampling

    Pretrained diffusion models provide powerful learned priors, but in scientific sampling the target distribution often depends on physical context that is not fully represented by one generative model. We introduce Generative Gibbs for Physics-Aware Sampling (GG-PA), a training-fr…

  217. arXiv cs.CL TIER_1 English(EN) · Difan Zou ·

    Relative Score Policy Optimization for Diffusion Language Models

    Diffusion large language models (dLLMs) offer a promising route to parallel and efficient text generation, but improving their reasoning ability requires effective post-training. Reinforcement learning with verifiable rewards (RLVR) is a natural choice for this purpose, yet its a…

  218. arXiv cs.AI TIER_1 English(EN) · Andrea M. Tonello ·

    Empty SPACE: Cross-Attention Sparsity for Concept Erasure in Diffusion Models

    Erasing specific concepts from text-to-image diffusion models is essential for avoiding the generation of copyrighted and explicit content. Closed-form concept erasure methods offer a fast alternative to backpropagation-based techniques, but they become less effective when scalin…

  219. arXiv cs.CL TIER_1 English(EN) · Hongsheng Li ·

    Edit-Based Refinement for Parallel Masked Diffusion Language Models

    Masked diffusion language models enable parallel token generation and offer improved decoding efficiency over autoregressive models. However, their performance degrades significantly when generating multiple tokens simultaneously, due to a mismatch between token-level training ob…

  220. arXiv cs.LG TIER_1 English(EN) · Erhan Bayraktar ·

    When Diffusion Model Can Ignore Dimension: An Entropy-Based Theory

    Diffusion models perform remarkably well on high-dimensional data such as images, often using only a modest number of reverse-time steps. Despite this practical success, existing convergence theory does not fully explain why such samplers remain efficient in high dimensions. Many…

  221. arXiv cs.CL TIER_1 English(EN) · Dmitry Vetrov ·

    How to Train Your Latent Diffusion Language Model Jointly With the Latent Space

    Latent diffusion models offer an attractive alternative to discrete diffusion for non-autoregressive text generation by operating on continuous text representations and denoising entire sequences in parallel. The major challenge in latent diffusion modeling is constructing a suit…

  222. arXiv cs.CL TIER_1 English(EN) · Tim Van de Cruys ·

    Guidance Is Not a Hyperparameter: Learning Dynamic Control in Diffusion Language Models

    Classifier-Free Guidance (CFG) is a widely used mechanism for controlling diffusion-based generative models, yet its guidance scale is typically treated as a fixed hyperparameter throughout generation. This static design yields a suboptimal controllability and quality tradeoff, a…

  223. arXiv cs.LG TIER_1 English(EN) · Chun Kai Ling ·

    Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

    Inference-time controllable generation is essential for real-world applications of unconditional diffusion models. However, most existing techniques focus on individual samples, struggling in applications that require the sample population to follow specific attribute distributio…

  224. arXiv cs.LG TIER_1 English(EN) · Benjamin Sterling, Yousef El-Laham, M\'onica F. Bugallo ·

    Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics

    arXiv:2509.14225v3 Announce Type: replace Abstract: Recent advances in generative artificial intelligence applications have raised new data security concerns. This paper focuses on defending diffusion models against membership inference attacks. This type of attack occurs when th…

  225. arXiv cs.LG TIER_1 English(EN) · Canyu Zhao, Hao Chen, Yunze Tong, Yu Qiao, Jiacheng Li, Chunhua Shen ·

    MARBLE: Multi-Aspect Reward Balance for Diffusion RL

    arXiv:2605.06507v1 Announce Type: cross Abstract: Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task, and multiple evaluation criteria need t…

  226. arXiv cs.LG TIER_1 English(EN) · Flavio Nicoletti, Chenxiao Ma, Enrico Ventura, Luca Saglietti, Stefano Sarao Mannelli ·

    The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

    arXiv:2605.06367v1 Announce Type: cross Abstract: Real-world datasets are inherently heterogeneous, yet how per-class structural differences and sampling imbalance shape the training dynamics of diffusion models-and potentially exacerbate disparities-remains poorly understood. Wh…

  227. arXiv cs.LG TIER_1 English(EN) · Meira Iske, Carola-Bibiane Sch\"onlieb ·

    Expressivity of Bi-Lipschitz Normalizing Flows: A Score-Based Diffusion Perspective

    arXiv:2605.06172v1 Announce Type: cross Abstract: Many normalizing flow architectures impose regularity constraints, yet their distributional approximation properties are not fully characterized. We study the expressivity of bi-Lipschitz normalizing flows through the lens of scor…

  228. arXiv cs.LG TIER_1 English(EN) · Sankarshana Venugopal (Seoul National University), Mohammad Mostafavi (Seoul National University), Jonghyun Choi (Seoul National University) ·

    DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation

    arXiv:2605.05889v1 Announce Type: cross Abstract: Diffusion-based image-to-image (I2I) translation excels in high-fidelity generation but suffers from slow sampling in state-of-the-art Diffusion Bridge Models (DBMs), often requiring dozens of function evaluations (NFEs). We intro…

  229. arXiv cs.LG TIER_1 English(EN) · Gal Vinograd, Idan Achituve, Ethan Fetaya ·

    Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

    arXiv:2605.06553v1 Announce Type: new Abstract: We present EDDY (Exact-marginal Diversification via Divergence-free dYnamics), a guidance mechanism for diffusion and flow matching models that promotes diversity among samples generated while maintaining quality. EDDY exploits symm…

  230. arXiv cs.LG TIER_1 English(EN) · Matias G. Delgadino, Sebastien Motsch, Advait Parulekar, William Porteous, Sanjay Shakkottai ·

    Diffusion-Based Posterior Sampling: A Feynman-Kac Analysis of Bias and Stability

    arXiv:2605.06538v1 Announce Type: new Abstract: Diffusion-based posterior samplers use pretrained diffusion priors to sample from measurement- or reward-conditioned posteriors, and are widely used for inverse problems. Yet their theoretical behavior remains poorly understood: eve…

  231. arXiv cs.LG TIER_1 English(EN) · Alexander Conzelmann, Albert Catalan-Tatjer, Shiwei Liu ·

    Layer Collapse in Diffusion Language Models

    arXiv:2605.06366v1 Announce Type: new Abstract: Diffusion language models (DLMs) have recently emerged as competitive alternatives to autoregressive (AR) language models, yet differences in their activation dynamics remain poorly understood. We characterize these dynamics in LLaD…

  232. arXiv cs.LG TIER_1 English(EN) · Eugenio Lomurno, Filippo Balzarini, Francesco Benelle, Francesca Pia Panaccione, Matteo Matteucci ·

    Inference-Time Refinement Closes the Synthetic-Real Gap in Tabular Diffusion

    arXiv:2605.06261v1 Announce Type: new Abstract: Diffusion-based generators set the current state of the art for synthetic tabular data. These methods approach but rarely exceed real-data utility, and closing this synthetic-real gap has so far been pursued exclusively at training …

  233. arXiv cs.LG TIER_1 English(EN) · Pengqi Lu ·

    Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

    arXiv:2605.06169v1 Announce Type: new Abstract: Scaling Diffusion Transformers (DiTs) to hundreds of layers introduces a structural vulnerability: networks can enter a silent, mean-dominated collapse state that homogenizes token representations and suppresses centered variation. …

  234. arXiv cs.LG TIER_1 English(EN) · Pierre Marion, Yu-Han Wu ·

    Understanding diffusion models requires rethinking (again) generalization

    arXiv:2605.06077v1 Announce Type: new Abstract: This position paper argues that understanding generalization in diffusion models requires fundamentally new theoretical frameworks that go beyond both classical statistical learning theory and the benign overfitting paradigm develop…

  235. arXiv cs.LG TIER_1 English(EN) · Ahmad Aghapour, Erhan Bayraktar, Asaf Cohen ·

    Conditional Diffusion Under Linear Constraints: Langevin Mixing and Information-Theoretic Guarantees

    arXiv:2605.05387v1 Announce Type: new Abstract: We study zero-shot conditional sampling with pretrained diffusion models for linear inverse problems, including inpainting and super-resolution. In these problems, the observation determines only part of the unknown signal. The rema…

  236. arXiv cs.CL TIER_1 Français(FR) · Hongcan Guo, Qinyu Zhao, Yian Zhao, Shen Nie, Rui Zhu, Qiushan Guo, Feng Wang, Tao Yang, Hengshuang Zhao, Guoqiang Wei, Yan Zeng ·

    Continuous Latent Diffusion Language Model

    arXiv:2605.06548v1 Announce Type: new Abstract: Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve gene…

  237. arXiv cs.LG TIER_1 English(EN) · Tongda Xu, Mingwei He, Shady Abu-Hussein, Jose Miguel Hernandez-Lobato, Chunhang Zheng, Kai Zhao, Chao Zhou, Ya-Qin Zhang, Yan Wang ·

    Making Reconstruction FID Predictive of Diffusion Generation FID

    arXiv:2603.05630v2 Announce Type: replace-cross Abstract: It is well known that the reconstruction FID (rFID) of a VAE is poorly correlated with the generation FID (gFID) of a latent diffusion model. We propose interpolated FID (iFID), a simple variant of rFID that exhibits a str…

  238. arXiv cs.LG TIER_1 English(EN) · Manyi Li, Yufan Liu, Lai Jiang, Bing Li, Yuming Li, Weiming Hu ·

    The Illusion of Forgetting: Attack Unlearned Diffusion via Initial Latent Variable Optimization

    arXiv:2602.00175v2 Announce Type: replace Abstract: Text-to-image diffusion models (DMs) are frequently abused to produce harmful or copyrighted content, violating public interests. Concept erasure (unlearning) is a promising paradigm to alleviate this issue. However, there exist…

  239. arXiv cs.LG TIER_1 English(EN) · Ethan Fetaya ·

    Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

    We present EDDY (Exact-marginal Diversification via Divergence-free dYnamics), a guidance mechanism for diffusion and flow matching models that promotes diversity among samples generated while maintaining quality. EDDY exploits symmetries of the Fokker-Planck equation, using drif…

  240. arXiv cs.LG TIER_1 English(EN) · Sanjay Shakkottai ·

    Diffusion-Based Posterior Sampling: A Feynman-Kac Analysis of Bias and Stability

    Diffusion-based posterior samplers use pretrained diffusion priors to sample from measurement- or reward-conditioned posteriors, and are widely used for inverse problems. Yet their theoretical behavior remains poorly understood: even with exact prior scores, their outputs are bia…

  241. arXiv cs.LG TIER_1 English(EN) · Arthur Gretton, Li Kevin Wenliang, Alexandre Galashov, James Thornton, Valentin De Bortoli, Arnaud Doucet ·

    On the Wasserstein Gradient Flow Interpretation of Drifting Models

    arXiv:2605.05118v1 Announce Type: new Abstract: Recently, Deng et al. (2026) proposed Generative Modeling via Drifting (GMD), a novel framework for generative tasks. This note presents an analysis of GMD through the lens of Wasserstein Gradient Flows (WGF), i.e., the path of stee…

  242. arXiv cs.LG TIER_1 English(EN) · Christopher Nemeth ·

    Hypergraph Generation via Structured Stochastic Diffusion

    arXiv:2605.05024v1 Announce Type: cross Abstract: Hypergraphs model higher-order interactions, but realistic hypergraph generation remains difficult because incidence, hyperedge-size heterogeneity, and overlap structure are not faithfully captured by pairwise reductions. We propo…

  243. arXiv cs.LG TIER_1 Deutsch(DE) · Xiaoyu Wu, Yifei Wang, Tsu-Jui Fu, Liang-Chieh Chen, Zhe Gan, Chen Wei ·

    Taming Outlier Tokens in Diffusion Transformers

    arXiv:2605.05206v1 Announce Type: cross Abstract: We study outlier tokens in Diffusion Transformers (DiTs) for image generation. Prior work has shown that Vision Transformers (ViTs) can produce a small number of high-norm tokens that attract disproportionate attention while carry…

  244. arXiv cs.LG TIER_1 English(EN) · Kaiwen Zheng, Yuji Wang, Qianli Ma, Huayu Chen, Jintao Zhang, Yogesh Balaji, Jianfei Chen, Ming-Yu Liu, Jun Zhu, Qinsheng Zhang ·

    Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

    arXiv:2510.08431v3 Announce Type: replace-cross Abstract: Although continuous-time consistency models (e.g., sCM, MeanFlow) are theoretically principled and empirically powerful for fast academic-scale diffusion, its applicability to large-scale text-to-image and video tasks rema…

  245. arXiv cs.LG TIER_1 English(EN) · Riccardo de Lutio, Tobias Fischer, Yen-Yu Chang, Yuxuan Zhang, Jay Zhangjie Wu, Xuanchi Ren, Tianchang Shen, Katarina Tothova, Zan Gojcic, Haithem Turki ·

    ArtiFixer: Enhancing and Extending 3D Reconstruction with Auto-Regressive Diffusion Models

    arXiv:2603.00492v2 Announce Type: replace-cross Abstract: Per-scene optimization methods such as 3D Gaussian Splatting provide state-of-the-art novel view synthesis quality but extrapolate poorly to under-observed areas. Methods that leverage generative priors to correct artifact…

  246. arXiv cs.LG TIER_1 English(EN) · Adrien Jacquet Cr\'etides, Mouad Abrini, Hamed Rahimi, Mohamed Chetouani ·

    Encoding Predictability and Legibility for Style-Conditioned Diffusion Policy

    arXiv:2603.16368v2 Announce Type: replace-cross Abstract: Striking a balance between efficiency and transparent motion is a core challenge in human-robot collaboration, as highly expressive movements often incur unnecessary time and energy costs. In collaborative environments, le…

  247. arXiv cs.LG TIER_1 English(EN) · Michael Rottoli, Subhankar Roy, Stefano Paraboschi ·

    Predict-then-Diffuse: Adaptive Response Length for Compute-Budgeted Inference in Diffusion LLMs

    arXiv:2605.04215v1 Announce Type: new Abstract: Diffusion-based Large Language Models (D-LLMs) represent a promising frontier in generative AI, offering fully parallel token generation that can lead to significant throughput advantages and superior GPU utilization over traditiona…

  248. arXiv cs.LG TIER_1 English(EN) · Francesca Romana Crucinio ·

    A note on the unique properties of the Kullback--Leibler divergence for sampling via gradient flows

    arXiv:2507.04330v2 Announce Type: replace-cross Abstract: We consider the problem of sampling from a probability distribution $\pi$ which admits a density w.r.t. a dominating measure. It is well known that this can be written as an optimisation problem over the space of probabili…

  249. arXiv cs.LG TIER_1 English(EN) · Eitan Kosman, Gabriele Serussi, Chaim Basking ·

    Structured Diffusion Bridges: Inductive Bias for Denoising Diffusion Bridges

    arXiv:2605.02973v1 Announce Type: new Abstract: Modality translation is inherently under-constrained, as multiple cross-modal mappings may yield the same marginals. Recent work has shown that diffusion bridges are effective for this task. However, most existing approaches rely on…

  250. arXiv cs.LG TIER_1 English(EN) · James Rowbottom, Elizabeth L. Baker, Nick Huang, Ben Adcock, Carola-Bibiane Sch\"onlieb, Alexander Denker ·

    GRIFDIR: Graph Resolution-Invariant FEM Diffusion Models in Function Spaces over Irregular Domains

    arXiv:2605.03497v1 Announce Type: new Abstract: Score-based diffusion models in infinite-dimensional function spaces provide a mathematically principled framework for modelling function-valued data, offering key advantages such as resolution invariance and the ability to handle i…

  251. arXiv cs.LG TIER_1 English(EN) · Andreas Makris, Paul Fearnhead, Chris Nemeth ·

    Tempered Guided Diffusion

    arXiv:2605.03712v1 Announce Type: cross Abstract: Training-free conditional diffusion provides a flexible alternative to task-specific conditional model training, but existing samplers often allocate computation inefficiently: independent guided trajectories can vary widely in qu…

  252. arXiv cs.LG TIER_1 English(EN) · Francisco M. Castro-Mac\'ias, Pablo Morales-\'Alvarez, Saifuddin Syed, Daniel Hern\'andez-Lobato, Rafael Molina, Jos\'e Miguel Hern\'andez-Lobato ·

    Conditional Diffusion Sampling

    arXiv:2605.04013v1 Announce Type: cross Abstract: Sampling from unnormalized multimodal distributions with limited density evaluations remains a fundamental challenge in machine learning and natural sciences. Successful approaches construct a bridge between a tractable reference …

  253. arXiv cs.LG TIER_1 English(EN) · Alexandre Alouadi, Pierre Henry-Labord\`ere, Gr\'egoire Loeper, Othmane Mazhar, Huy\^en Pham, Nizar Touzi ·

    LightSBB-M: Bridging Schr\"odinger and Bass for Generative Diffusion Modeling

    arXiv:2601.19312v2 Announce Type: replace Abstract: The Schrodinger Bridge and Bass (SBB) formulation, which jointly controls drift and volatility, is an established extension of the classical Schrodinger Bridge (SB). Building on this framework, we introduce LightSBB-M, an algori…

  254. arXiv cs.LG TIER_1 English(EN) · Aaron Havens, Brian Karrer, Neta Shaul ·

    Flow Sampling: Learning to Sample from Unnormalized Densities via Denoising Conditional Processes

    arXiv:2605.03984v1 Announce Type: new Abstract: Sampling from unnormalized densities is analogous to the generative modeling problem, but the target distribution is defined by a known energy function instead of data samples. Because evaluating the energy function is often costly,…

  255. arXiv cs.LG TIER_1 English(EN) · José Miguel Hernández-Lobato ·

    Conditional Diffusion Sampling

    Sampling from unnormalized multimodal distributions with limited density evaluations remains a fundamental challenge in machine learning and natural sciences. Successful approaches construct a bridge between a tractable reference and the target distribution. Parallel Tempering (P…

  256. arXiv cs.AI TIER_1 English(EN) · Neta Shaul ·

    Flow Sampling: Learning to Sample from Unnormalized Densities via Denoising Conditional Processes

    Sampling from unnormalized densities is analogous to the generative modeling problem, but the target distribution is defined by a known energy function instead of data samples. Because evaluating the energy function is often costly, a primary challenge is to learn an efficient sa…

  257. arXiv cs.AI TIER_1 English(EN) · Min Zhang ·

    DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models

    Dataset distillation enables efficient training by distilling the information of large-scale datasets into significantly smaller synthetic datasets. Diffusion based paradigms have emerged in recent years, offering novel perspectives for dataset distillation. However, they typical…

  258. arXiv cs.LG TIER_1 English(EN) · Chris Nemeth ·

    Tempered Guided Diffusion

    Training-free conditional diffusion provides a flexible alternative to task-specific conditional model training, but existing samplers often allocate computation inefficiently: independent guided trajectories can vary widely in quality, and additional function evaluations along a…

  259. arXiv cs.LG TIER_1 English(EN) · Alexander Denker ·

    GRIFDIR: Graph Resolution-Invariant FEM Diffusion Models in Function Spaces over Irregular Domains

    Score-based diffusion models in infinite-dimensional function spaces provide a mathematically principled framework for modelling function-valued data, offering key advantages such as resolution invariance and the ability to handle irregular discretisations. However, practical imp…

  260. arXiv cs.LG TIER_1 English(EN) · Tongzhen Dang, Weiyang Ding, Michael K. Ng ·

    Complex Diffusion Maps with $\omega$-Parameterized Kernels Revealing Inherent Harmonic Representations

    arXiv:2605.01691v1 Announce Type: new Abstract: In this paper, we propose Complex Diffusion Maps (CDM), a novel diffusion mapping framework that aims to reveal the dominant complex harmonics of high-dimensional data. Inspired by the local Gaussian kernel relevant to the heat equa…

  261. arXiv cs.LG TIER_1 English(EN) · Phil Sidney Ostheimer, Mayank Nagda, Andriy Balinskyy, Gabriel Vicente Rodrigues, Jean Radig, Carl Herrmann, Stephan Mandt, Marius Kloft, Sophie Fellenz ·

    Skipping the Zeros in Diffusion Models for Sparse Data Generation

    arXiv:2605.01817v1 Announce Type: new Abstract: Diffusion models (DMs) excel on dense continuous data, but are not designed for sparse continuous data. They do not model exact zeros that represent the deliberate absence of a signal. As a result, they erase sparsity patterns and p…

  262. arXiv cs.LG TIER_1 English(EN) · Carles Domingo-Enrich, Yuanqi Du, Michael S. Albergo ·

    A unified perspective on fine-tuning and sampling with diffusion and flow models

    arXiv:2605.00229v1 Announce Type: cross Abstract: We study the problem of training diffusion and flow generative models to sample from target distributions defined by an exponential tilting of a base density; a formulation that subsumes both sampling from unnormalized densities a…

  263. arXiv cs.LG TIER_1 Deutsch(DE) · Hasan Amin, Yuan Gao, Yaser Souri, Subhojit Som, Ming Yin, Rajiv Khanna, Xia Song ·

    Consistent Diffusion Language Models

    arXiv:2605.00161v1 Announce Type: new Abstract: Diffusion language models (DLMs) are an attractive alternative to autoregressive models because they promise sublinear-time, parallel generation, yet practical gains remain elusive as high-quality samples still demand hundreds of re…

  264. arXiv cs.LG TIER_1 English(EN) · Zihan Zhou, Chenguang Wang, Hongyi Ye, Yongtao Guan, Tianshu Yu ·

    Incomplete Data, Complete Dynamics: A Diffusion Approach

    arXiv:2509.20098v2 Announce Type: replace Abstract: Learning physical dynamics from data is a fundamental challenge in machine learning and scientific modeling. Real-world observational data are inherently incomplete and irregularly sampled, posing significant challenges for exis…

  265. arXiv cs.AI TIER_1 English(EN) · Yonggan Fu, Lexington Whalen, Zhifan Ye, Xin Dong, Shizhe Diao, Jingyu Liu, Chengyue Wu, Hao Zhang, Enze Xie, Song Han, Maksim Khadkevich, Jan Kautz, Yingyan Celine Lin, Pavlo Molchanov ·

    Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

    arXiv:2512.14067v2 Announce Type: replace-cross Abstract: Diffusion language models (dLMs) have emerged as a promising paradigm that enables parallel, non-autoregressive generation, but their learning efficiency lags behind that of autoregressive (AR) language models when trained…

  266. arXiv cs.AI TIER_1 English(EN) · Gabe Guo, Thanawat Sornwanee, Lutong Hao, Elon Litman, Stefano Ermon, Jose Blanchet ·

    ABC: Any-Subset Autoregression via Non-Markovian Diffusion Bridges in Continuous Time and Space

    arXiv:2604.27443v1 Announce Type: cross Abstract: Generating continuous-time, continuous-space stochastic processes (e.g., videos, weather forecasts) conditioned on partial observations (e.g., first and last frames) is a fundamental challenge. Existing approaches, (e.g., diffusio…

  267. arXiv cs.AI TIER_1 English(EN) · Michael Cardei, Huu Binh Ta, Ferdinando Fioretto ·

    Simple Self-Conditioning Adaptation for Masked Diffusion Models

    arXiv:2604.26985v1 Announce Type: cross Abstract: Masked diffusion models (MDMs) generate discrete sequences by iterative denoising under an absorbing masking process. In standard masked diffusion, if a token remains masked after a reverse update, the model discards its clean-sta…

  268. arXiv cs.CL TIER_1 English(EN) · Yihong Dong, Zhaoyu Ma, Xue Jiang, Zhiyuan Fan, Jiaru Qian, Yongmin Li, Jianha Xiao, Zhi Jin, Rongyu Cao, Binhua Li, Fei Huang, Yongbin Li, Ge Li ·

    Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

    arXiv:2510.18165v3 Announce Type: replace-cross Abstract: Diffusion language models (DLMs) are emerging as a compelling alternative to the dominant autoregressive paradigm, offering inherent advantages in parallel generation and bidirectional context modeling. However, for the ta…

  269. arXiv cs.LG TIER_1 English(EN) · Hyukjun Lim, Soojung Yang, Lucas Pin\`ede, Miguel Steiner, Yuanqi Du, Rafael G\'omez-Bombarelli ·

    A Priori Sampling of Transition States with Guided Diffusion

    arXiv:2603.25980v2 Announce Type: replace-cross Abstract: Transition states, the first-order saddle points on the potential energy surfaces, govern the kinetics and mechanisms of chemical reactions and conformational changes. Locating them is challenging because transition pathwa…

  270. arXiv cs.LG TIER_1 English(EN) · Zixuan Zhang, Kaixuan Huang, Tuo Zhao, Mengdi Wang, Minshuo Chen ·

    Diffusion Model for Manifold Data: Score Decomposition, Curvature, and Statistical Complexity

    arXiv:2603.20645v2 Announce Type: replace Abstract: Diffusion models have become a leading framework in generative modeling, yet their theoretical understanding -- especially for high-dimensional data concentrated on low-dimensional structures -- remains incomplete. This paper in…

  271. arXiv cs.LG TIER_1 English(EN) · Yuxiang Wang, Yu Xiang, Baojian Zhou, Qifang Zhao, Keyue Jiang, Yanghua Xiao, Xiaoxiao Xu ·

    On the Trainability of Masked Diffusion Language Models via Blockwise Locality

    arXiv:2604.24832v1 Announce Type: new Abstract: Masked diffusion language models (MDMs) have recently emerged as a promising alternative to standard autoregressive large language models (AR-LLMs), yet their optimization can be substantially less stable. We study blockwise MDMs an…

  272. Hugging Face Daily Papers TIER_1 English(EN) ·

    Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds

    Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning leads to significant performance degradation. However,…

  273. arXiv cs.LG TIER_1 English(EN) · Weiguo Gao, Ming Li ·

    Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging

    arXiv:2505.16024v2 Announce Type: replace Abstract: Diffusion trajectory distillation accelerates sampling by training a student model to approximate the multi-step denoising trajectories of a pretrained teacher model using far fewer steps. Despite strong empirical results, the t…

  274. arXiv cs.LG TIER_1 English(EN) · Zicheng Lyu, Zengfeng Huang ·

    Radial Load--Reserve Certificates for Wasserstein Propagation in Isotropic Diffusion Samplers

    arXiv:2603.19670v3 Announce Type: replace Abstract: Nonasymptotic diffusion analyses often decompose sampling error into score estimation, continuous reverse-time propagation, discretization, and terminal conversion. We isolate the propagation module on certified scalar-isotropic…

  275. arXiv cs.LG TIER_1 English(EN) · Dake Bu, Wei Huang, Andi Han, Hau-San Wong, Qingfu Zhang, Taiji Suzuki, Atsushi Nitanda ·

    DPRM: A Plug-in Doob h transform-induced Token-Ordering Module for Diffusion Language Models

    arXiv:2604.24357v1 Announce Type: new Abstract: Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice: which tokens should be revealed, retained, revised or verified at each step? Existing systems mainly use ran…

  276. arXiv cs.LG TIER_1 English(EN) · Dong Liu, Haisheng Wang, Yanxuan Yu ·

    Accelerating Frequency Domain Diffusion Models with Error-Feedback Event-Driven Caching

    arXiv:2604.22901v1 Announce Type: new Abstract: Diffusion models achieve remarkable success in time series generation. However, slow inference limits their practical deployment. We propose E$^2$-CRF (Error-Feedback Event-Driven Cumulative Residual Feature caching) to accelerate f…

  277. arXiv cs.LG TIER_1 English(EN) · Aditi De ·

    Symmetric Equilibrium Propagation for Thermodynamic Diffusion Training

    arXiv:2604.23806v1 Announce Type: new Abstract: The reverse process in score-based diffusion models is formally equivalent to overdamped Langevin dynamics in a time-dependent energy landscape. In our prior work we showed that a bilinearly-coupled analog substrate can physically r…

  278. arXiv cs.LG TIER_1 English(EN) · Yiming Zhang, Sitong Liu, Ke Li, Zhihong Wu, Alex Cloninger, Melvin Leok ·

    GeoEdit: Local Frames for Fast, Training-Free On-Manifold Editing in Diffusion Models

    arXiv:2604.24238v1 Announce Type: new Abstract: Diffusion models are a leading paradigm for data generation, but training-free editing typically re-runs the full denoising trajectory for every edit strength, making iterative refinement expensive. To address this issue, we instead…

  279. arXiv cs.LG TIER_1 English(EN) · Enshu Liu, Xuefei Ning, Yu Wang, Zinan Lin ·

    NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization

    arXiv:2604.18471v2 Announce Type: replace Abstract: Discrete diffusion language models (dLLMs) have recently emerged as a promising alternative to traditional autoregressive approaches, offering the flexibility to generate tokens in arbitrary orders and the potential of parallel …

  280. arXiv cs.AI TIER_1 English(EN) · Atsushi Nitanda ·

    DPRM: A Plug-in Doob h transform-induced Token-Ordering Module for Diffusion Language Models

    Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice: which tokens should be revealed, retained, revised or verified at each step? Existing systems mainly use random masking or confidence-driven ordering. Rando…

  281. arXiv cs.LG TIER_1 English(EN) · Melvin Leok ·

    GeoEdit: Local Frames for Fast, Training-Free On-Manifold Editing in Diffusion Models

    Diffusion models are a leading paradigm for data generation, but training-free editing typically re-runs the full denoising trajectory for every edit strength, making iterative refinement expensive. To address this issue, we instead edit near the data manifold, where small local …

  282. arXiv cs.LG TIER_1 English(EN) · Luca Ambrogioni ·

    How Out-of-Equilibrium Phase Transitions can Seed Pattern Formation in Trained Diffusion Models

    arXiv:2603.20092v4 Announce Type: replace Abstract: Diffusion models generate structure by progressively transforming noise into data, yet the mechanisms underlying this transition remain poorly understood. In this work, we show that pattern formation in trained diffusion models …

  283. arXiv cs.CV TIER_1 English(EN) · Kailin Tan, Jincheng Dai, Sixian Wang, Guo Lu, Shuo Shao, Kai Niu, Wenjun Zhang, Ping Zhang ·

    DiT-JSCC: Rethinking Deep JSCC with Diffusion Transformers and Semantic Representations

    arXiv:2601.03112v2 Announce Type: replace-cross Abstract: Generative joint source-channel coding (GJSCC) has emerged as a new Deep JSCC paradigm for achieving high-fidelity and robust image transmission under extreme wireless channel conditions, such as ultra-low bandwidth and lo…

  284. arXiv cs.CV TIER_1 English(EN) · Bingshuo Qian, Xiang Cheng ·

    Learning When to Denoise: Optimizing Asynchronous Schedules for Latent Diffusion

    arXiv:2606.19662v1 Announce Type: new Abstract: Multi-representation diffusion models can improve visual synthesis by denoising complementary views of an image, but their performance depends critically on the asynchronous schedule that determines when each representation is denoi…

  285. arXiv cs.CV TIER_1 English(EN) · Wei Pan, Xuhan Zheng, Yilin Shi, Huiguo He, Hiuyi Cheng, Dezhi Peng, Minghui Liao, Lianwen Jin ·

    DiffMath: Symbol- and Graph-Aware Latent Diffusion Transformer for Handwritten Mathematical Expression Generation

    arXiv:2606.19939v1 Announce Type: new Abstract: Handwritten Mathematical Expression Generation (HMEG) is challenging due to the complex two-dimensional layouts and long-range structural dependencies of mathematical expressions. Existing methods typically rely on explicit spatial …

  286. arXiv cs.CV TIER_1 English(EN) · Kaili Wang, Martin Dimitrievski, Jose Maria Salvador, Ben Stoffelen, David Van Hamme, Lore Goetschalckx ·

    Addressing Detail Bottlenecks in Latent Diffusion for RGB-to-SWIR Image Translation

    arXiv:2606.19961v1 Announce Type: new Abstract: Latent diffusion models (LDMs) enable efficient image-to-image translation but discard fine spatial details during compression, degrading downstream perception tasks. We identify two bottlenecks: the autoencoder, which loses spatial…

  287. arXiv cs.CV TIER_1 English(EN) · Qinghong Yin, Yu Tian, Heming Yang, Xiang Chen, Xianlin Zhang, Yue Ming, Xueming Li, Yue Zhang ·

    Rethinking Robust Adversarial Concept Erasure in Diffusion Models

    arXiv:2510.27285v3 Announce Type: replace Abstract: Concept erasure aims to selectively unlearning undesirable content in diffusion models (DMs) to reduce the risk of sensitive content generation. As a novel paradigm in concept erasure, most existing methods employ adversarial tr…

  288. arXiv cs.CV TIER_1 English(EN) · José A. Chávez ·

    On the Redundancy of Timestep Embeddings in Diffusion Models

    Diffusion models rely heavily on explicit timestep embeddings to modulate the denoising process across various noise scales. In this work, we challenge the necessity of these temporal signals by analyzing their impact on U-Net and Diffusion Transformer architectures. Beyond empir…

  289. arXiv cs.CV TIER_1 English(EN) · Lore Goetschalckx ·

    Addressing Detail Bottlenecks in Latent Diffusion for RGB-to-SWIR Image Translation

    Latent diffusion models (LDMs) enable efficient image-to-image translation but discard fine spatial details during compression, degrading downstream perception tasks. We identify two bottlenecks: the autoencoder, which loses spatial information, and the conditioning pathway, whic…

  290. arXiv cs.CV TIER_1 English(EN) · Lianwen Jin ·

    DiffMath: Symbol- and Graph-Aware Latent Diffusion Transformer for Handwritten Mathematical Expression Generation

    Handwritten Mathematical Expression Generation (HMEG) is challenging due to the complex two-dimensional layouts and long-range structural dependencies of mathematical expressions. Existing methods typically rely on explicit spatial supervision, such as symbol-level bounding boxes…

  291. arXiv stat.ML TIER_1 English(EN) · M. Forzo, E. Monzio Compagnoni, A. Russo, A. Pacchiano ·

    A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise

    arXiv:2606.18183v1 Announce Type: new Abstract: Temporal difference (TD) learning with linear function approximation is a core method for policy evaluation. Its classical continuous-time description is an ordinary differential equation (ODE), which captures the asymptotic mean dy…

  292. arXiv cs.CV TIER_1 English(EN) · Xinyuan Zhao, Eero P. Simoncelli ·

    Learning a Maximum Entropy Model for Visual Textures using Diffusion

    arXiv:2606.17342v1 Announce Type: new Abstract: Visual textures -- spatially homogeneous image regions containing repeated elements (e.g. a field of grass, the bark of a tree) -- are ubiquitous in visual scenes and provide important cues for recognizing and analyzing materials an…

  293. arXiv cs.CV TIER_1 English(EN) · Fangzheng Wu, Brian Summa ·

    Attention Sinks in Diffusion Transformers: A Causal Analysis

    arXiv:2605.09313v3 Announce Type: replace Abstract: Attention sinks -- tokens that receive disproportionate attention mass -- are assumed to be functionally important in autoregressive language models, but their role in diffusion transformers remains unclear. We present a causal …

  294. arXiv stat.ML TIER_1 English(EN) · A. Pacchiano ·

    A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise

    Temporal difference (TD) learning with linear function approximation is a core method for policy evaluation. Its classical continuous-time description is an ordinary differential equation (ODE), which captures the asymptotic mean dynamics but neglects stochastic fluctuations dete…

  295. arXiv cs.CV TIER_1 English(EN) · Mengping Yang, Zhiyu Tan, Binglei Li, Xiaomeng Yang, Hesen Chen, Hao Li ·

    DiverseDiT: Towards Diverse Representation Learning in Diffusion Transformers

    arXiv:2603.04239v2 Announce Type: replace Abstract: Recent breakthroughs in Diffusion Transformers (DiTs) have revolutionized the field of visual synthesis due to their superior scalability. To facilitate DiTs' capability of capturing meaningful internal representations, recent w…

  296. arXiv cs.CV TIER_1 English(EN) · Shuai Wang, Liang Li, Yang Chen, Ruopeng Gao, Yao Teng, Limin Wang ·

    UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

    arXiv:2606.16255v1 Announce Type: new Abstract: Unified Multimodal Models (UMMs) have emerged as a critical direction for general-purpose multimodal intelligence, integrating understanding and generation into a single framework. However, existing UMMs face prominent challenges: (…

  297. arXiv cs.CV TIER_1 English(EN) · Xiang Gao, Yunpeng Jia ·

    Structure-Semantic Co-optimized Latent Diffusion Model for Fast Visual Anagram Synthesis

    arXiv:2606.16241v1 Announce Type: new Abstract: Visual anagram is an intriguing form of art creation wherein a single image presents different conceptual interpretations under transformations such as flipping or rotation. Recent work has achieved visual anagram synthesis by lever…

  298. arXiv cs.CV TIER_1 English(EN) · Xiang Gao, Chenxin Zhu, Yushun Fang, Qiang Hu, Xiaoyun Zhang ·

    teasr: training-efficient any-step diffusion transformer for real-world image super-resolution

    arXiv:2606.16188v1 Announce Type: new Abstract: Diffusion models excel in Real-World Image Super-Resolution (Real-ISR) due to their powerful generative priors but suffer from slow iterative sampling. Although existing one-step distillation methods accelerate inference, they typic…

  299. arXiv cs.CV TIER_1 English(EN) · Ramin Nakhli, Mahesh Ramachandran, Luca Ballan ·

    Unlocking Diffusion Hierarchies: Adaptive Timestep Selection for Zero-Shot Segmentation

    arXiv:2606.15590v1 Announce Type: new Abstract: Zero-shot segmentation has recently shown notable improvement by leveraging the rich visual priors in large-scale text-to-image diffusion models, such as Stable Diffusion. However, current diffusion-based methods often face limitati…

  300. arXiv cs.CV TIER_1 English(EN) · Weichen Fan, Haiwen Diao, Penghao Wu, Ziwei Liu ·

    Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

    arXiv:2606.15236v1 Announce Type: new Abstract: Pixel-space diffusion models are trained on full-bandwidth noisy images, yet the useful signal available to the denoiser is strongly frequency dependent. Under rectified-flow diffusion and natural-image power-law spectra, the per-ba…

  301. arXiv cs.CV TIER_1 English(EN) · Limin Wang ·

    UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

    Unified Multimodal Models (UMMs) have emerged as a critical direction for general-purpose multimodal intelligence, integrating understanding and generation into a single framework. However, existing UMMs face prominent challenges: (1) the inherent learning conflicts between visua…

  302. arXiv stat.ML TIER_1 English(EN) · Na\"il B. Khelifa, Richard E. Turner, Ramji Venkataramanan ·

    Recursively Trained Diffusion Models: Limiting Collapse Distribution and Spectral Characterization

    arXiv:2606.13796v1 Announce Type: new Abstract: Recursive training of generative models on their own outputs can lead to model collapse, a compounding drift away from the true data distribution. Existing theoretical works bound finite-round error accumulation in the context of di…

  303. arXiv stat.ML TIER_1 English(EN) · Makoto Shing, Masanori Koyama, Takuya Akiba ·

    DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

    arXiv:2506.14202v4 Announce Type: replace-cross Abstract: End-to-end backpropagation requires storing activations throughout all layers, creating memory bottlenecks that limit model scalability. Existing block-wise training methods offer means to alleviate this problem, but they …

  304. arXiv stat.ML TIER_1 English(EN) · Ziao Wang, Lei Ying ·

    Diffusion-Network Alignment: An Efficient Algorithm and Explicit Probability Bounds

    arXiv:2606.12879v1 Announce Type: cross Abstract: This paper studies a variation of the classic network alignment problem, named diffusion-network alignment. The goal is to align the vertices of a rooted diffusion tree to the vertices of a network, where the diffusion tree could …

  305. arXiv cs.CV TIER_1 English(EN) · Lidia Troeshestova, Alexander Ustyuzhanin, Sergey Kastryulin ·

    DuET: Dual Expert Trajectories for Diffusion Image Editing

    arXiv:2606.13303v1 Announce Type: new Abstract: Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the re…

  306. arXiv stat.ML TIER_1 English(EN) · Ramji Venkataramanan ·

    Recursively Trained Diffusion Models: Limiting Collapse Distribution and Spectral Characterization

    Recursive training of generative models on their own outputs can lead to model collapse, a compounding drift away from the true data distribution. Existing theoretical works bound finite-round error accumulation in the context of diffusion models, but two questions remain open:~w…

  307. arXiv cs.CV TIER_1 English(EN) · Sergey Kastryulin ·

    DuET: Dual Expert Trajectories for Diffusion Image Editing

    Recent diffusion editors perform diverse instruction-based edits while conditioning on the source image at every denoising step. Yet persistent source-image conditioning can limit how fully an edit is executed and how natural the result appears, especially when the target scene d…

  308. arXiv stat.ML TIER_1 English(EN) · Pierre-Alexandre Mattei ·

    Towards More General Control of Diffusion Models Using Jeffrey Guidance

    A key strength of diffusion models lies in their flexibility, since their outputs can be controlled at sampling time through guidance. However, beyond simple cases such as conditional sampling, the target distribution is often left implicit, defined only through a sampling rule o…

  309. arXiv cs.CV TIER_1 English(EN) · Benedetta Tondi ·

    Efficient, Robust, and Anti-Collusion Fingerprinting of Image Diffusion Models

    Model fingerprinting, embedding user-specific identifiers (fingerprints) into generated outputs, has recently emerged as a popular solution to protect the intellectual property rights (IPR) of generative text-to-image (T2I) models and prevent unauthorized redistribution. In this …

  310. arXiv stat.ML TIER_1 English(EN) · Lei Ying ·

    Diffusion-Network Alignment: An Efficient Algorithm and Explicit Probability Bounds

    This paper studies a variation of the classic network alignment problem, named diffusion-network alignment. The goal is to align the vertices of a rooted diffusion tree to the vertices of a network, where the diffusion tree could be from a communication trace or contact tracing, …

  311. arXiv stat.ML TIER_1 English(EN) · Lorenzo Bardone, Claudia Merger, Sebastian Goldt ·

    A theory of learning data statistics in diffusion models, from easy to hard

    arXiv:2603.12901v2 Announce Type: replace Abstract: While diffusion models have emerged as a powerful class of generative models, their learning dynamics remain poorly understood. We address this issue first by empirically showing that standard diffusion models trained on natural…

  312. arXiv cs.CV TIER_1 English(EN) · Ruitong Sun, Tianze Yang, Wei Niu, Jin Sun ·

    RSTR: Reducing SpatioTemporal Redundancy in Diffusion Transformers

    arXiv:2512.14096v2 Announce Type: replace Abstract: Diffusion Transformers (DiTs) have achieved remarkable success in image generation, yet their deployment is hindered by high computational costs. We identify two sources of redundancy. First, temporal redundancy: Classifier-Free…

  313. arXiv cs.CV TIER_1 English(EN) · Chunlin Qiu, Ang Li, Tianxiao Huang, Ruilin Gan, Yunjie Ge, Shenyi Zhang, Huayi Duan, Lingchen Zhao, Chao Shen, Qian Wang ·

    VOID: Defeating Unauthorized Mimicry in Latent Diffusion Models

    arXiv:2606.12263v1 Announce Type: new Abstract: While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing defenses inject deceptive perturbations to steer the generated images towar…

  314. arXiv cs.CV TIER_1 English(EN) · Qian Wang ·

    VOID: Defeating Unauthorized Mimicry in Latent Diffusion Models

    While Latent Diffusion Models (LDMs) have revolutionized visual synthesis, they are increasingly exploited for unauthorized mimicry of individuals. Existing defenses inject deceptive perturbations to steer the generated images toward irrelevant targets. However, this approach hin…

  315. arXiv stat.ML TIER_1 English(EN) · Dennis Elbr\"achter, Giovanni S. Alberti, Matteo Santacesaria ·

    MAD: Manifold Attracted Diffusion

    arXiv:2509.24710v2 Announce Type: replace Abstract: Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an…

  316. arXiv cs.CV TIER_1 English(EN) · Zhengxuan Wei, Yi Dong, Zonghui Li, Xianhui Lin, Xing Liu, Hong Gu, Shaofeng Zhang, Wenbin Li, Qi Fan ·

    SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models

    arXiv:2606.10617v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) merging can efficiently combine diverse generative capabilities from multiple trained LoRAs for a diffusion model. However, existing LoRA merging techniques often suffer from severe parameter interference,…

  317. arXiv stat.ML TIER_1 English(EN) · Zahra Kadkhodaie, Aram-Alexandre Pooladian, Sinho Chewi, Eero Simoncelli ·

    Blind denoising diffusion models and the blessings of dimensionality

    arXiv:2602.09639v2 Announce Type: replace-cross Abstract: Denoising diffusion models (DDMs) are state-of-the-art methods for learning densities from data across numerous domains, yet many aspects of the training and sampling pipeline remain poorly understood. In particular, noise…

  318. arXiv cs.CV TIER_1 English(EN) · Qi Fan ·

    SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models

    Low-Rank Adaptation (LoRA) merging can efficiently combine diverse generative capabilities from multiple trained LoRAs for a diffusion model. However, existing LoRA merging techniques often suffer from severe parameter interference, causing destructive collisions in the shared pa…

  319. arXiv stat.ML TIER_1 English(EN) · Al Zadid Sultan Bin Habib, Md Younus Ahamed, Prashnna Gyawali, Gianfranco Doretto, Donald A. Adjeroh ·

    BSTabDiff: Block-Subunit Diffusion Priors for High-Dimensional Tabular Data Generation

    arXiv:2606.09257v1 Announce Type: cross Abstract: High-Dimensional Low-Sample Size (HDLSS) tabular domains (e.g., omics) are characterized by $n \ll m$, where $n$ = number of samples, and $m$ = number of features. Such domains often exhibit strong local correlation groups, sparse…

  320. arXiv cs.CV TIER_1 English(EN) · Kaizhen Zhu, Mokai Pan, Zhechuan Yu, Jingya Wang, Jingyi Yu, Ye Shi ·

    Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis

    arXiv:2509.24531v2 Announce Type: replace Abstract: Diffusion Bridge and Flow Matching have both demonstrated compelling empirical performance in transformation between arbitrary distributions. However, there remains confusion about which approach is generally preferable, and the…

  321. arXiv cs.CV TIER_1 English(EN) · Lianyu Pang, Tianlin Pan, Cheng Da, Changqian Yu, Huan Yang, Kun Gai, Song Guo, Wenhan Luo ·

    MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

    arXiv:2606.08788v1 Announce Type: new Abstract: Representation alignment with pretrained vision models has recently shown strong potential for accelerating diffusion transformer training. By aligning intermediate diffusion features with clean-image representations from self-super…

  322. arXiv stat.ML TIER_1 English(EN) · Yilin Zheng, Haowei Wang, Szu Hui Ng, Enlu Zhou ·

    Improving Bayesian Optimization via Training-Aware Conditional Diffusion Models

    arXiv:2606.08438v1 Announce Type: new Abstract: Bayesian optimization (BO) is a widely used approach for black-box optimization that uses a Gaussian process (GP) as a surrogate and guides sequential evaluations via an acquisition function, with the ultimate goal of locating the g…

  323. arXiv stat.ML TIER_1 English(EN) · Donald A. Adjeroh ·

    BSTabDiff: Block-Subunit Diffusion Priors for High-Dimensional Tabular Data Generation

    High-Dimensional Low-Sample Size (HDLSS) tabular domains (e.g., omics) are characterized by $n \ll m$, where $n$ = number of samples, and $m$ = number of features. Such domains often exhibit strong local correlation groups, sparse cross-group dependencies, heavy-tailed non-Gaussi…

  324. arXiv stat.ML TIER_1 English(EN) · Enlu Zhou ·

    Improving Bayesian Optimization via Training-Aware Conditional Diffusion Models

    Bayesian optimization (BO) is a widely used approach for black-box optimization that uses a Gaussian process (GP) as a surrogate and guides sequential evaluations via an acquisition function, with the ultimate goal of locating the global optimum $\mathbf{x}^{\star}$. To align wit…

  325. arXiv cs.CV TIER_1 English(EN) · Xiao Cui, Yulei Qin, Mo Zhu, Wengang Zhou, Hongsheng Li, Houqiang Li ·

    Geometry-Aware Dataset Condensation for Diffusion Model Training

    arXiv:2606.05883v1 Announce Type: new Abstract: Dataset condensation aims to construct compact datasets from real data via synthesis or selection. However, existing approaches are ill-suited for diffusion model training: synthetic data generation often yields low-fidelity samples…

  326. arXiv stat.ML TIER_1 English(EN) · Hongfan Gao, Wangmeng Shen, Bin Yang, Jilin Hu ·

    HyFAD: Hybrid Time-Frequency Diffusion with Frequency-Aware Embedding for Time Series Imputation

    arXiv:2606.05239v1 Announce Type: new Abstract: Diffusion models have demonstrated strong performance in time series modeling due to their ability to progressively capture complex data distributions through iterative denoising. However, existing approaches struggle with frequency…

  327. arXiv stat.ML TIER_1 English(EN) · Na\"il B. Khelifa, Richard E. Turner, Ramji Venkataramanan ·

    Diffusion Models Observe Only Gradients: A Geometric Perspective on Score Matching Errors

    arXiv:2606.06179v1 Announce Type: new Abstract: Score-based diffusion models are typically trained by minimizing the $L^2$ score matching error, and standard theoretical analyses rely on this quantity to bound the sampling discrepancy between the learned and target distributions.…

  328. arXiv stat.ML TIER_1 English(EN) · Hongkai Zheng, Austin Wang, Zihui Wu, Zhengyu Huang, Ricardo Baptista, Yisong Yue ·

    Blade: A Derivative-free Bayesian Inversion Method using Diffusion Priors

    arXiv:2510.10968v3 Announce Type: replace-cross Abstract: Derivative-free Bayesian inversion arises in science and engineering applications, particularly when forward model is costly or infeasible to differentiate through. Existing derivative-free methods collapse the posterior t…

  329. arXiv cs.CV TIER_1 English(EN) · Micha\"el Soumm, Alexandre Fournier Montgieux, Yunlong He, Pietro Gori, Alasdair Newson ·

    Diff-CA: Separating Common and Salient Factors with Diffusion Models

    arXiv:2606.06120v1 Announce Type: new Abstract: Contrastive Analysis aims to separate factors that are common between two data distributions from those that are salient to only one of them. Existing contrastive methods are based on generative models (e.g., VAEs or GANs) that ofte…

  330. arXiv cs.CV TIER_1 English(EN) · Noam Issachar, Dani Lischinski, Raanan Fattal ·

    Complexity-Balanced Diffusion Splitting

    arXiv:2606.06477v1 Announce Type: new Abstract: Standard continuous-time generative models rely on monolithic architectures that must navigate vastly different signal regimes, from isotropic noise to intricate data distributions. While scaling model capacity improves performance,…

  331. arXiv cs.CV TIER_1 English(EN) · Weiyan Chen, Weijian Deng, Yao Xiao, Weijie Tu, ZiYi Dong, Ibrahim Radwan, Liang Lin, Pengxu Wei ·

    When Preference Labels Fall Short: Aligning Diffusion Models from Real Data

    arXiv:2605.19839v2 Announce Type: replace Abstract: Preference alignment aims to guide generative models by learning from comparisons between preferred and non-preferred samples. In practice, most existing approaches rely on preference pairs constructed from model-generated image…

  332. arXiv cs.CV TIER_1 English(EN) · Raanan Fattal ·

    Complexity-Balanced Diffusion Splitting

    Standard continuous-time generative models rely on monolithic architectures that must navigate vastly different signal regimes, from isotropic noise to intricate data distributions. While scaling model capacity improves performance, deploying a massive network uniformly across th…

  333. arXiv stat.ML TIER_1 English(EN) · Ramji Venkataramanan ·

    Diffusion Models Observe Only Gradients: A Geometric Perspective on Score Matching Errors

    Score-based diffusion models are typically trained by minimizing the $L^2$ score matching error, and standard theoretical analyses rely on this quantity to bound the sampling discrepancy between the learned and target distributions. We show the $L^2$ score error is not the right …

  334. arXiv cs.CV TIER_1 English(EN) · Alasdair Newson ·

    Diff-CA: Separating Common and Salient Factors with Diffusion Models

    Contrastive Analysis aims to separate factors that are common between two data distributions from those that are salient to only one of them. Existing contrastive methods are based on generative models (e.g., VAEs or GANs) that often suffer from limited reconstruction and image q…

  335. arXiv cs.CV TIER_1 English(EN) · Houqiang Li ·

    Geometry-Aware Dataset Condensation for Diffusion Model Training

    Dataset condensation aims to construct compact datasets from real data via synthesis or selection. However, existing approaches are ill-suited for diffusion model training: synthetic data generation often yields low-fidelity samples unsuitable for authentic modeling, while real s…

  336. arXiv stat.ML TIER_1 English(EN) · Riccardo Saporiti, Fabio Nobile ·

    Neural Galerkin Normalizing Flows for Bayesian Inference of Diffusions with Inaccessible Boundaries

    arXiv:2606.04324v1 Announce Type: cross Abstract: One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observ…

  337. arXiv stat.ML TIER_1 English(EN) · Jilin Hu ·

    HyFAD: Hybrid Time-Frequency Diffusion with Frequency-Aware Embedding for Time Series Imputation

    Diffusion models have demonstrated strong performance in time series modeling due to their ability to progressively capture complex data distributions through iterative denoising. However, existing approaches struggle with frequency-sensitive denoising, high-frequency reconstruct…

  338. arXiv cs.CV TIER_1 English(EN) · Yan Zeng, Masanori Suganuma, Takayuki Okatani ·

    An Improved Method for Personalizing Diffusion Models

    arXiv:2407.05312v2 Announce Type: replace Abstract: Diffusion models have demonstrated impressive image generation capabilities. Personalized approaches, such as textual inversion and Dreambooth, enhance model individualization using specific images. These methods enable generati…

  339. arXiv stat.ML TIER_1 English(EN) · Jungkyu Kim, Taeyoung Park, Kibok Lee ·

    AugMask: Training Diffusion Models on Incomplete Tabular Data via Stochastic Augmentation and Masking

    arXiv:2606.03347v1 Announce Type: cross Abstract: Score-based diffusion models have emerged as prominent deep generative models; however, their application to tabular data remains challenging because their backbones assume fully specified inputs, whereas real-world tabular data o…

  340. arXiv cs.CV TIER_1 English(EN) · Prithviraj Verma, Pawan Kumar, Chandan Deshani, Prasun Chandra Tripathi ·

    An Attention-Based Denoising Model for Diffusion Weighted Imaging

    arXiv:2606.03903v1 Announce Type: new Abstract: Diffusion-weighted imaging (DWI) is used for whole-body cancer screening, but it typically requires a long acquisition time. When the scan time is reduced, the image quality often suffers, leading to increased noise in the scans. Ma…

  341. arXiv stat.ML TIER_1 English(EN) · Weiguo Gao, Ming Li, Lei Shi, Hanfei Zhou ·

    A Quantitative Approximation Framework for Flow Distillation in Diffusion Models

    arXiv:2606.03820v1 Announce Type: new Abstract: We develop a quantitative approximation framework for diffusion distillation, viewing few-step sampling as error propagation under compositions of learned flow maps. Focusing on trajectory distillation for the probability-flow ODE, …

  342. arXiv cs.CV TIER_1 English(EN) · Tianxiong Zhong, Xingye Tian, Xuebo Wang, Xin Tao, Pengfei Wan ·

    Diffusing in the Right Space: A Systematic Study of Latent Diffusability

    arXiv:2606.03578v1 Announce Type: new Abstract: Latent diffusion models leverage visual tokenizers to compress images into latent spaces for efficient generative modeling. However, better reconstruction quality of a tokenizer does not necessarily translate into better generation …

  343. arXiv cs.CV TIER_1 English(EN) · Yan Zeng, Masanori Suganuma, Takayuki Okatani ·

    Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method

    arXiv:2606.03111v1 Announce Type: new Abstract: This paper studies the problem of inverting the DDIM image generation process to recover latent variables, particularly the initial noise map, from a generated image. Existing methods often struggle with accuracy in this task. We pr…

  344. arXiv stat.ML TIER_1 English(EN) · Jing Jia, Wei Yuan, Sifan Liu, Liyue Shen, Guanyang Wang ·

    Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance

    arXiv:2601.22443v2 Announce Type: replace-cross Abstract: Can a diffusion model trained on bedrooms recover human faces? Diffusion models are widely used as priors for inverse problems, but standard approaches usually assume a high-fidelity model trained on data that closely matc…

  345. arXiv cs.CV TIER_1 English(EN) · Katarzyna Zaleska, {\L}ukasz Popek, Monika Wysocza\'nska, Kamil Deja ·

    Attention, May I Have Your Decision? Localizing Generative Choices in Diffusion Models

    arXiv:2604.06052v2 Announce Type: replace Abstract: Text-to-image diffusion models exhibit remarkable generative capabilities, yet their internal operations remain opaque, particularly when handling prompts that are not fully descriptive. In such scenarios, models must make impli…

  346. arXiv stat.ML TIER_1 English(EN) · Fabio Nobile ·

    Neural Galerkin Normalizing Flows for Bayesian Inference of Diffusions with Inaccessible Boundaries

    One of the primary challenges in Bayesian inference on the parameters of a diffusion model from discrete observations is the unavailability of an analytical expression for the transition density function between consecutive observation times, which is needed to derive the likelih…

  347. arXiv cs.CV TIER_1 English(EN) · Prasun Chandra Tripathi ·

    An Attention-Based Denoising Model for Diffusion Weighted Imaging

    Diffusion-weighted imaging (DWI) is used for whole-body cancer screening, but it typically requires a long acquisition time. When the scan time is reduced, the image quality often suffers, leading to increased noise in the scans. Magnitude reconstruction in DWI introduces signal-…

  348. arXiv stat.ML TIER_1 English(EN) · Hanfei Zhou ·

    A Quantitative Approximation Framework for Flow Distillation in Diffusion Models

    We develop a quantitative approximation framework for diffusion distillation, viewing few-step sampling as error propagation under compositions of learned flow maps. Focusing on trajectory distillation for the probability-flow ODE, we show that local approximation errors can be s…

  349. arXiv cs.CV TIER_1 English(EN) · Pengfei Wan ·

    Diffusing in the Right Space: A Systematic Study of Latent Diffusability

    Latent diffusion models leverage visual tokenizers to compress images into latent spaces for efficient generative modeling. However, better reconstruction quality of a tokenizer does not necessarily translate into better generation quality, suggesting that latent representations …

  350. arXiv stat.ML TIER_1 English(EN) · Kibok Lee ·

    AugMask: Training Diffusion Models on Incomplete Tabular Data via Stochastic Augmentation and Masking

    Score-based diffusion models have emerged as prominent deep generative models; however, their application to tabular data remains challenging because their backbones assume fully specified inputs, whereas real-world tabular data often contain missing values. We propose AugMask, a…

  351. arXiv stat.ML TIER_1 English(EN) · Keito Wakatsuki, Hideaki Shimazaki ·

    Self-Regulating Annealing in Heavy-Tailed Diffusion Models

    arXiv:2606.01645v1 Announce Type: new Abstract: Diffusion models have emerged as a leading framework for deep generative modeling. While the standard Gaussian formulation is theoretically convenient, its suitability for heavy-tailed datasets remains unclear. To address this, heav…

  352. arXiv stat.ML TIER_1 English(EN) · Ioar Casado-Telletxea, Omar Rivasplata ·

    Error Bounds for a Diffusion Model-Based Drift Estimator

    arXiv:2606.02115v1 Announce Type: new Abstract: Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drif…

  353. arXiv cs.CV TIER_1 English(EN) · Yongsen Cheng, Kai Liu, Kaiwen Tao, Junxian Li, Zhixin Wang, Zhikai Chen, Renjing Pei, Yulun Zhang ·

    PermuQuant: Lowering Per-Group Quantization Error by Reordering Channels for Diffusion Models

    arXiv:2605.09503v2 Announce Type: replace Abstract: Large-scale visual generative models have achieved remarkable performance. However, their high computational and memory costs make deployment challenging in resource-constrained scenarios, such as interactive applications and pe…

  354. arXiv stat.ML TIER_1 English(EN) · Florian Handke, Dejan Stan\v{c}evi\'c, Felix Koulischer, Thomas Demeester, Luca Ambrogioni ·

    The Entropic Signature of Class Speciation in Diffusion Models

    arXiv:2602.09651v2 Announce Type: replace Abstract: Diffusion models do not recover semantic structure uniformly over time. Instead, samples transition from semantic ambiguity to class commitment within a narrow regime. Recent theoretical work attributes this transition to dynami…

  355. arXiv cs.CV TIER_1 English(EN) · Tao Wu, Senmao Li, Yaxing Wang, Shiqi Yang, Kai Wang, Joost van de Weijer ·

    Training-free image inversion for one-step diffusion models

    arXiv:2606.01380v1 Announce Type: new Abstract: In this work, we introduce a novel training-free inversion (TFinv) framework for one-step diffusion models,addressing key challenges in real image inversion and editing. We first identify two critical factors hamperingreal-image inv…

  356. arXiv cs.CV TIER_1 English(EN) · Ziyue Lin, Jiahe Hou, Hongyu Xia, Xinrui Xie, Feifei Wang, Yuyin Zhou, Wei Wang, Jiawei Liu, Liangqiong Qu ·

    Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

    arXiv:2606.01048v1 Announce Type: new Abstract: We propose Decoupled Residual Denoising Diffusion models (DRDD) for unified and data-efficient image-to-image (I2I) translation. While diffusion models have advanced I2I translation in terms of quality and diversity, we uncover a pr…

  357. arXiv stat.ML TIER_1 English(EN) · Kijung Jeon, Michael Muehlebach, Molei Tao ·

    Efficient Diffusion Models under Nonconvex Equality and Inequality constraints via Landing

    arXiv:2604.17838v2 Announce Type: replace-cross Abstract: Generative modeling within constrained sets is essential for scientific and engineering applications involving physical, geometric, or safety requirements (e.g., molecular generation, robotics). We present a unified framew…

  358. arXiv cs.CV TIER_1 English(EN) · Xueji Fang, Liyuan Ma, Jianhao Zeng, Jinjin Cao, Mingyuan Zhou, Guo-Jun Qi ·

    FocusDiT: Masking Queries in Diffusion Transformers for Fine-grained Image Generation

    arXiv:2606.02090v1 Announce Type: new Abstract: Diffusion transformer (DiT) has been widely adopted in the generative diffusion field, advancing the denoising of query tokens through attention and Feed-Forward (\text{FFN}) layers. FFN actually acts as the key-value vocabulary for…

  359. arXiv stat.ML TIER_1 English(EN) · Omar Rivasplata ·

    Error Bounds for a Diffusion Model-Based Drift Estimator

    Parameter estimation in stochastic differential equations is a classical statistical problem of much importance in many scientific fields. Recent work of Tapia Costa et al. (2026) introduced a novel technique for estimating the drift when the diffusion parameter is known, using d…

  360. arXiv cs.CV TIER_1 English(EN) · Guo-Jun Qi ·

    FocusDiT: Masking Queries in Diffusion Transformers for Fine-grained Image Generation

    Diffusion transformer (DiT) has been widely adopted in the generative diffusion field, advancing the denoising of query tokens through attention and Feed-Forward (\text{FFN}) layers. FFN actually acts as the key-value vocabulary for decoding visual contents where the value embeds…

  361. arXiv stat.ML TIER_1 English(EN) · Shervin Khalafi, Alejandro Ribeiro, Dongsheng Ding ·

    Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints

    arXiv:2605.30825v1 Announce Type: cross Abstract: Unlearning in diffusion models aims to remove undesirable data or concepts while preserving the utility of pretrained models -- two fundamentally conflicting objectives. We propose a principled constrained optimization framework t…

  362. arXiv cs.CV TIER_1 English(EN) · Nathan Kessler, Robin Magnet, Jean Feydy ·

    Sinkhorn Normalization of Diffusion Kernels

    arXiv:2507.06161v2 Announce Type: replace Abstract: Smoothing a signal based on local neighborhoods is a core operation in machine learning and geometry processing. On well-structured domains such as vector spaces and manifolds, the Laplace operator derived from differential geom…

  363. arXiv cs.CV TIER_1 English(EN) · Shreyansh Modi, Akshat Tomar, Aarush Aggarwal ·

    Guidance for Low-Level Perceptual Editing in Unconditional Diffusion Models

    arXiv:2605.31162v1 Announce Type: new Abstract: Unconditional diffusion models offer powerful generative priors, yet steering them toward aesthetically enhanced outputs remains largely unexplored. We show that h-space patching, the dominant paradigm for training-free diffusion ed…

  364. arXiv stat.ML TIER_1 English(EN) · Nail B. Khelifa, Richard E. Turner, Ramji Venkataramanan ·

    Quantifying Error Propagation and Model Collapse in Diffusion Models

    arXiv:2602.16601v2 Announce Type: replace Abstract: Machine learning models are increasingly trained or fine-tuned on synthetic data. Recursively training on such data has been observed to significantly degrade performance in a wide range of tasks, often characterized by a progre…

  365. arXiv stat.ML TIER_1 English(EN) · Hideaki Shimazaki ·

    Self-Regulating Annealing in Heavy-Tailed Diffusion Models

    Diffusion models have emerged as a leading framework for deep generative modeling. While the standard Gaussian formulation is theoretically convenient, its suitability for heavy-tailed datasets remains unclear. To address this, heavy-tailed diffusion models (HTDMs) extend the sta…

  366. arXiv cs.CV TIER_1 English(EN) · Aarush Aggarwal ·

    Guidance for Low-Level Perceptual Editing in Unconditional Diffusion Models

    Unconditional diffusion models offer powerful generative priors, yet steering them toward aesthetically enhanced outputs remains largely unexplored. We show that h-space patching, the dominant paradigm for training-free diffusion editing, systematically fails for global, low-leve…

  367. arXiv stat.ML TIER_1 English(EN) · Dongsheng Ding ·

    Unlearning in Diffusion Models: A Unified Framework with KL Divergence and Likelihood Constraints

    Unlearning in diffusion models aims to remove undesirable data or concepts while preserving the utility of pretrained models -- two fundamentally conflicting objectives. We propose a principled constrained optimization framework that formulates unlearning as minimizing the deviat…

  368. arXiv cs.CV TIER_1 English(EN) · Yurong Gao, Zicheng Zhang, Congying Han, Tiande Guo, Xinmin Qiu ·

    Resolving Endpoint Underfitting in Diffusion Bridges via Noise Alignment

    arXiv:2605.28962v1 Announce Type: new Abstract: Diffusion bridge models offer a powerful framework for connecting two data distributions, such as in image restoration and translation. Many existing methods learn this bridge by mimicking the score-matching formulation of standard …

  369. arXiv stat.ML TIER_1 English(EN) · Tassilo Schwarz, Cai Dieball, Constantin Kogler, Renaud Lambiotte, Arnaud Doucet, Alja\v{z} Godec, George Deligiannidis ·

    Permutation-Invariant Spectral Learning via Dyson Diffusion

    arXiv:2510.08535v2 Announce Type: replace Abstract: Diffusion models are central to generative modeling and have been adapted to graphs by diffusing adjacency matrix representations. The challenge of having up to $n!$ such representations for graphs with $n$ nodes is only partial…

  370. arXiv stat.ML TIER_1 English(EN) · Jingda Wu, Changxiao Cai ·

    Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

    arXiv:2605.30153v1 Announce Type: new Abstract: Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of …

  371. arXiv stat.ML TIER_1 Italiano(IT) · Jennifer Rosina Andersson, Zheng Zhao ·

    Diffusion differentiable resampling

    arXiv:2512.10401v3 Announce Type: replace Abstract: This paper is concerned with differentiable resampling in the context of sequential Monte Carlo (e.g., particle filtering). Drawing on reparametrisation, we propose a new resampling method that is informative and instantly diffe…

  372. arXiv cs.CV TIER_1 English(EN) · Hadar Davidson, Noam Issachar, Sagie Benaim ·

    Colored Noise Diffusion Sampling

    arXiv:2605.30332v1 Announce Type: new Abstract: Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency global structures early and high-frequency fine details later. Conventio…

  373. arXiv cs.CV TIER_1 English(EN) · Sagie Benaim ·

    Colored Noise Diffusion Sampling

    Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency global structures early and high-frequency fine details later. Conventional stochastic differential equation (SDE) solve…

  374. arXiv stat.ML TIER_1 English(EN) · Changxiao Cai ·

    Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

    Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their statistical efficiency remains limited. Ex…

  375. arXiv cs.CV TIER_1 English(EN) · Viktoriia Mishkurova ·

    Treatment-Conditioned Diffusion for Forecasting Neurodegenerative Disease Progression

    Forecasting the progression of neurodegenerative diseases, such as Parkinson's disease, is essential for effective long-term planning and personalized therapeutic intervention. Existing systems typically produce scalar clinical scores that ignore the rich structure of longitudina…

  376. arXiv cs.CV TIER_1 English(EN) · Dacheng Tao ·

    Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing

    Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingly critical. We find that existing backdoor-based d…

  377. arXiv cs.CV TIER_1 Deutsch(DE) · Sean Man, Gilad Deutch, Roy Ganz, Roi Ronen, Shahar Tsiper, Shai Mazor, Niv Nayman ·

    DODO: Discrete OCR Diffusion Models

    arXiv:2602.16872v2 Announce Type: replace Abstract: Optical Character Recognition (OCR) is a fundamental task for digitizing information, serving as a critical bridge between visual data and textual understanding. While modern Vision-Language Models (VLM) have achieved high accur…

  378. arXiv cs.CV TIER_1 English(EN) · Shangwen Zhu, Han Zhang, Zhantao Yang, Qianyu Peng, Zhao Pu, Huangji Wang, Fan Cheng ·

    Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

    arXiv:2503.09675v3 Announce Type: replace Abstract: Text-based diffusion models have made significant breakthroughs in generating high-quality images and videos from textual descriptions. However, the lengthy sampling time of the denoising process remains a significant bottleneck…

  379. arXiv stat.ML TIER_1 English(EN) · Gabriel Peyr\'e ·

    Optimal and Diffusion Transports in Machine Learning

    arXiv:2512.06797v2 Announce Type: replace-cross Abstract: Several problems in machine learning are naturally expressed as the design and analysis of time-evolving probability distributions. This includes sampling via diffusion methods, optimizing the weights of neural networks, a…

  380. arXiv cs.CV TIER_1 English(EN) · Hongki Lim ·

    Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping

    Diffusion posterior sampling conditions diffusion priors on measurements, but data-consistency updates are typically scaled by hand-tuned guidance weights and can destabilize sampling under stiff, operator-dependent curvature. We replace scalar guidance with a per-noise-level dam…

  381. arXiv stat.ML TIER_1 English(EN) · Hyunmo Kang, Noam Itzhak Levi, Corinna Elena Wegner, Daniel J. Korchinski, Matthieu Wyart ·

    Sampling Data with Chains of Forward-Backward Diffusion Steps

    arXiv:2605.27006v1 Announce Type: cross Abstract: Sampling from learned high-dimensional distributions is a foundational computational problem. We introduce U-turn chains: Markov chains obtained by iterating short forward-backward steps of a diffusion model, in which each step pr…

  382. arXiv stat.ML TIER_1 English(EN) · Yuchen Liang, Ness Shroff, Yingbin Liang ·

    From Scores to Gibbs Correctors: Accelerating Uniform-Rate Discrete Diffusion Models

    arXiv:2605.27352v1 Announce Type: cross Abstract: Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration met…

  383. arXiv cs.CV TIER_1 (AF) · Felix Krause, Stefan Andreas Baumann, Johannes Schusterbauer, Olga Grebenkova, Ming Gui, Vincent Tao Hu, Bj\"orn Ommer ·

    Guiding Token-Sparse Diffusion Models

    arXiv:2601.01608v2 Announce Type: replace Abstract: Diffusion models deliver high quality in image synthesis but remain expensive during training and inference. Recent works have leveraged the inherent redundancy in visual content to make training more affordable by training only…

  384. arXiv cs.CV TIER_1 English(EN) · Junseo Bang, Dong Ju Mun, Hoigi Seo, Seongmin Hong, Se Young Chun ·

    Triadic Dynamics Aware Diffusion Posterior Sampling for Inverse Problems: Optimizing Guidance and Stochasticity Schedules

    arXiv:2605.26470v1 Announce Type: new Abstract: Generative posterior sampling using diffusion models has emerged as a dominant paradigm for solving inverse problems in imaging, which usually consists of three main components: data consistency (DC) guidance, classifier-free guidan…

  385. arXiv stat.ML TIER_1 English(EN) · Yingbin Liang ·

    From Scores to Gibbs Correctors: Accelerating Uniform-Rate Discrete Diffusion Models

    Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration methods either rely on training additional quantities…

  386. arXiv stat.ML TIER_1 English(EN) · Matthieu Wyart ·

    Sampling Data with Chains of Forward-Backward Diffusion Steps

    Sampling from learned high-dimensional distributions is a foundational computational problem. We introduce U-turn chains: Markov chains obtained by iterating short forward-backward steps of a diffusion model, in which each step proposes a move that remains on the learned data man…

  387. arXiv cs.CV TIER_1 English(EN) · Jiacheng Yang, Jun Wu, Yaoyao Ding, Zhiying Xu, Yida Wang, Gennady Pekhimenko ·

    SwiftFusion: Scalable Sequence Parallelism for Distributed Inference of Diffusion Transformers on GPUs

    arXiv:2601.20273v2 Announce Type: replace-cross Abstract: Diffusion Transformers (DiTs) have gained increasing adoption in high-quality image and video generation. As demand for higher-resolution images and longer videos increases, single-GPU inference becomes inefficient due to …

  388. arXiv cs.CV TIER_1 English(EN) · Agata \.Zywot, Iason Skylitsis, Thijmen Nijdam, Zoe Tzifa-Kratira, Derck Prinzhorn, Konrad Szewczyk, Aritra Bhowmik ·

    Injecting Image Guidance into Text-Conditioned Diffusion Models at Inference

    arXiv:2605.25191v1 Announce Type: new Abstract: Text-to-image diffusion models like Stable Diffusion generate high-quality images from text, but lack a way to inject visual guidance (e.g. sketches, styles) at inference without retraining. Existing methods either require computati…

  389. arXiv cs.CV TIER_1 English(EN) · Weimin Bai, Yuxuan Gu, Yifei Wang, Weijian Luo, He Sun ·

    Unbiased Diffusion Variational Inversion via Principled Posterior Matching

    arXiv:2605.25042v1 Announce Type: new Abstract: Existing score-based methods for inverse problems often resort to approximate minimization of the KL divergence between the inversion distribution and the Bayesian posterior. Such an approximation leads to severe mode collapse and u…

  390. arXiv cs.CV TIER_1 English(EN) · Mingyu Liang, Dingkun Xu, Jingwei Xu ·

    Trajectory-Consistent Calibration for Cache-Accelerated Diffusion Models

    arXiv:2605.24870v1 Announce Type: new Abstract: Diffusion Transformers require repeated denoiser evaluations during iterative sampling, making inference computationally expensive. Cache-based acceleration reduces this cost by reusing intermediate representations across denoising …

  391. arXiv cs.CV TIER_1 English(EN) · Bingtian Qiao, Yue Shi, Yingjie Zhou, Yong Guo, Guangtao Zhai, Jiezhang Cao ·

    Efficient One-Step Diffusion Restoration Model with Compact Token Compression and Linear Attention

    arXiv:2605.23451v1 Announce Type: new Abstract: Real-world image super-resolution aims to recover high-quality images from complex and unknown real-world degradations. However, existing generative Real-ISR methods largely inherit the dense latent representations and quadratic-cos…

  392. arXiv stat.ML TIER_1 English(EN) · Xin Guo ·

    Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning

    Diffusion models are increasingly used as powerful conditional generators, yet real deployments often involve multiple target distributions arising from different tasks, e.g., diverse prompt domains in text-to-image generation, or multiple environments in robotics with diffusion …

  393. arXiv stat.ML TIER_1 English(EN) · Pan Xu ·

    Inference-Time Alignment of Diffusion Models via Trust-Region Iterative Twisted Sequential Monte Carlo

    We study inference-time alignment for diffusion-based generative models, aiming to steer a base model toward high-reward outputs without updating its weights. Recent Sequential Monte Carlo (SMC)-based steering methods approximate reward-tilted target distributions in a principled…

  394. arXiv cs.CV TIER_1 English(EN) · Jiezhang Cao ·

    Efficient One-Step Diffusion Restoration Model with Compact Token Compression and Linear Attention

    Real-world image super-resolution aims to recover high-quality images from complex and unknown real-world degradations. However, existing generative Real-ISR methods largely inherit the dense latent representations and quadratic-cost global modeling paradigm developed for high-re…

  395. arXiv cs.CV TIER_1 English(EN) · Le Zhang, Ning Mang, Aishwarya Agrawal ·

    RiT: Vanilla Diffusion Transformers Suffice in Representation Space

    arXiv:2605.21981v1 Announce Type: new Abstract: Flow matching with $x$-prediction -- regressing the clean data point rather than the ambient velocity -- is known to exploit low-dimensional manifold structure effectively in pixel space \cite{li2025back}. We ask whether a pretraine…

  396. arXiv cs.CV TIER_1 English(EN) · Hangyeol Lee, Hyojeong Lee, Joo-Young Kim ·

    Rethinking Token Reduction for Diffusion Models via Output-Similarity-Awareness

    arXiv:2605.22011v1 Announce Type: new Abstract: Diffusion Transformers (DiTs) achieve superior image generation quality but suffer from quadratic computational complexity relative to token count. While various token reduction (TR) methods have been proposed to mitigate this cost,…

  397. arXiv stat.ML TIER_1 English(EN) · Jonathan Lorraine ·

    Variance Reduction for Expectations with Diffusion Teachers

    Pretrained diffusion models serve as frozen teachers feeding downstream pipelines such as text-to-3D, single-step distillation, and data attribution. The teacher gradients these pipelines consume are Monte Carlo (MC) expectations over noise levels and Gaussian noise samples; thei…

  398. arXiv stat.ML TIER_1 English(EN) · Wenpin Tang, Nizar Touzi, Zikun Zhang, Xun Yu Zhou ·

    Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian

    arXiv:2605.19391v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, …

  399. arXiv stat.ML TIER_1 English(EN) · Benjamin Sterling, M\'onica F. Bugallo, Tom Tirer ·

    Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

    arXiv:2605.19170v1 Announce Type: new Abstract: Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training…

  400. arXiv stat.ML TIER_1 English(EN) · Xun Yu Zhou ·

    Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian

    Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, transforming it into a simple prior, and then us…

  401. arXiv stat.ML TIER_1 English(EN) · Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley Osher, Markos A. Katsoulakis ·

    Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

    arXiv:2605.17232v1 Announce Type: cross Abstract: Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses d…

  402. arXiv stat.ML TIER_1 English(EN) · Chenyang Wang, Weizhong Wang, Yinuo Ren, Jose Blanchet, Yiping Lu ·

    Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures

    arXiv:2605.17850v1 Announce Type: new Abstract: iffusion-based generative models increasingly rely on inference-time guidance, adding a drift term or reweighting mixture of experts, to improve sample quality on task-specific objectives. However, most existing techniques require r…

  403. arXiv stat.ML TIER_1 English(EN) · Tom Tirer ·

    Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

    Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training samples-known as "memorization"-potentially vio…

  404. arXiv cs.CV TIER_1 English(EN) · Yiping Lu ·

    Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures

    iffusion-based generative models increasingly rely on inference-time guidance, adding a drift term or reweighting mixture of experts, to improve sample quality on task-specific objectives. However, most existing techniques require repeated score or gradient evaluations, introduci…

  405. arXiv stat.ML TIER_1 English(EN) · Markos A. Katsoulakis ·

    Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

    Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses diverge under singular priors such as the masked di…

  406. arXiv cs.CV TIER_1 English(EN) · Yimao Cai ·

    Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?

    Despite strong image-generation performance, diffusion models' reconstruction objectives limit alignment with human preferences. RL enables such alignment through explicit rewards. However, most studies apply RL to the full denoising trajectory, making it computationally costly a…

  407. arXiv stat.ML TIER_1 English(EN) · Yuzhen Zhao, Jiarong Fan, Yating Liu ·

    Plug-In Classification of Drift Functions in Diffusion Processes Using Neural Networks

    arXiv:2602.02791v2 Announce Type: replace Abstract: We study supervised multiclass classification for diffusion processes, where each class is characterized by a distinct drift function and trajectories are observed at discrete times. We first derive a multidimensional Bayes rule…

  408. arXiv stat.ML TIER_1 English(EN) · Yifeng Yu, Lu Yu ·

    On the Limits of Latent Reuse in Diffusion Models

    arXiv:2605.13448v1 Announce Type: new Abstract: Diffusion models are often trained in low-dimensional latent spaces, which are then reused for related but shifted datasets. In this work, we study when such latent reuse remains reliable under distribution shift. We consider a sour…

  409. arXiv stat.ML TIER_1 Deutsch(DE) · Dario Shariatian, Alain Durmus, Umut Simsekli, Stefano Peluchetti ·

    Latent-Augmented Discrete Diffusion Models

    arXiv:2510.18114v3 Announce Type: replace-cross Abstract: Discrete diffusion models have emerged as a powerful class of models and a promising route to fast language generation, but practical implementations typically rely on factored reverse transitions ignoring cross-token depe…

  410. arXiv stat.ML TIER_1 English(EN) · Ye He, Yitong Qiu, Molei Tao ·

    Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

    arXiv:2602.06021v2 Announce Type: replace Abstract: We study a data-dependent notion of diffusion-model generalization: when a model does not memorize the training set, where do its generated samples go relative to the geometry induced by the data? To answer this, we introduce a …

  411. arXiv cs.CV TIER_1 English(EN) · Zhi Wang ·

    Test-time Sparsity for Extreme Fast Action Diffusion

    Action diffusion excels at high-fidelity action generation but incurs heavy computational costs owing to its iterative denoising nature. Despite current technologies showing promise in accelerating diffusion transformers by reusing the cached features, they struggle to adapt to p…

  412. arXiv stat.ML TIER_1 English(EN) · Tim Salimans ·

    Covariance-aware sampling for Diffusion Models

    We present a covariance-aware sampler that improves the quality of pixel-space Diffusion Model (DM) sampling in the few-step regime. We hypothesize that in the few-step regime samplers fail because they rely solely on the predicted mean of the reverse distribution, while our solu…

  413. arXiv stat.ML TIER_1 English(EN) · Hao Chen, Renzheng Zhang, Scott S. Howard ·

    DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

    arXiv:2511.17038v3 Announce Type: replace-cross Abstract: From a Bayesian perspective, score-based diffusion solves inverse problems through joint inference, embedding the likelihood with the prior to guide the sampling process. However, this formulation fails to explain its prac…

  414. arXiv stat.ML TIER_1 English(EN) · Jing Jia, Liyue Shen, Guanyang Wang ·

    Couple to Control: Joint Initial Noise Design in Diffusion Models

    arXiv:2605.11311v1 Announce Type: cross Abstract: Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specif…

  415. arXiv stat.ML TIER_1 English(EN) · Guillaume Coqueret, Martial Laguerre ·

    Overparametrized models with posterior drift

    arXiv:2506.23619v2 Announce Type: replace-cross Abstract: This paper investigates the impact of posterior drift on out-of-sample forecasting accuracy in overparametrized machine learning models. We document the loss in performance when the loadings of the data generating process …

  416. arXiv cs.CV TIER_1 English(EN) · Edmond S. L. Ho ·

    Generative Motion In-betweening by Diffusion over Continuous Implicit Representations

    Recent advances in generative models have yielded impressive progress on motion in-betweening, allowing for more complex, varied, and realistic motion transitions. However, recent methods still exhibit noticeable limitations in preserving keyframe information and ensuring motion …

  417. arXiv cs.CV TIER_1 English(EN) · Dong-Jun Han ·

    Disentangled Sparse Representations for Concept-Separated Diffusion Unlearning

    Unlearning specific concepts in text-to-image diffusion models has become increasingly important for preventing undesirable content generation. Among prior approaches, sparse autoencoder (SAE)-based methods have attracted attention due to their ability to suppress target concepts…

  418. arXiv cs.CV TIER_1 English(EN) · Konstantin Kulikov ·

    Few-Shot Synthetic Data Generation with Diffusion Models for Downstream Vision Tasks

    Class imbalance is a persistent challenge in visual recognition, particularly in safety-critical domains where collecting positive examples is expensive and rare events are inherently underrepresented. We propose a lightweight synthetic data augmentation pipeline that fine-tunes …

  419. arXiv stat.ML TIER_1 English(EN) · Guanyang Wang ·

    Couple to Control: Joint Initial Noise Design in Diffusion Models

    Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specify a coupling of the initial noises: each noise rem…

  420. arXiv cs.CV TIER_1 English(EN) · Wang Chen ·

    Filtering Memorization from Parameter-Space in Diffusion Models

    Low-Rank Adaptation (LoRA) has become a widely used mechanism for customizing diffusion models, enabling users to inject new visual concepts or styles through lightweight parameter updates. However, LoRAs can memorize training images, causing generated outputs to reproduce copyri…

  421. arXiv stat.ML TIER_1 English(EN) · Simon Bienewald, Lukas Trottner ·

    Statistical Convergence of Spherical First Hitting Diffusion Models

    arXiv:2605.07625v1 Announce Type: cross Abstract: Denoising diffusion models have evolved into a state-of-the-art method for tasks in various fields, such as denoising and generation of images, text generation, or generation of synthetic data for training of other machine learnin…

  422. arXiv stat.ML TIER_1 English(EN) · Dongqing Li, Geoff K. Nicholls, Shiyi Sun, You Luo ·

    A Differentiable Bayesian Relaxation for Latent Partial-Order Inference

    arXiv:2605.06976v1 Announce Type: new Abstract: Many ranking and agent trace datasets are recorded as linear orders even though their latent structure is only partially ordered. This is especially common in agent and workflow traces, where observed order may reflect arbitrary lin…

  423. arXiv stat.ML TIER_1 English(EN) · Qiao Wang ·

    Expectation-Maximization as a Spectrally Governed Relaxation Flow

    arXiv:2605.07818v1 Announce Type: new Abstract: The expectation--maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local con…

  424. arXiv stat.ML TIER_1 English(EN) · Enrico Ventura, Beatrice Achilli, Luca Ambrogioni, Carlo Lucibello ·

    Emergence of Distortions in High-Dimensional Guided Diffusion Models

    arXiv:2602.00716v4 Announce Type: replace Abstract: Classifier-free guidance (CFG) is the de facto standard for conditional sampling in diffusion models, yet it often reduces sample diversity. Using tools from statistical physics, we analyze the emergence of generative distortion…

  425. arXiv stat.ML TIER_1 English(EN) · James Matthew Young, Paula Cordero-Encinar, Sebastian Reich, Andrew Duncan, O. Deniz Akyildiz ·

    Diffusion Path Samplers via Sequential Monte Carlo

    arXiv:2601.21951v2 Announce Type: replace Abstract: We develop diffusion-based samplers for target distributions known up to a normalising constant. To this end, we rely on the well-known diffusion path that smoothly interpolates between a simple base distribution and the target,…

  426. arXiv stat.ML TIER_1 English(EN) · Arnaud Doucet ·

    Metropolis-Adjusted Diffusion Models

    Sampling from score-based diffusion models incurs bias due to both time discretisation and the approximation of the score function. A common strategy for reducing this bias is to apply corrector steps based on the unadjusted Langevin algorithm (ULA) at each noise level within a p…

  427. arXiv cs.CV TIER_1 English(EN) · Kaushik Roy ·

    HEART: Hyperspherical Embedding Alignment via Kent-Representation Traversal in Diffusion Models

    Text-to-image diffusion models can generate visually stunning images, yet, controlling what appears and how it appears, remains surprisingly difficult, especially when operating solely within the constraints of the text-conditioning space. For example, changing a subject or adjus…

  428. arXiv cs.CV TIER_1 English(EN) · Yali Wang ·

    What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

    Tokenizers are a crucial component of latent diffusion models, as they define the latent space in which diffusion models operate. However, existing tokenizers are primarily designed to improve reconstruction fidelity or inherit pretrained representations, leaving unclear what kin…

  429. arXiv stat.ML TIER_1 English(EN) · Qiao Wang ·

    Expectation-Maximization as a Spectrally Governed Relaxation Flow

    The expectation--maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local convergence is governed by the spectrum of the line…

  430. arXiv cs.CV TIER_1 English(EN) · Baoru Huang ·

    SARA: Semantically Adaptive Relational Alignment for Video Diffusion Models

    Recent video diffusion models (VDMs) synthesize visually convincing clips, yet still drop entities, mis-bind attributes, and weaken the interactions specified in the prompt. Representation-alignment objectives such as VideoREPA and MoAlign improve fine-grained text following by d…

  431. arXiv stat.ML TIER_1 English(EN) · Lukas Trottner ·

    Statistical Convergence of Spherical First Hitting Diffusion Models

    Denoising diffusion models have evolved into a state-of-the-art method for tasks in various fields, such as denoising and generation of images, text generation, or generation of synthetic data for training of other machine learning models. First hitting diffusion models (FHDM) ar…

  432. arXiv stat.ML TIER_1 English(EN) · You Luo ·

    A Differentiable Bayesian Relaxation for Latent Partial-Order Inference

    Many ranking and agent trace datasets are recorded as linear orders even though their latent structure is only partially ordered. This is especially common in agent and workflow traces, where observed order may reflect arbitrary linearization rather than true prerequisites. We in…

  433. arXiv cs.CV TIER_1 Français(FR) · Yan Zeng ·

    Continuous Latent Diffusion Language Model

    Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve generation efficiency, scalable representation learn…

  434. arXiv cs.CV TIER_1 English(EN) · Chunhua Shen ·

    MARBLE: Multi-Aspect Reward Balance for Diffusion RL

    Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task, and multiple evaluation criteria need to be optimized simultaneously. Existing practice d…

  435. arXiv stat.ML TIER_1 English(EN) · Stefano Sarao Mannelli ·

    The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

    Real-world datasets are inherently heterogeneous, yet how per-class structural differences and sampling imbalance shape the training dynamics of diffusion models-and potentially exacerbate disparities-remains poorly understood. While models typically transition from an initial ph…

  436. arXiv stat.ML TIER_1 English(EN) · Carola-Bibiane Schönlieb ·

    Expressivity of Bi-Lipschitz Normalizing Flows: A Score-Based Diffusion Perspective

    Many normalizing flow architectures impose regularity constraints, yet their distributional approximation properties are not fully characterized. We study the expressivity of bi-Lipschitz normalizing flows through the lens of score-based diffusion models. For the probability flow…

  437. arXiv cs.CV TIER_1 English(EN) · Bartlomiej Sobieski, Matthew Tivnan, Dawid P{\l}udowski, Micha{\l} Jan W{\l}odarczyk, Pengfei Jin, Przemyslaw Biecek, Quanzheng Li ·

    Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

    arXiv:2605.05026v1 Announce Type: new Abstract: Diffusion models are prone to generating structural hallucinations - samples that match the statistical properties of the training data yet defy underlying structural rules, resulting in anomalies like hands with more than five fing…

  438. arXiv cs.CV TIER_1 English(EN) · Yiran Qiao, Yiren Lu, Yunlai Zhou, Disheng Liu, Linlin Hou, Rui Yang, Yu Yin, Jing Ma ·

    Structured 3D Latents Are Surprisingly Powerful: Unleashing Generalizable Style with 2D Diffusion

    arXiv:2605.04412v1 Announce Type: new Abstract: 3D asset generation plays a pivotal role in fields such as gaming and virtual reality, enabling the rapid synthesis of high-fidelity 3D objects from a single or multiple images. Building on this capability, enabling style-controllab…

  439. arXiv cs.CV TIER_1 Deutsch(DE) · Chen Wei ·

    Taming Outlier Tokens in Diffusion Transformers

    We study outlier tokens in Diffusion Transformers (DiTs) for image generation. Prior work has shown that Vision Transformers (ViTs) can produce a small number of high-norm tokens that attract disproportionate attention while carrying limited local information, but their role in g…

  440. arXiv stat.ML TIER_1 English(EN) · Arnaud Doucet ·

    On the Wasserstein Gradient Flow Interpretation of Drifting Models

    Recently, Deng et al. (2026) proposed Generative Modeling via Drifting (GMD), a novel framework for generative tasks. This note presents an analysis of GMD through the lens of Wasserstein Gradient Flows (WGF), i.e., the path of steepest descent for a functional in the space of pr…

  441. arXiv cs.CV TIER_1 English(EN) · Quanzheng Li ·

    Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

    Diffusion models are prone to generating structural hallucinations - samples that match the statistical properties of the training data yet defy underlying structural rules, resulting in anomalies like hands with more than five fingers. Recent research studied this failure mode f…

  442. arXiv stat.ML TIER_1 English(EN) · Christopher Nemeth ·

    Hypergraph Generation via Structured Stochastic Diffusion

    Hypergraphs model higher-order interactions, but realistic hypergraph generation remains difficult because incidence, hyperedge-size heterogeneity, and overlap structure are not faithfully captured by pairwise reductions. We propose \HEDGE, a generative model defined directly on …

  443. arXiv cs.CV TIER_1 English(EN) · Qichao Wang, Yunhong Lu, Hengyuan Cao, Junyi Zhang, Min Zhang ·

    DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models

    arXiv:2605.03877v1 Announce Type: new Abstract: Dataset distillation enables efficient training by distilling the information of large-scale datasets into significantly smaller synthetic datasets. Diffusion based paradigms have emerged in recent years, offering novel perspectives…

  444. arXiv cs.CV TIER_1 English(EN) · Ruibin Min, Yexin Liu, Aimin Pan, Changsheng Lu, Jiafei Wu, Kelu Yao, Xiaogang Xu, Harry Yang ·

    AHPA: Adaptive Hierarchical Prior Alignment for Diffusion Transformers

    arXiv:2605.03317v1 Announce Type: new Abstract: Representation alignment has recently emerged as an effective paradigm for accelerating Diffusion Transformer training. Despite their success, existing alignment methods typically impose a fixed supervision target or a fixed alignme…

  445. arXiv cs.CV TIER_1 English(EN) · Fangzheng Wu, Brian Summa ·

    SteeringDiffusion: A Bottlenecked Activation Control Interface for Diffusion Models

    arXiv:2605.01653v1 Announce Type: new Abstract: We introduce SteeringDiffusion, a bottlenecked activation-level control interface for diffusion models that exposes a smooth, monotonic, and runtime-adjustable control surface over the content--style trade-off. Our method keeps the …

  446. arXiv stat.ML TIER_1 English(EN) · Kanishka Reddy ·

    Diffusion Operator Geometry of Feedforward Representations

    arXiv:2605.01107v1 Announce Type: cross Abstract: Neural networks transform data through learned representations whose geometry affects separation, contraction, and generalization. Recent work studies this geometry using discrete curvature on neighborhood graphs, suggesting Ricci…

  447. arXiv cs.CV TIER_1 English(EN) · Xun Su, Hiroyuki Kasai ·

    Noise is All You Need: Solving Linear Inverse Problems by Noise Combination Sampling with Diffusion Models

    arXiv:2510.23633v2 Announce Type: replace-cross Abstract: Pretrained diffusion models have demonstrated strong capabilities in zero-shot inverse problem solving by incorporating observation information into the generation process of the diffusion models. However, this presents an…

  448. arXiv cs.CV TIER_1 English(EN) · An Huang, Junggab Son, Zuobin Xiong ·

    Watch Your Step: Information Injection in Diffusion Models via Shadow Timestep Embedding

    arXiv:2605.00935v1 Announce Type: cross Abstract: Diffusion models have become the foundation of modern generative systems, with most research focusing primarily on improving generation efficiency and output quality. The timestep embedding component is a crucial part of the diffu…

  449. arXiv cs.CV TIER_1 English(EN) · Harry Yang ·

    AHPA: Adaptive Hierarchical Prior Alignment for Diffusion Transformers

    Representation alignment has recently emerged as an effective paradigm for accelerating Diffusion Transformer training. Despite their success, existing alignment methods typically impose a fixed supervision target or a fixed alignment granularity throughout the entire denoising t…

  450. arXiv cs.CV TIER_1 English(EN) · Saeed Mohseni-Sehdeh, Walid Saad, Kei Sakaguchi, Tao Yu ·

    Diffusion Models for Solving Inverse Problems via Posterior Sampling with Piecewise Guidance

    arXiv:2507.18654v2 Announce Type: replace-cross Abstract: Diffusion models are powerful tools for sampling from high-dimensional distributions by progressively transforming pure noise into structured data through a denoising process. When equipped with a guidance mechanism, these…

  451. arXiv cs.CV TIER_1 English(EN) · Anne Harrington, A. Sophia Koepke, Shyamgopal Karthik, Trevor Darrell, Alexei A. Efros ·

    It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models

    arXiv:2601.00090v2 Announce Type: replace Abstract: Contemporary text-to-image models exhibit a surprising degree of mode collapse, as can be seen when sampling several images given the same text prompt. Previous work has attempted to address this issue by steering the model usin…

  452. arXiv cs.CV TIER_1 English(EN) · Song Yan, Chenfeng Wang, Wei Zhai, Xinliang Bi, Jian Yang, Yancheng Cai, Yusen Zhang, Yunwei Lan, Tao Zhang, GuanYe Xiong, Min Li, Zheng-Jun Zha ·

    The Determinism of Randomness: Latent Space Degeneracy in Diffusion Model

    arXiv:2511.07756v4 Announce Type: replace Abstract: Diffusion models initialize generation from an isotropic Gaussian latent, yet changing only the random seed can substantially alter prompt faithfulness, composition, and visual quality. We explain this gap by distinguishing the …

  453. arXiv cs.CV TIER_1 Italiano(IT) · Simeon Allmendinger, Domenique Zipperling, Lukas Struppek, Niklas K\"uhl ·

    CollaFuse: Collaborative Diffusion Models

    arXiv:2406.14429v3 Announce Type: replace-cross Abstract: In the landscape of generative artificial intelligence, diffusion-based models have emerged as a promising method for generating synthetic images. However, the application of diffusion models poses numerous challenges, par…

  454. arXiv stat.ML TIER_1 English(EN) · Kanishka Reddy ·

    Diffusion Operator Geometry of Feedforward Representations

    Neural networks transform data through learned representations whose geometry affects separation, contraction, and generalization. Recent work studies this geometry using discrete curvature on neighborhood graphs, suggesting Ricci-flow-like behavior across layers. We develop a sm…

  455. arXiv stat.ML TIER_1 English(EN) · Michael S. Albergo ·

    A unified perspective on fine-tuning and sampling with diffusion and flow models

    We study the problem of training diffusion and flow generative models to sample from target distributions defined by an exponential tilting of a base density; a formulation that subsumes both sampling from unnormalized densities and reward fine-tuning of pre-trained models. This …

  456. arXiv cs.CV TIER_1 Italiano(IT) · Haosen Li, Wenshuo Chen, Lei Wang, Shaofeng Liang, Bowen Tian, Soning Lai, Yutao Yue ·

    Delta Score Matters! Spatial Adaptive Multi Guidance in Diffusion Models

    arXiv:2604.26503v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in synthesizing complex static and temporal visuals, a breakthrough largely driven by Classifier-Free Guidance (CFG). However, despite its pivotal role in aligning generated content …

  457. arXiv cs.CV TIER_1 English(EN) · Zhirong Shen, Rui Huang, Jiacheng Liu, Chang Zou, Peiliang Cai, Shikang Zheng, Zhengyi Shi, Liang Feng, Linfeng Zhang ·

    Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

    arXiv:2604.26365v1 Announce Type: new Abstract: To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping.…

  458. arXiv cs.CV TIER_1 English(EN) · Rui Xu, Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Shiqing Xin, Changhe Tu, Taku Komura, Wenping Wang ·

    ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

    arXiv:2405.13729v3 Announce Type: replace-cross Abstract: In this paper, we study an under-explored but important factor of diffusion generative models, i.e., the combinatorial complexity. Data samples are generally high-dimensional, and for various structured generation tasks, a…

  459. arXiv cs.CV TIER_1 English(EN) · Miguel Espinosa, Eva Gmelich Meijling, Valerio Marsocci, Elliot J. Crowley, Mikolaj Czerkawski ·

    COP-GEN: Latent Diffusion Transformer for Copernicus Earth Observation Data

    arXiv:2603.03239v2 Announce Type: replace Abstract: Earth observation applications increasingly rely on data from multiple sensors, including optical, radar, elevation, and land-cover. Relationships between modalities are fundamental for data integration but are inherently non-in…

  460. arXiv cs.CV TIER_1 English(EN) · Yang Yang, Feifan Meng, Han Fang, Weiming Zhang ·

    ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

    arXiv:2604.26348v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effecti…

  461. arXiv cs.CV TIER_1 Italiano(IT) · Yutao Yue ·

    Delta Score Matters! Spatial Adaptive Multi Guidance in Diffusion Models

    Diffusion models have achieved remarkable success in synthesizing complex static and temporal visuals, a breakthrough largely driven by Classifier-Free Guidance (CFG). However, despite its pivotal role in aligning generated content with textual prompts, standard CFG relies on a g…

  462. arXiv cs.CV TIER_1 English(EN) · Linfeng Zhang ·

    Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

    To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping. We propose L2P (Learnable Linear Predictor), a …

  463. arXiv cs.CV TIER_1 English(EN) · Weiming Zhang ·

    ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

    Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effective for fidelity, may insufficient in terms of su…

  464. arXiv cs.CV TIER_1 English(EN) · Nishit Anand, Manan Suri, Christopher Metzler, Dinesh Manocha, Ramani Duraiswami ·

    Learning Illumination Control in Diffusion Models

    arXiv:2604.24877v1 Announce Type: new Abstract: Controlling illumination in images is essential for photography and visual content creation. While closed-source models have demonstrated impressive illumination control, open-source alternatives either require heavy control inputs …

  465. arXiv cs.CV TIER_1 English(EN) · Liuzhuozheng Li, Zhiyuan Zhan, Shuhong Liu, Dengyang Jiang, Zanyi Wang, Guang Dai, Jingdong Wang, Mengmeng Wang ·

    Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds

    arXiv:2604.25289v1 Announce Type: cross Abstract: Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning lead…

  466. arXiv cs.CV TIER_1 English(EN) · Mengmeng Wang ·

    Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds

    Practically, training diffusion models typically requires explicit time conditioning to guide the network through the denoising sampling process. Especially in deterministic methods like DDIM, the absence of time conditioning leads to significant performance degradation. However,…

  467. arXiv stat.ML TIER_1 English(EN) · Fan Chen, Sinho Chewi, Constantinos Daskalakis, Alexander Rakhlin ·

    High-accuracy sampling for diffusion models and log-concave distributions

    arXiv:2602.01338v2 Announce Type: replace-cross Abstract: We present algorithms for diffusion model sampling which obtain $\delta$-error in $\mathrm{polylog}(1/\delta)$ steps, given access to $\widetilde O(\delta)$-accurate score estimates in $L^2$. This is an exponential improve…

  468. arXiv cs.CV TIER_1 English(EN) · Buddhi Wijenayake, Nichula Wasalathilake, Roshan Godaliyadda, Vijitha Herath, Parakrama Ekanayake, Vishal M. Patel ·

    Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation

    arXiv:2602.04749v3 Announce Type: replace Abstract: Long-tailed class imbalance remains a fundamental obstacle in semantic segmentation of high-resolution remote-sensing imagery, where dominant classes shape learned representations and rare classes are systematically under-segmen…

  469. arXiv cs.CV TIER_1 English(EN) · Zhongjie Duan, Hong Zhang, Yingda Chen ·

    Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion

    arXiv:2604.24351v1 Announce Type: cross Abstract: Controllable diffusion methods have substantially expanded the practical utility of diffusion models, but they are typically developed as isolated, backbone-specific systems with incompatible training pipelines, parameter formats,…

  470. arXiv stat.ML TIER_1 English(EN) · Bingqing Jiang, Difan Zou ·

    On the Memorization of Consistency Distillation for Diffusion Models

    arXiv:2604.23552v1 Announce Type: cross Abstract: Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is sha…

  471. arXiv cs.CV TIER_1 English(EN) · Haosen Li, Wenshuo Chen, Shaofeng Liang, Lei Wang, Kaishen Yuan, Yutao Yue ·

    $Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models

    arXiv:2604.23536v1 Announce Type: new Abstract: Diffusion models have achieved unprecedented success in text-aligned generation, largely driven by Classifier-Free Guidance (CFG). However, standard CFG operates strictly on instantaneous gradients, omitting the intrinsic curvature …

  472. arXiv cs.CV TIER_1 English(EN) · Ramani Duraiswami ·

    Learning Illumination Control in Diffusion Models

    Controlling illumination in images is essential for photography and visual content creation. While closed-source models have demonstrated impressive illumination control, open-source alternatives either require heavy control inputs like depth maps or do not release their data and…

  473. arXiv cs.CV TIER_1 English(EN) · Yingda Chen ·

    Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion

    Controllable diffusion methods have substantially expanded the practical utility of diffusion models, but they are typically developed as isolated, backbone-specific systems with incompatible training pipelines, parameter formats, and runtime hooks. This fragmentation makes it di…

  474. arXiv cs.CV TIER_1 English(EN) · Mingxing Rao, Bowen Qu, Daniel Moyer ·

    Score-based Membership Inference on Diffusion Models

    arXiv:2509.25003v2 Announce Type: replace-cross Abstract: Membership inference attacks (MIAs) against Diffusion Models (DMs) raise pressing privacy concerns by revealing whether a sample was part of the training set. While existing methods typically rely on measuring reconstructi…

  475. arXiv cs.CV TIER_1 English(EN) · Jincheng Ying, Yitao Chen, Li Wenlin, Minghui Xu, Yinhao Xiao ·

    Efficient Diffusion Distillation via Embedding Loss

    arXiv:2604.22379v1 Announce Type: new Abstract: Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limitin…

  476. arXiv stat.ML TIER_1 English(EN) · Difan Zou ·

    On the Memorization of Consistency Distillation for Diffusion Models

    Diffusion models are central to modern generative modeling, and understanding how they balance memorization and generalization is critical for reliable deployment. Recent work has shown that memorization in diffusion models is shaped by training dynamics, with generalization and …

  477. arXiv cs.CV TIER_1 English(EN) · Yinhao Xiao ·

    Efficient Diffusion Distillation via Embedding Loss

    Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limiting accessibility for resource-constrained researc…

  478. arXiv stat.ML TIER_1 English(EN) · Chang Liu ·

    Quotient-Space Diffusion Models

    Diffusion-based generative models have reformed generative AI, and have enabled new capabilities in the science domain, for example, generating 3D structures of molecules. Due to the intrinsic problem structure of certain tasks, there is often a symmetry in the system, which iden…

  479. arXiv stat.ML TIER_1 English(EN) · Houman Owhadi ·

    Adaptive Kernel Selection for Kernelized Diffusion Maps

    Selecting an appropriate kernel is a central challenge in kernel-based spectral methods. In \emph{Kernelized Diffusion Maps} (KDM), the kernel determines the accuracy of the RKHS estimator of a diffusion-type operator and hence the quality and stability of the recovered eigenfunc…

  480. arXiv stat.ML TIER_1 English(EN) · Molei Tao ·

    Efficient Diffusion Models under Nonconvex Equality and Inequality constraints via Landing

    Generative modeling within constrained sets is essential for scientific and engineering applications involving physical, geometric, or safety requirements (e.g., molecular generation, robotics). We present a unified framework for constrained diffusion models on generic nonconvex …

  481. Smol AINews TIER_1 English(EN) ·

    The Last Hurrah of Stable Diffusion?

    **Stability AI** launched **Stable Diffusion 3 Medium** with models ranging from **450M to 8B parameters**, featuring the MMDiT architecture and T5 text encoder for image text rendering. The community has shown mixed reactions following the departure of key researchers like Emad …

  482. Smol AINews TIER_1 English(EN) ·

    Stable Diffusion 3 — Rombach & Esser did it again!

    **Over 2500 new community members joined following Soumith Chintala's shoutout, highlighting growing interest in SOTA LLM-based summarization. The major highlight is the detailed paper release of **Stable Diffusion 3 (SD3)**, showcasing advanced text-in-image control and complex …

  483. 雷峰网 (Leiphone) TIER_1 中文(ZH) ·

    ICRA 2026 | EndoDDC: Diffusion Models Empower Sparse-to-Dense Depth Reconstruction

    <section style="background-color: rgb(246, 249, 251); font-size: 15px; line-height: 2; letter-spacing: 1px; font-style: normal; font-weight: 400; text-align: justify; color: rgb(62, 62, 62);"><section style="text-align: center; margin-top: 10px; margin-bottom: 10px; line-height: …

  484. Together AI blog TIER_1 English(EN) ·

    Chipmunk: Training-Free Acceleration of Diffusion Transformers with Dynamic Column-Sparse Deltas

  485. Hacker News — AI stories ≥50 points TIER_1 English(EN) · benanne ·

    Learning the Integral of a Diffusion Model

  486. Practical AI TIER_1 English(EN) · Practical AI LLC ·

    Stable Diffusion

    <p>The new stable diffusion model is everywhere! Of course you can use this model to quickly and easily create amazing, dream-like images to post on twitter, reddit, discord, etc., but this technology is also poised to be used in very pragmatic ways across industry. In this episo…

  487. MarkTechPost TIER_1 English(EN) · Asif Razzaq ·

    Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup

    <p>Zyphra's latest release shows that an autoregressive MoE model can be converted into a discrete diffusion model with no systematic loss in evaluation performance. ZAYA1-8B-Diffusion-Preview achieves up to 7.7x inference speedup over autoregression by shifting decoding from mem…

  488. r/LocalLLaMA TIER_1 English(EN) · /u/qubridInc ·

    DiffusionGemma under real workloads feels very different from benchmark demos

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1u30dhr/diffusiongemma_under_real_workloads_feels_very/"> <img alt="DiffusionGemma under real workloads feels very different from benchmark demos" src="https://preview.redd.it/zrnom6hrwn6h1.jpeg?width=640&amp;…

  489. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    Efficient and Training-Free Single-Image Diffusion Models https://arxiv.org/abs/2606.04299 # HackerNews # Tech # AI

    Efficient and Training-Free Single-Image Diffusion Models https://arxiv.org/abs/2606.04299 # HackerNews # Tech # AI

  490. r/StableDiffusion TIER_2 Italiano(IT) · /u/recoilme ·

    Simple diffusion, SDXS-2B (new model)

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1u1vbqa/simple_diffusion_sdxs2b_new_model/"> <img alt="Simple diffusion, sdxs-2b (new model)" src="https://external-preview.redd.it/IB3-LJ2W1KFbr3Sea_ZVMjEY0ZRSPzewEj2EDnJBvys.png?width=140&amp;height=75&…

  491. r/StableDiffusion TIER_2 English(EN) · /u/OptimisticPrompt ·

    Benchmarking local Stable Diffusion 1.5 generations on iPhone 17 - only 3 seconds per image

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1tvkyzu/benchmarking_local_stable_diffusion_15/"> <img alt="Benchmarking local Stable Diffusion 1.5 generations on iPhone 17 - only 3 seconds per image" src="https://preview.redd.it/9crj8fwtl15h1.png?widt…

  492. r/StableDiffusion TIER_2 English(EN) · /u/Ok-Cartographer-5471 ·

    Guidance on building 2D image to 3D image Diffusion model [D]

    <!-- SC_OFF --><div class="md"><p>I’m building a pipeline to turn 4-side product photos into professional studio images. I’m currently using SAM 2 for segmentation and an Inpainting pipeline to generate the studio background, but the model keeps hallucinating or degrading the pro…

  493. r/StableDiffusion TIER_2 English(EN) · /u/Elegant-Capital-9133 ·

    Stable Diffusion model recommendations for faster and cleaner outputs in 2026?

    <!-- SC_OFF --><div class="md"><p>I’ve been switching between a few models lately but I still can’t find something that feels both fast and consistently clean in results. Some models look great but slow everything down, while others are fast but lose detail pretty quickly. Even w…

  494. r/StableDiffusion TIER_2 English(EN) · /u/AgeNo5351 ·

    Colored Noise Diffusion Sampling - plug-and-play, inference-time sampler.

    <table> <tr><td> <a href="https://www.reddit.com/r/StableDiffusion/comments/1tray25/colored_noise_diffusion_sampling_plugandplay/"> <img alt="Colored Noise Diffusion Sampling - plug-and-play, inference-time sampler." src="https://preview.redd.it/pnumz6jlc44h1.png?width=140&amp;he…