A new research paper proposes that memorization in diffusion models is not a global property but is instead governed by local data coverage. The study, which connects diffusion models to kernel density estimation, derives a theoretical criterion predicting memorization based on the density of training data in a model's neighborhood and the overall dataset size. This framework suggests that regions with low data coverage are prone to memorization, while dense regions facilitate generalization, a phenomenon validated empirically. AI
IMPACT Provides a local view of memorization in diffusion models, explaining occurrence based on data geometry.
RANK_REASON The cluster contains a research paper published on arXiv detailing theoretical and empirical findings about diffusion models.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →