PulseAugur
EN
LIVE 12:31:22

Thesis Explores Bayesian Principles for Deep Learning Uncertainty and Generalization

This thesis explores the application of Bayesian principles to enhance the understanding of modern deep learning systems, focusing on generalization and uncertainty quantification. It introduces the Deep Variational Implicit Process (DVIP) as a scalable Bayesian framework for deep architectures and proposes post-hoc methods like Variational Linearized Laplace Approximation (VaLLA) and Fixed-Mean Gaussian Process (FMGP) to add calibrated uncertainty estimates to pre-trained networks. Theoretically, it connects diversity, smoothness, and stochasticity within a probabilistic framework using PAC-Bayesian and large-deviation theory to explain the generalization capabilities of over-parameterized neural networks. AI

IMPACT Introduces novel methods for uncertainty estimation and theoretical frameworks for understanding deep learning generalization.

RANK_REASON The cluster contains an academic paper detailing new methodologies and theoretical contributions in machine learning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Luis A. Ortega ·

    Uncertainty Estimation and Generalization Bounds for Modern Deep Learning

    arXiv:2606.13818v1 Announce Type: new Abstract: This thesis investigates how Bayesian principles can deepen our understanding of modern deep learning systems. While neural networks achieve remarkable predictive performance, their ability to generalize and to quantify uncertainty …