PulseAugur
EN
LIVE 12:49:08

New SOAP-Bubbles method enhances neural network uncertainty estimation

Researchers have introduced SOAP-Bubbles, a novel method for estimating structured weight uncertainty in neural networks. This approach adapts the SOAP optimizer by running a variational method called IVON within the eigenspace of SOAP's preconditioner. The preconditioner then transforms the diagonal estimate into a non-diagonal covariance, resulting in a method with costs comparable to SOAP and minimal disruption to existing training pipelines. Experiments show that SOAP-Bubbles (and its associated optimizer, Eigenspace-VON or EVON) can recover exact Gaussian covariance for logistic regression and achieve superior results in language model pretraining compared to current diagonal-covariance techniques. AI

IMPACT Enhances the estimation of expressive posterior distributions for deep learning at scale, potentially improving model performance and reliability.

RANK_REASON This is a research paper detailing a new method for neural networks published on arXiv. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New SOAP-Bubbles method enhances neural network uncertainty estimation

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Thomas Möllenhoff ·

    SOAP-Bubbles: Structured Weight Uncertainty for Neural Networks

    Structured weight-uncertainty can improve many aspects of deep learning, but it remains costly to estimate and difficult to implement. Here, we show that these issues can be addressed by adapting the SOAP optimizer. Our key idea is to run IVON, an existing diagonal-covariance var…