Brief · PulseAugur

RESEARCH · arXiv cs.LG English(EN) · 3w · [2 sources]

A lift for input-convex neural network training

Researchers have introduced a novel training technique called "the lift" for input-convex neural networks (ICNNs), which are crucial for tasks like density estimation and Bayesian inference. Traditional methods struggle with the non-negativity constraint on inter-layer weights, leading to stalled training. The proposed "lift" method uses an unconstrained hypernetwork to generate these weights, introducing stochasticity that smooths the loss landscape and enables deeper convergence. This approach has demonstrated superior performance over existing methods on various benchmarks, including image-flavored latents and high-dimensional tabular data. AI

IMPACT This new training technique could improve performance and convergence for specific types of neural networks used in complex inference tasks.

projected gradient descent
softplus reparametrization
Input-convex neural networks