New IGLU activation function offers improved gradient flow

By PulseAugur Editorial · [1 sources] · 2026-06-16 04:00

Researchers have introduced IGLU, a novel parametric activation function for deep neural networks designed to improve gradient flow and optimization stability. Derived from a mixture of GELU gates under a half-normal distribution, IGLU offers a continuous interpolation between identity-like and ReLU-like behavior through a single parameter. Its heavy-tailed Cauchy gate ensures non-zero gradients for all finite inputs, enhancing robustness against vanishing gradients. An efficient approximation, IGLU-Approx, utilizes only ReLU operations, reducing computational cost while maintaining competitive performance across vision and language datasets. AI

IMPACT Introduces a new activation function that may improve training stability and performance in deep learning models.

RANK_REASON Academic paper introducing a new activation function for neural networks. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Mingi Kang, Zai Yang, Jeova Farias Sales Rocha Neto · 2026-06-16 04:00

IGLU: The Integrated Gaussian Linear Unit Activation Function

arXiv:2603.06861v2 Announce Type: replace Abstract: Activation functions are fundamental to deep neural networks, governing gradient flow, optimization stability, and representational capacity. Within historic deep architectures, while ReLU has been the dominant choice for the ac…

COVERAGE [1]

IGLU: The Integrated Gaussian Linear Unit Activation Function

RELATED ENTITIES

RELATED TOPICS