PulseAugur
EN
LIVE 13:08:51

KromHC improves neural network training with Kronecker products

Researchers have introduced KromHC, a novel method for improving neural network training stability and scalability. KromHC addresses limitations in existing Hyper-Connections (HC) by using Kronecker products of smaller doubly stochastic matrices to parameterize the residual matrix. This approach guarantees exact double stochasticity while significantly reducing the number of trainable parameters compared to previous methods. Experiments demonstrate that KromHC performs comparably to or better than state-of-the-art variants with substantially fewer parameters. AI

IMPACT Introduces a more efficient and stable method for training neural networks, potentially improving performance and reducing computational costs.

RANK_REASON This is a research paper detailing a new method for neural network parameterization. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Wuyang Zhou, Yuxuan Gu, Giorgos Iacovides, Danilo Mandic ·

    KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices

    arXiv:2601.21579v2 Announce Type: replace Abstract: The success of Hyper-Connections (HC) in neural networks (NN) has also highlighted issues related to training instability and restricted scalability. The Manifold-Constrained Hyper-Connections (mHC) mitigate these challenges by …