PulseAugur
实时 14:22:01

Tilde Research launches Aurora optimizer to fix neuron death in Muon

Tilde Research has introduced Aurora, a novel optimizer designed to train neural networks more effectively. Aurora addresses a critical issue in the popular Muon optimizer where a significant number of neurons become permanently inactive during training. The new optimizer, demonstrated with a 1.1B parameter pretraining experiment, achieves state-of-the-art performance on the modded-nanoGPT speedrun benchmark and has its code released publicly. AI

影响 Fixes a critical flaw in a widely-used optimizer, potentially improving training efficiency and model performance for large-scale models.

排序理由 The cluster describes the release of a new optimizer for neural network training, including experimental results and open-source code.

在 MarkTechPost 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Tilde Research launches Aurora optimizer to fix neuron death in Muon

报道来源 [2]

  1. MarkTechPost TIER_1 English(EN) · Asif Razzaq ·

    Tilde Research 推出 Aurora:一种解决 Muon 中隐藏神经元死亡问题的、感知杠杆的优化器

    <p>Researchers at Tilde Research have released Aurora, a new optimizer for training neural networks that addresses a structural flaw in the widely-used Muon optimizer. The flaw quietly kills off a significant fraction of MLP neurons during training and keeps them permanently dead…

  2. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    Tilde Research 发布了 Aurora,一款用于训练神经网络的新优化器,修复了广泛使用的 Muon 优化器中一个隐藏的缺陷。该缺陷导致了 s

    Tilde Research has released Aurora, a new optimiser for training neural networks that fixes a hidden flaw in the widely-used Muon optimiser. The flaw causes a significant fraction of MLP neurons to die permanently during training. Aurora comes with a 1.1B parameter pretraining ex…