PulseAugur
LIVE 00:18:14
tool · [1 source] ·

New mechanism explains one-third scaling in online classification

Researchers have identified a boundary-layer mechanism that explains a one-third scaling in online softmax classification. This mechanism shows that only examples near the teacher's decision boundaries contribute significantly to learning at later stages. The study predicts a power-law learning curve of \(\\alpha^{-1/3}\\) for test loss and generalization error, which is slower than the Bayes-optimal reference. They also suggest that learning-rate schedules can improve generalization error towards a \(\\alpha^{-1/2}\\) power law. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Identifies a theoretical limitation in current classification methods and suggests potential improvements through learning-rate adjustments.

RANK_REASON Academic paper detailing a new theoretical mechanism for classification scaling. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 · Marcel K\"uhn, Yoon Thelge, Bernd Rosenow ·

    A Boundary-Layer Mechanism for One-Third Scaling in Online Softmax Classification

    arXiv:2605.22341v1 Announce Type: new Abstract: Hard-label classification is usually trained with smooth surrogate losses, most prominently softmax cross-entropy. We isolate an asymptotic mechanism by which this mismatch between smooth surrogate and discrete labels produces power…