Grokking in ML requires breaking data symmetry for generalization

By PulseAugur Editorial · [1 sources] · 2026-06-25 04:00

Researchers have investigated the phenomenon of grokking in machine learning, where a model achieves high training accuracy but only generalizes to new data much later. Their study, using the Recursive Feature Machine (RFM) algorithm on algebraic tasks, found that generalization is contingent upon breaking specific symmetries within the training dataset. The RFM algorithm appears to achieve this by recovering underlying invariance group actions inherent in the data, with learned feature matrices encoding elements of this invariance group, thus explaining the link between symmetry and generalization. AI

IMPACT Understanding grokking could lead to more robust and generalizable AI models by identifying key data properties.

RANK_REASON The cluster contains an academic paper detailing a new finding in machine learning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv stat.ML →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Grokking in ML requires breaking data symmetry for generalization

COVERAGE [1]

arXiv stat.ML TIER_1 English(EN) · Marcel Tom\`as Bernal, Neil Rohit Mallinar, Mikhail Belkin · 2026-06-25 04:00

Breaking Data Symmetry is Needed For Generalization in Feature Learning Kernels

arXiv:2604.00316v2 Announce Type: replace Abstract: Grokking occurs when a model achieves high training accuracy but generalization to unseen test points happens long after that. This phenomenon was initially observed on a class of algebraic problems, such as learning modular ari…

COVERAGE [1]

Breaking Data Symmetry is Needed For Generalization in Feature Learning Kernels

RELATED ENTITIES

RELATED TOPICS