Learning Through Noise: Why Subliminal Learning Works and When It Fails
Researchers have demonstrated that subliminal learning in neural networks, where knowledge is transferred via task-unrelated data, is primarily governed by compatible output heads rather than shared model initialization. By splitting outputs into auxiliary and class heads, they showed that compatible auxiliary heads facilitate the transfer of teacher signals, improving student model representations. This mechanism allows students trained on noise to achieve performance comparable to teachers, providing a theoretically grounded understanding of subliminal learning and its limitations. AI
IMPACT Explains a novel mechanism for knowledge transfer in neural networks, potentially improving training efficiency and model performance.