PulseAugur
实时 18:35:28
English(EN) Introducing Nested Learning: A new ML paradigm for continual learning

AI持续学习研究应对灾难性遗忘

研究人员正在探索AI持续学习的新方法,旨在克服“灾难性遗忘”的挑战,即模型在学习新技能时会丢失先前学到的信息。Google Research推出了“嵌套学习”,一种将模型视为相互关联的优化问题的范式,以缓解此问题。其他研究侧重于高效方法,如CIRCLE,它使用固定的存储库特征,以及用于LLM的SAE引导激活正则化,它在激活空间而非权重空间中运行。此外,正在开发新的指标来更好地表征遗忘,并提出了CoVON等新型优化器来平衡持续学习系统的稳定性和可塑性。调查结果表明,AI安全研究人员在广泛的持续学习代理的未来时间表和风险方面缺乏共识。 AI

影响 持续学习的进步可以使AI代理更具适应性和持久性,这对于长期任务和复杂环境至关重要。

排序理由 多篇研究论文介绍了AI持续学习的新方法和指标。

在 Google AI / Research 阅读 →

AI 生成摘要 · Google Gemini · 来自 11 个来源。 我们如何撰写摘要 →

AI持续学习研究应对灾难性遗忘

报道来源 [11]

  1. Google AI / Research TIER_1 English(EN) ·

    Introducing Nested Learning: A new ML paradigm for continual learning

    Algorithms & Theory

  2. arXiv cs.AI TIER_1 English(EN) · Augustinas Ju\v{c}as, Yangchen Pan ·

    Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning

    arXiv:2606.27095v1 Announce Type: cross Abstract: Cold-start exemplar-free class-incremental learning requires learning a growing set of classes without replay, external pretraining, or a large initial task. Existing cold-start methods typically either train the backbone througho…

  3. arXiv cs.CL TIER_1 English(EN) · Evan Ning, Wei Xue, Dong Lou, Yike Guo ·

    From Weights to Features: SAE-Guided Activation Regularization for LLM Continual Learning

    arXiv:2606.26629v1 Announce Type: cross Abstract: Weight-space regularization methods such as Elastic Weight Consolidation (EWC) are the standard approach to catastrophic forgetting in continual learning. However, those methods tend to underperform when applied to large language …

  4. arXiv cs.LG TIER_1 English(EN) · Yangchen Pan ·

    Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning

    Cold-start exemplar-free class-incremental learning requires learning a growing set of classes without replay, external pretraining, or a large initial task. Existing cold-start methods typically either train the backbone throughout the stream and compensate for semantic drift, o…

  5. arXiv cs.LG TIER_1 English(EN) · Yike Guo ·

    从权重到特征:SAE引导的激活正则化用于LLM持续学习

    Weight-space regularization methods such as Elastic Weight Consolidation (EWC) are the standard approach to catastrophic forgetting in continual learning. However, those methods tend to underperform when applied to large language models. We argue that such underperformance can be…

  6. arXiv cs.LG TIER_1 English(EN) · Xin Li ·

    The Urysohn Ladder: Recursive Metric Contraction for Scalable Continual Learning

    arXiv:2512.18471v2 Announce Type: replace Abstract: Continual learning systems face a fundamental geometric obstacle: as experience accumulates on a fixed-capacity manifold, covering numbers grow linearly with time, eventually forcing representational overlap and catastrophic int…

  7. arXiv cs.LG TIER_1 English(EN) · Ahmed Anwar, Andreas Wagner, Federico Raue, Tobias Nauen, Andreas Dengel ·

    The Gentle Collapse: Distributional Metrics for Continual Learning

    arXiv:2606.25165v1 Announce Type: new Abstract: Accuracy degradation is the standard metric for Catastrophic Forgetting (CF), however, it records only whether forgetting occurred or not. It saturates at the extremes and collapses discretely at task boundaries, hiding the internal…

  8. arXiv cs.AI TIER_1 English(EN) · Subarnaduti Paul, Yohan Jung, Mohammad Emtiyaz Khan, Siddharth Swaroop, Thomas M\"ollenhoff, Martin Mundt ·

    Fast and Slow Variational Continual Learning

    arXiv:2606.24007v1 Announce Type: cross Abstract: Continual learning remains a major challenge for modern deep networks, partly because commonly used optimizers lack inherent mechanisms for continual adaptation. One such natural mechanism is fast and slow adaptation to balance st…

  9. arXiv cs.LG TIER_1 English(EN) · Martin Mundt ·

    Fast and Slow Variational Continual Learning

    Continual learning remains a major challenge for modern deep networks, partly because commonly used optimizers lack inherent mechanisms for continual adaptation. One such natural mechanism is fast and slow adaptation to balance stability and plasticity. This mechanism has deep ro…

  10. LessWrong (AI tag) TIER_1 English(EN) · Rauno Arike ·

    Perspectives on Continual Learning: Survey Results and Forecasts

    <p><i><span>This is the fifth post in the sequence </span></i><a href="https://www.lesswrong.com/s/oc5Auteiibo56kNXw"><i><span>Implications of Continual Learning for LLM Agents</span></i></a><i><span>.</span></i></p><h1><span>Summary</span></h1><p><span>While writing our continua…

  11. r/MachineLearning TIER_1 English(EN) · /u/fourwheels2512 ·

    Live Continual Learning in Machine Learning [D]

    <!-- SC_OFF --><div class="md"><p>My question on live continual learning use cases was removed by moderators here because they think i asked basic level question about live continual learning which i thought is a frontier level research. But anyways. Is anyone interested in talki…