PulseAugur
实时 02:30:31

新的AI方法应对持续学习和函数梯度下降

研究人员提出了持续学习AI模型的新方法,解决了在不损害现有能力的情况下整合新知识的挑战。一种方法KeepLoRA++利用层缩放残差梯度自适应来平衡视觉-语言模型中的知识保留和可塑性。另一种方法ReGrad将梯度视为可检索单元,从而无需累积权重漂移即可进行参数化知识注入。此外,一种新的函数梯度下降算法可以自适应函数梯度表示,提供改进的收敛保证和性能。 AI

影响 这些论文探讨了模型自适应和学习的先进技术,有可能提高各种AI应用的效率和性能。

排序理由 多篇arXiv论文介绍了AI领域的新研究方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 9 个来源。 我们如何撰写摘要 →

新的AI方法应对持续学习和函数梯度下降

报道来源 [9]

  1. arXiv cs.LG TIER_1 English(EN) · Jean-Fran\c{c}ois Aujol, J\'er\'emie Bigot, Camille Castera ·

    无下降的随机自适应梯度下降

    arXiv:2509.14969v2 Announce Type: replace Abstract: We introduce a new adaptive step-size strategy for convex optimization with stochastic gradient that exploits the local geometry of the objective function only by means of a first-order stochastic oracle and without any hyper-pa…

  2. arXiv cs.AI TIER_1 English(EN) · Weihang Su, Jiacheng Kang, Jingyan Xu, Qingyao Ai, Jianming Long, Hanwen Zhang, Bangde Du, Xinyuan Cao, Min Zhang, Yiqun Liu ·

    可检索梯度:无需累积权重漂移的持续后训练

    arXiv:2606.15734v1 Announce Type: cross Abstract: Continual post-training enables models to absorb emerging knowledge after deployment, but repeatedly updating shared parameters can accumulate weight drift, potentially causing catastrophic forgetting and degrading general capabil…

  3. arXiv cs.AI TIER_1 English(EN) · Yudou Tian, Neeraj Mohan Sushma, Harshvardhan Mestha, Nicolo Colombo, David Kappel, Anand Subramoney ·

    循环状态下的学习:带线性循环网络的梯度下降

    arXiv:2410.11687v3 Announce Type: replace-cross Abstract: Linear recurrent networks (LRNNs) offer linear-time sequence modeling, but standard recurrent updates do not directly expose the supervised products needed for in-context gradient descent. We propose a sufficient construct…

  4. arXiv cs.LG TIER_1 English(EN) · Mao-Lin Luo, Yi-Lin Zhang, Zi-Hao Zhou, Yankun Hong, Xialiang Tong, Mingxuan Yuan, Tong Wei, Min-Ling Zhang ·

    KeepLoRA++:具有层缩放残差梯度自适应的持续学习

    arXiv:2606.16256v1 Announce Type: cross Abstract: Continual learning for pre-trained vision-language models requires balancing three competing objectives: retaining pre-trained knowledge, preserving knowledge from a sequence of learned tasks, and maintaining the plasticity to acq…

  5. arXiv cs.LG TIER_1 English(EN) · Daniel Csillag, Rodrigo Schuller, Pedro Dall'Antonia, Leonidas Guibas, Luiz Velho, Tiago Novello ·

    Functional Gradient Descent with Adaptive Representations

    arXiv:2606.16926v1 Announce Type: cross Abstract: Functional optimization problems are typically solved by optimizing the parameters of a fixed representation, such as a neural network, resulting in highly nonconvex losses that complicate both training and theoretical analysis. A…

  6. arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Yiqun Liu ·

    Retrievable Gradients: Continual Post-Training Without Cumulative Weight Drift

    Continual post-training enables models to absorb emerging knowledge after deployment, but repeatedly updating shared parameters can accumulate weight drift, potentially causing catastrophic forgetting and degrading general capabilities. Retrieval-augmented generation avoids such …

  7. arXiv stat.ML TIER_1 English(EN) · Hengzhi He, Shirong Xu, Guang Cheng ·

    无崩溃的递归学习:一种基于权重的稳定化框架

    arXiv:2502.18049v5 Announce Type: replace Abstract: Recent studies identified an intriguing phenomenon in recursive generative model training known as model collapse, where models trained on data generated by previous models exhibit severe performance degradation. Addressing this…

  8. arXiv stat.ML TIER_1 English(EN) · Tiago Novello ·

    具有自适应表示的函数梯度下降

    Functional optimization problems are typically solved by optimizing the parameters of a fixed representation, such as a neural network, resulting in highly nonconvex losses that complicate both training and theoretical analysis. An interesting alternative is functional gradient d…

  9. arXiv cs.CV TIER_1 English(EN) · Min-Ling Zhang ·

    KeepLoRA++:具有层缩放残差梯度自适应的持续学习

    Continual learning for pre-trained vision-language models requires balancing three competing objectives: retaining pre-trained knowledge, preserving knowledge from a sequence of learned tasks, and maintaining the plasticity to acquire new knowledge. This paper presents KeepLoRA++…