New AI methods tackle continual learning and functional gradient descent

By PulseAugur Editorial · [9 sources] · 2026-06-14 10:31

Researchers have introduced novel methods for continual learning in AI models, addressing the challenge of integrating new knowledge without degrading existing capabilities. One approach, KeepLoRA++, utilizes layer-scaled residual gradient adaptation to balance knowledge retention and plasticity in vision-language models. Another method, ReGrad, treats gradients as retrievable units, enabling parametric knowledge injection without cumulative weight drift. Additionally, a new algorithm for functional gradient descent adapts the representation of functional gradients, offering improved convergence guarantees and performance. AI

IMPACT These papers explore advanced techniques for model adaptation and learning, potentially improving efficiency and performance in various AI applications.

RANK_REASON Multiple arXiv papers introducing novel research methods in AI.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 9 sources. How we write summaries →

New AI methods tackle continual learning and functional gradient descent

COVERAGE [9]

arXiv cs.LG TIER_1 English(EN) · Jean-Fran\c{c}ois Aujol, J\'er\'emie Bigot, Camille Castera · 2026-06-18 04:00

Stochastic Adaptive Gradient Descent Without Descent

arXiv:2509.14969v2 Announce Type: replace Abstract: We introduce a new adaptive step-size strategy for convex optimization with stochastic gradient that exploits the local geometry of the objective function only by means of a first-order stochastic oracle and without any hyper-pa…
arXiv cs.AI TIER_1 English(EN) · Weihang Su, Jiacheng Kang, Jingyan Xu, Qingyao Ai, Jianming Long, Hanwen Zhang, Bangde Du, Xinyuan Cao, Min Zhang, Yiqun Liu · 2026-06-16 04:00

Retrievable Gradients: Continual Post-Training Without Cumulative Weight Drift

arXiv:2606.15734v1 Announce Type: cross Abstract: Continual post-training enables models to absorb emerging knowledge after deployment, but repeatedly updating shared parameters can accumulate weight drift, potentially causing catastrophic forgetting and degrading general capabil…
arXiv cs.AI TIER_1 English(EN) · Yudou Tian, Neeraj Mohan Sushma, Harshvardhan Mestha, Nicolo Colombo, David Kappel, Anand Subramoney · 2026-06-16 04:00

Learning in the Recurrent State: Gradient Descent with Linear Recurrent Networks

arXiv:2410.11687v3 Announce Type: replace-cross Abstract: Linear recurrent networks (LRNNs) offer linear-time sequence modeling, but standard recurrent updates do not directly expose the supervised products needed for in-context gradient descent. We propose a sufficient construct…
arXiv cs.LG TIER_1 English(EN) · Mao-Lin Luo, Yi-Lin Zhang, Zi-Hao Zhou, Yankun Hong, Xialiang Tong, Mingxuan Yuan, Tong Wei, Min-Ling Zhang · 2026-06-16 04:00

KeepLoRA++: Continual Learning with Layer-Scaled Residual Gradient Adaptation

arXiv:2606.16256v1 Announce Type: cross Abstract: Continual learning for pre-trained vision-language models requires balancing three competing objectives: retaining pre-trained knowledge, preserving knowledge from a sequence of learned tasks, and maintaining the plasticity to acq…
arXiv cs.LG TIER_1 English(EN) · Daniel Csillag, Rodrigo Schuller, Pedro Dall'Antonia, Leonidas Guibas, Luiz Velho, Tiago Novello · 2026-06-16 04:00

Functional Gradient Descent with Adaptive Representations

arXiv:2606.16926v1 Announce Type: cross Abstract: Functional optimization problems are typically solved by optimizing the parameters of a fixed representation, such as a neural network, resulting in highly nonconvex losses that complicate both training and theoretical analysis. A…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Yiqun Liu · 2026-06-14 10:31

Retrievable Gradients: Continual Post-Training Without Cumulative Weight Drift

Continual post-training enables models to absorb emerging knowledge after deployment, but repeatedly updating shared parameters can accumulate weight drift, potentially causing catastrophic forgetting and degrading general capabilities. Retrieval-augmented generation avoids such …
arXiv stat.ML TIER_1 English(EN) · Hengzhi He, Shirong Xu, Guang Cheng · 2026-06-17 04:00

Recursive Learning Without Collapse: A Weighting-Based Stabilization Framework

arXiv:2502.18049v5 Announce Type: replace Abstract: Recent studies identified an intriguing phenomenon in recursive generative model training known as model collapse, where models trained on data generated by previous models exhibit severe performance degradation. Addressing this…
arXiv stat.ML TIER_1 English(EN) · Tiago Novello · 2026-06-15 16:27

Functional Gradient Descent with Adaptive Representations

Functional optimization problems are typically solved by optimizing the parameters of a fixed representation, such as a neural network, resulting in highly nonconvex losses that complicate both training and theoretical analysis. An interesting alternative is functional gradient d…
arXiv cs.CV TIER_1 English(EN) · Min-Ling Zhang · 2026-06-15 05:57

KeepLoRA++: Continual Learning with Layer-Scaled Residual Gradient Adaptation

Continual learning for pre-trained vision-language models requires balancing three competing objectives: retaining pre-trained knowledge, preserving knowledge from a sequence of learned tasks, and maintaining the plasticity to acquire new knowledge. This paper presents KeepLoRA++…

COVERAGE [9]

RELATED ENTITIES

RELATED TOPICS