PulseAugur
EN
LIVE 09:06:42

New framework combats catastrophic forgetting in MLLMs

Researchers have introduced Curvature-Guided Mixing (CGM), a new framework designed to improve the adaptation of Multimodal Large Language Models (MLLMs). This method addresses the issue of catastrophic forgetting, where fine-tuning on specific tasks degrades general capabilities. CGM utilizes a second-order approximation of loss landscapes to analytically determine an optimal mixing ratio for blending pre-trained and fine-tuned models, based on the curvature of their respective loss landscapes. An enhanced variant, CGM$ exttt{ ext{ extdagger}}$, offers robust parameter selection guided by a novel curvature-aware score. Experiments on LLaVA-1.5 and Qwen2.5VL demonstrated that CGM consistently enhances the balance between task specialization and general knowledge retention compared to existing techniques. AI

IMPACT This research offers a novel approach to mitigate catastrophic forgetting in MLLMs, potentially improving their versatility and performance on specialized tasks without sacrificing general knowledge.

RANK_REASON The cluster contains an academic paper detailing a new method for adapting MLLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New framework combats catastrophic forgetting in MLLMs

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Jinglong Yang, Jiaxuan He, Wenjian Huang, Zhan Zhuang, Jianguo Zhang ·

    Curvature-Guided Mixing for MLLM Adaptation

    arXiv:2606.24963v1 Announce Type: cross Abstract: Fine-tuning Multimodal Large Language Models (MLLMs) on specialized tasks often leads to catastrophic forgetting of their general capabilities. Existing model merging methods to combat this are often heuristic or use sub-optimal o…