New FINCH method cuts LLM forgetting by 93%

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed a new method called FINCH to address catastrophic forgetting during the fine-tuning of large language models. FINCH employs a loss-adaptive learning rate schedule that decreases the learning rate for high-loss batches and increases it as the model converges. This approach effectively reduces forgetting by an average of 93% across various benchmarks while maintaining task performance. FINCH also shows improvements in preserving model calibration and confidence. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT FINCH significantly reduces catastrophic forgetting in LLMs, potentially enabling more effective and stable fine-tuning for specialized tasks.

RANK_REASON The cluster contains an academic paper detailing a new method for fine-tuning language models. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

COVERAGE [1]

arXiv cs.LG TIER_1 · Babak Salimi · 2026-05-19 15:36

Fine-Tuning Without Forgetting via Loss-Adaptive Learning Rates

Fine-tuning large language models on new data improves task performance but degrades capabilities learned during pretraining, a phenomenon known as catastrophic forgetting. Existing methods mitigate this by modifying the fine-tuning objective to suppress high-loss tokens or seque…

COVERAGE [1]

Fine-Tuning Without Forgetting via Loss-Adaptive Learning Rates

RELATED ENTITIES

RELATED TOPICS