PulseAugur
实时 22:40:20

New research tackles continual learning in multilingual and multimodal LLMs

Two new research papers explore advancements in continual learning for large language models. The first paper introduces a multi-stage framework for detecting reclaimed slurs in multilingual social media, utilizing XLM-RoBERTa as a foundation model and employing data augmentation and language-specific threshold optimization for improved accuracy. The second paper, named Octopus, proposes a history-free gradient orthogonalization method to enable multimodal large language models to acquire new knowledge sequentially without catastrophic forgetting, achieving state-of-the-art performance on the UCIT benchmark. AI

影响 Advances in continual learning for LLMs could lead to more adaptable and efficient models that can learn new information without forgetting previous knowledge.

排序理由 Two academic papers published on arXiv detailing new methodologies for continual learning in LLMs.

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

New research tackles continual learning in multilingual and multimodal LLMs

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Juuso Eronen ·

    Continual Learning with Multilingual Foundation Model

    This paper presents a multi-stage framework for detecting reclaimed slurs in multilingual social media discourse. It addresses the challenge of identifying reclamatory versus non-reclamatory usage of LGBTQ+-related slurs across English, Spanish, and Italian tweets. The framework …

  2. arXiv cs.CV TIER_1 English(EN) · Chao Ma ·

    Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

    Continual learning in multimodal large language models (MLLMs) aims to sequentially acquire knowledge while mitigating catastrophic forgetting, yet existing methods face inherent limitations: architecture-based approaches incur additional computational overhead and often generali…