Two new research papers explore advancements in continual learning for large language models. The first paper introduces a multi-stage framework for detecting reclaimed slurs in multilingual social media, utilizing XLM-RoBERTa as a foundation model and employing data augmentation and language-specific threshold optimization for improved accuracy. The second paper, named Octopus, proposes a history-free gradient orthogonalization method to enable multimodal large language models to acquire new knowledge sequentially without catastrophic forgetting, achieving state-of-the-art performance on the UCIT benchmark. AI
影响 Advances in continual learning for LLMs could lead to more adaptable and efficient models that can learn new information without forgetting previous knowledge.
排序理由 Two academic papers published on arXiv detailing new methodologies for continual learning in LLMs.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →