GeoStack 框架实现高效的 VLM 知识组合，防止灾难性遗忘。

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-07 16:01

研究人员开发了 GeoStack，一个旨在增强视觉语言模型 (VLM) 知识组合的新框架。该方法解决了灾难性遗忘问题，即模型在训练新任务或领域时会丢失先前获得的知识。GeoStack 利用几何和结构约束来集成独立训练的领域专家，而不会损害基础模型的 foundational 知识。一项关键创新是权重折叠属性，它能够实现恒定时间推理，无论添加了多少专家。 AI

影响该框架可以实现 VLM 中更高效、更鲁棒的长期知识积累，有可能降低再训练成本并提高模型适应性。

排序理由该集群包含一篇详细介绍 VLM 新框架的学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Pranav Mantini, Shishir K. Shah · 2026-05-08 04:00

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

arXiv:2605.06477v1 Announce Type: new Abstract: We address the challenge of knowledge composition in Vision-Language Models (VLMs), where accumulating expertise across multiple domains or tasks typically leads to catastrophic forgetting. We introduce GeoStack (Geometric Stacking)…
arXiv cs.CV TIER_1 English(EN) · Shishir K. Shah · 2026-05-07 16:01

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

We address the challenge of knowledge composition in Vision-Language Models (VLMs), where accumulating expertise across multiple domains or tasks typically leads to catastrophic forgetting. We introduce GeoStack (Geometric Stacking), a modular framework that allows independently …

报道来源 [2]

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

相关实体

相关话题