DIVE: Embedding Compression via Self-Limiting Gradient Updates

New DIVE method compresses LLM embeddings for efficient vector search

By PulseAugur Editorial · [2 sources] · 2026-05-20 04:35

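Researchers have developed DIVE, a new method for compressing high-dimensional embeddings from large language models to cut the storage and compute costs of vector search systems. DIVE uses a self-limiting triplet loss to prevent excessive perturbation of the pretrained embeddings, together with a contrastive loss that treats multiple projections of an embedding as implicit views. The method aims to overcome the overfitting common in existing compression methods, especially when labeled data is scarce, and reports better performance than prior techniques on multiple datasets.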
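The summary does not give DIVE's actual loss definitions, so the following is only an illustrative sketch of the two ingredients it names, not the paper's formulation. It assumes that "self-limiting" can be approximated by a standard margin (hinge) triplet loss, whose gradient vanishes for triplets that already satisfy the margin, and that "implicit views" can be approximated by two random linear projections of the same embedding fed to an InfoNCE-style contrastive loss. All function names, the margin, the temperature, and the dimensions are hypothetical.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=axis, keepdims=True), eps)

def hinge_triplet_loss(anchor, positive, negative, margin=0.2):
    """Margin triplet loss on cosine similarity (stand-in, not DIVE's loss).

    Triplets that already satisfy the margin contribute zero loss, so they
    would also contribute zero gradient. Returns the mean loss and a boolean
    mask of the triplets that are still active.
    """
    a, p, n = (l2_normalize(v) for v in (anchor, positive, negative))
    pos_sim = np.sum(a * p, axis=-1)
    neg_sim = np.sum(a * n, axis=-1)
    per_triplet = np.maximum(0.0, margin - (pos_sim - neg_sim))
    return per_triplet.mean(), per_triplet > 0

def info_nce(view_a, view_b, temperature=0.1):
    """InfoNCE loss: row i of view_a should match row i of view_b."""
    a, b = l2_normalize(view_a), l2_normalize(view_b)
    logits = a @ b.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))

# Toy data: 8 "pretrained" embeddings of dimension 64, compressed to 16.
rng = np.random.default_rng(0)
batch, d_in, d_out = 8, 64, 16
embeddings = rng.normal(size=(batch, d_in))

# Two hypothetical projections of the same embeddings act as two views.
proj_a = rng.normal(size=(d_in, d_out)) / np.sqrt(d_in)
proj_b = rng.normal(size=(d_in, d_out)) / np.sqrt(d_in)
view_a, view_b = embeddings @ proj_a, embeddings @ proj_b
contrastive = info_nce(view_a, view_b)

# Synthetic triplets: positives are slightly perturbed anchors,
# negatives are unrelated random vectors.
positives = embeddings + 0.05 * rng.normal(size=embeddings.shape)
negatives = rng.normal(size=embeddings.shape)
triplet, active = hinge_triplet_loss(
    view_a, positives @ proj_a, negatives @ proj_a
)

print(f"contrastive loss: {contrastive:.4f}")
print(f"triplet loss: {triplet:.4f}, active triplets: {int(active.sum())}/{batch}")
```

In a real training loop the projections would be learned and the two losses combined with some weighting; how DIVE does this is not stated in the summary.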

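Impact: Reduces the compute and storage overhead of LLM embeddings, which may support more efficient and scalable vector search applications.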

Ranking rationale: This cluster contains a paper detailing a new embedding compression method.


AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.AI TIER_1 English(EN) · Dongfang Zhao · 2026-05-22 04:00

DIVE: Embedding Compression via Self-Limiting Gradient Updates

arXiv:2605.20689v1 Announce Type: cross Abstract: High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (A…
arXiv cs.AI TIER_1 English(EN) · Dongfang Zhao · 2026-05-20 04:35

DIVE: Embedding Compression via Self-Limiting Gradient Updates

High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), and SMEC (EMNLP 2025), enable dimensiona…
