English(EN) Multi-Vector Embeddings are Provably More Expressive than Single Vector Embeddings

多向量嵌入被证明比单向量嵌入更具表现力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-22 15:22

一篇新发表在arXiv上的论文将多向量（MV）嵌入的概念引入为与单向量（SV）嵌入相比，神经信息检索更具表现力的范式。该研究正式证明了使用多个向量表示数据并使用Chamfer相似度的MV嵌入，相比依赖内积相似度的SV嵌入具有显著优势。研究结果确立了表示大小上的强烈分离，证实了信息检索界长期以来的观点，即MV嵌入可以捕捉SV嵌入无法近似表示的相似性。 AI

影响这项研究为多向量嵌入在信息检索中的优越性提供了理论支持，可能影响未来的模型架构和检索系统设计。

排序理由该集群包含一篇发表在arXiv上的研究论文，详细介绍了嵌入技术的理论进展。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.IR (Information Retrieval) 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Rajesh Jayaram · 2026-06-22 15:22

Multi-Vector Embeddings are Provably More Expressive than Single Vector Embeddings

Multi-vector (MV) embeddings have become a powerful paradigm in neural information retrieval (IR), achieving high retrieval accuracy by representing data with multiple vectors and scoring them via the non-linear Chamfer similarity. Despite their widely perceived superiority over …

报道来源 [1]

Multi-Vector Embeddings are Provably More Expressive than Single Vector Embeddings

相关实体

相关话题