Can I Buy Your KV Cache?

AI agents could buy precomputed KV caches to save compute

By PulseAugur Editorial · [2 sources] · 2026-06-11 13:47

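Researchers propose a novel way to cut the compute that AI agents spend reading documents: precompute each document's key-value (KV) cache once and sell it. The approach targets redundant prefill, the most compute-intensive step a large model takes. An agent that loads a precomputed KV cache skips that step, which could reduce costs by up to 50x for popular documents. The proposed solution hosts the caches on a content delivery network (CDN) on the provider side to avoid high egress costs.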
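The reuse idea in the summary above can be illustrated with a minimal sketch. It is not the paper's implementation: `cache_key`, `KVCacheStore`, and `fake_prefill` are hypothetical names, the store is an in-memory dictionary standing in for a provider-side host, and the "cache" is a placeholder byte string rather than real attention tensors. It assumes a cache is keyed by both the model and the exact document text, since a KV cache is only reusable for the model that produced it.

```python
import hashlib
from typing import Callable, Dict


def cache_key(model_id: str, document: str) -> str:
    """Content-addressed key: same model and same text give the same key."""
    h = hashlib.sha256()
    h.update(model_id.encode("utf-8"))
    h.update(b"\x00")  # separator so ("ab", "c") and ("a", "bc") differ
    h.update(document.encode("utf-8"))
    return h.hexdigest()


class KVCacheStore:
    """Hypothetical stand-in for a provider-side cache host (in-memory)."""

    def __init__(self) -> None:
        self._blobs: Dict[str, bytes] = {}
        self.prefill_runs = 0  # how many times prefill actually ran

    def get_or_prefill(
        self, model_id: str, document: str, prefill: Callable[[str], bytes]
    ) -> bytes:
        key = cache_key(model_id, document)
        if key not in self._blobs:
            # Cache miss: pay for prefill once and keep the result.
            self._blobs[key] = prefill(document)
            self.prefill_runs += 1
        return self._blobs[key]


def fake_prefill(document: str) -> bytes:
    """Placeholder for real prefill; returns opaque bytes, not tensors."""
    return document.encode("utf-8")[::-1]


store = KVCacheStore()
doc = "the same popular document"

# Ten simulated agents read the same document with the same model.
# The count is arbitrary and not taken from the paper.
for _ in range(10):
    store.get_or_prefill("model-a", doc, fake_prefill)

print(store.prefill_runs)  # 1
```

In this toy setup, every reader after the first loads the stored result instead of recomputing it. A real system would also have to account for transfer cost, cache size, and pricing, none of which are modeled here.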

Impact: Eliminating redundant computation could significantly lower inference costs for AI agents.

Ranking rationale: An academic paper proposing a novel technical approach to AI compute efficiency.

Read on arXiv cs.MA (Multiagent) →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.AI TIER_1 English(EN) · Luoyuan Zhang · 2026-06-12 04:00

Can I Buy Your KV Cache?

arXiv:2606.13361v1 Announce Type: new Abstract: Right now, across the world, AI agents are repeating the same absurd act: to read one document, they each recompute it from scratch. Every agent re-runs prefill, the most compute-intensive step a large model takes, over identical te…
arXiv cs.MA (Multiagent) TIER_1 English(EN) · Luoyuan Zhang · 2026-06-11 13:47

Can I Buy Your KV Cache?

Right now, across the world, AI agents are repeating the same absurd act: to read one document, they each recompute it from scratch. Every agent re-runs prefill, the most compute-intensive step a large model takes, over identical text, only to rebuild a key-value (KV) cache ident…

