English(EN) NTILC: Neural Tool Invocation via Learned Compression

NTILC框架将LLM工具调用的上下文使用量削减95%

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-08 04:00

研究人员开发了NTILC，一个用于语言模型更高效调用工具的新框架。NTILC使用学习到的潜在检索将用户意图和工具规范映射到共享嵌入空间，无需在提示中包含完整的工具规范。与现有方法相比，该方法显著减少了95%以上的上下文窗口消耗和高达74%的推理延迟，同时提高了选择准确性。 AI

影响减少了LLM工具使用中的上下文窗口消耗和推理延迟，可能支持更复杂的代理行为。

排序理由该集群包含一篇详细介绍语言模型新框架的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Andrew Krikorian, Yayuan Li, Jason J. Corso · 2026-06-08 04:00

NTILC：通过学习压缩实现神经工具调用

arXiv:2606.06566v1 Announce Type: cross Abstract: Agentic tool-calling language models depend on large registries of callable APIs, functions, and local actions. Placing full tool specifications directly in the prompt incurs a cost that scales linearly with the size of the tool r…