How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

New Metric Optimizes Tool Selection for LLM Agents

By the PulseAugur editorial team · [2 sources] · 2026-05-23 17:02

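Researchers have developed a chance-corrected metric called Bits-over-Random (BoR) for assessing how many tools an LLM agent should be shown for a given query. The metric helps determine whether success at a given tool-shortlist depth is actually better than random selection. Applying this principle through reinforcement learning, an agent learned to adapt the size of its tool shortlist to each query, significantly reducing the number of tools presented while maintaining or improving coverage and the LLM's selection accuracy.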
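The summary does not give the BoR formula, so the following is only a minimal sketch of what a chance-corrected shortlist score could look like, not the paper's definition. It assumes (hypothetically) exactly one correct tool per query, a uniformly random baseline that picks k of n tools, and a score equal to the base-2 log of the ratio between observed and chance success. The function names `chance_success` and `bits_over_random` are illustrative.

```python
import math


def chance_success(k: int, n: int) -> float:
    """Chance that a uniformly random shortlist of k out of n tools
    contains the single correct tool (assumed baseline)."""
    if not 0 < k <= n:
        raise ValueError("require 0 < k <= n")
    return k / n


def bits_over_random(observed: float, k: int, n: int) -> float:
    """Hypothetical chance-corrected score: log2 of observed success@k
    divided by chance success@k. Zero means no better than random."""
    if not 0.0 <= observed <= 1.0:
        raise ValueError("observed success rate must lie in [0, 1]")
    if observed == 0.0:
        return float("-inf")
    return math.log2(observed / chance_success(k, n))


if __name__ == "__main__":
    n_tools = 100  # illustrative catalog size, not a figure from the paper
    # Made-up success rates at several shortlist depths.
    for k, observed in [(5, 0.80), (50, 0.95), (100, 1.00)]:
        score = bits_over_random(observed, k, n_tools)
        print(f"k={k:3d}  success={observed:.2f}  bits over random={score:.3f}")
```

Under these assumptions, a shortlist of 5 out of 100 tools with 80% success scores about 4 bits, while showing all 100 tools scores 0 bits even at 100% success, because a random shortlist of that depth would succeed just as often. This is the intuition behind asking whether success at a given depth beats chance.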

Impact: Improves LLM agent efficiency by cutting unnecessary tool consideration, potentially improving response time and accuracy.

Ranking rationale: An academic paper on a new metric and evaluation method for LLM agents.

Read on arXiv cs.IR (Information Retrieval) →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.AI TIER_1 English(EN) · Vyzantinos Repantis, Ameya Gawde, Harshvardhan Singh, Joey Blackwell II · 2026-05-26 04:00

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

arXiv:2605.24660v1 Announce Type: cross Abstract: Before an LLM agent can use a tool, a retrieval system must decide which candidate tools to show to the agent. How long should that shortlist be? Show too many tools and the model struggles to choose. Show too few and the correct …
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Joey Blackwell · 2026-05-23 17:02

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

Before an LLM agent can use a tool, a retrieval system must decide which candidate tools to show to the agent. How long should that shortlist be? Show too many tools and the model struggles to choose. Show too few and the correct tool may not appear. Most systems apply a fixed sh…
