PulseAugur

New metric optimizes LLM agent tool selection

Researchers have developed a chance-corrected metric called Bits-over-Random (BoR) for deciding how many candidate tools an LLM agent should be shown for a given query. The metric tests whether success at a given shortlist depth is actually better than what random selection would achieve at that depth. The researchers then applied this principle through reinforcement learning: an agent learned to adapt its shortlist size per query, substantially reducing the number of tools presented while maintaining or improving coverage and LLM selection accuracy.
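The idea of chance correction can be illustrated with a minimal sketch. The summary does not give the paper's formula, so everything here is an assumption: the sketch reads "bits over random" as the base-2 log ratio of observed success at shortlist depth k to the success a uniformly random shortlist of the same depth would achieve. The function names `random_success_at_k` and `bits_over_random` are hypothetical and are not taken from the paper.

```python
import math


def random_success_at_k(k: int, n_tools: int, n_relevant: int = 1) -> float:
    """Chance that a uniformly random shortlist of k tools (out of n_tools)
    contains at least one of the n_relevant correct tools."""
    if not 0 < k <= n_tools:
        raise ValueError("k must be between 1 and n_tools")
    if not 0 < n_relevant <= n_tools:
        raise ValueError("n_relevant must be between 1 and n_tools")
    # Hypergeometric miss probability: all k picks land on irrelevant tools.
    miss = math.comb(n_tools - n_relevant, k) / math.comb(n_tools, k)
    return 1.0 - miss


def bits_over_random(observed_success: float, k: int, n_tools: int,
                     n_relevant: int = 1) -> float:
    """ASSUMED definition, not confirmed by the source:
    log2(observed success@k / random success@k)."""
    if not 0.0 <= observed_success <= 1.0:
        raise ValueError("observed_success must be a probability")
    baseline = random_success_at_k(k, n_tools, n_relevant)
    if observed_success == 0.0:
        return float("-inf")
    return math.log2(observed_success / baseline)


if __name__ == "__main__":
    # Illustrative numbers only: a 100-tool catalogue, one correct tool.
    for k, success in [(5, 0.80), (20, 0.95), (100, 1.00)]:
        print(k, round(bits_over_random(success, k, 100), 2))
```

Under this reading, a retriever that reaches 80% success with 5 of 100 tools scores roughly 4 bits above chance, while showing all 100 tools scores 0 bits even though success is 100%, because a random shortlist of that depth would also always contain the correct tool. That is the sense in which raw success at depth k can overstate the value of a longer shortlist.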

Impact: Could make LLM agents more efficient by cutting unnecessary tool candidates, potentially improving response times and accuracy.

Ranking rationale: Academic paper detailing a new metric and evaluation methodology for LLM agents.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

  1. arXiv cs.AI · Tier 1 · English (EN) · Vyzantinos Repantis, Ameya Gawde, Harshvardhan Singh, Joey Blackwell II

    How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

    arXiv:2605.24660v1 (cross-listed). Abstract: Before an LLM agent can use a tool, a retrieval system must decide which candidate tools to show to the agent. How long should that shortlist be? Show too many tools and the model struggles to choose. Show too few and the correct …