English(EN) Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

研究质疑AI智能体的工具使用有效性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-01 15:04

一项新研究质疑了多模态AI智能体在工具使用方面的有效性，认为观察到的基准提升可能并非源于真实的能力提升。研究人员发现，像Thyme和DeepEyesV2这样的智能体在获得工具访问权限后，一致性的提升非常小，即使没有工具也能解决大多数问题。研究表明，这些智能体可能只是学会了模仿工具调用模式，而不是真正利用工具来增强解决问题的能力。 AI

影响挑战了工具使用能固有地提升AI智能体能力的假设，促使重新评估当前的评估方法。

排序理由学术论文，提出了新的研究发现。

在 arXiv cs.AI 阅读 →

DeepEyesV2

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Garvin Guo, Donglei Yu, Yu Chen, Xiang Wang, Shuai Li, Xinpei Zhao, Huaxing Liu, Qinghao Wang, Minpeng Liao · 2026-06-02 04:00

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

arXiv:2606.02357v1 Announce Type: cross Abstract: Tool-augmented multimodal agents show strong benchmark gains, often taken as evidence that agents have learned to use tools. We argue that this interpretation can be premature: a tool-call trace alone does not show whether the too…
arXiv cs.AI TIER_1 English(EN) · Minpeng Liao · 2026-06-01 15:04

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

Tool-augmented multimodal agents show strong benchmark gains, often taken as evidence that agents have learned to use tools. We argue that this interpretation can be premature: a tool-call trace alone does not show whether the tool supplied answer-critical information. We study t…

报道来源 [2]

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains

相关话题