PulseAugur
实时 10:27:36
English(EN) A History-Aware Visually Grounded Critic for Computer Use Agents

新的HiViG批评者通过历史和视觉提升AI代理的GUI性能

研究人员开发了HiViG,一个旨在提高计算机使用代理(CUAs)在复杂图形用户界面环境中性能的新颖框架。HiViG通过结合过去行动的历史感知和视觉基础来检测错误,从而解决了现有批评者的局限性。这个多模态批评者在真实的GUI轨迹上进行训练,通过总结过去的成就并根据屏幕截图验证执行坐标来评估行动,从而在有缺陷的行动发生之前阻止它们。 AI

影响 通过减少规划和执行错误,增强了AI代理在复杂GUI任务中的可靠性。

排序理由 该集群包含一篇详细介绍AI代理新技术框架的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Jaewoo Lee, Zaid Khan, Archiki Prasad, Justin Chih-Yao Chen, Supriyo Chakraborty, Kartik Balasubramaniam, Sambit Sahu, Elias Stengel-Eskin, Hyunji Lee, Mohit Bansal ·

    A History-Aware Visually Grounded Critic for Computer Use Agents

    arXiv:2606.11078v1 Announce Type: new Abstract: Various test-time interventions for Computer Use Agents (CUAs), including critic models, have been developed to improve performance through pre-execution action evaluation in complex Graphical User Interface (GUI) environments. Howe…

  2. arXiv cs.CL TIER_1 English(EN) · Mohit Bansal ·

    A History-Aware Visually Grounded Critic for Computer Use Agents

    Various test-time interventions for Computer Use Agents (CUAs), including critic models, have been developed to improve performance through pre-execution action evaluation in complex Graphical User Interface (GUI) environments. However, existing critics suffer from two key limita…