新的HiViG批评者通过历史和视觉提升AI代理的GUI性能

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-05 00:00

研究人员开发了HiViG，一个旨在提高计算机使用代理（CUAs）在复杂图形用户界面环境中性能的新颖框架。HiViG通过结合过去行动的历史感知和视觉基础来检测错误，从而解决了现有批评者的局限性。这个多模态批评者在真实的GUI轨迹上进行训练，通过总结过去的成就并根据屏幕截图验证执行坐标来评估行动，从而在有缺陷的行动发生之前阻止它们。 AI

影响通过减少规划和执行错误，增强了AI代理在复杂GUI任务中的可靠性。

排序理由该集群包含一篇详细介绍AI代理新技术框架的学术论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.AI TIER_1 English(EN) · Jaewoo Lee, Zaid Khan, Archiki Prasad, Justin Chih-Yao Chen, Supriyo Chakraborty, Kartik Balasubramaniam, Sambit Sahu, Elias Stengel-Eskin, Hyunji Lee, Mohit Bansal · 2026-06-10 04:00

A History-Aware Visually Grounded Critic for Computer Use Agents

arXiv:2606.11078v1 Announce Type: new Abstract: Various test-time interventions for Computer Use Agents (CUAs), including critic models, have been developed to improve performance through pre-execution action evaluation in complex Graphical User Interface (GUI) environments. Howe…
arXiv cs.CL TIER_1 English(EN) · Mohit Bansal · 2026-06-09 16:39

A History-Aware Visually Grounded Critic for Computer Use Agents

Various test-time interventions for Computer Use Agents (CUAs), including critic models, have been developed to improve performance through pre-execution action evaluation in complex Graphical User Interface (GUI) environments. However, existing critics suffer from two key limita…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-05 00:00

ReVision: Scaling Computer-Use Agents via Temporal Visual Redundancy Reduction

ReVision improves computer-use agent efficiency by removing redundant visual patches from consecutive screenshots while preserving spatial structure, reducing token usage by 46% and improving success rates.

报道来源 [3]

A History-Aware Visually Grounded Critic for Computer Use Agents

A History-Aware Visually Grounded Critic for Computer Use Agents

ReVision: Scaling Computer-Use Agents via Temporal Visual Redundancy Reduction

相关实体

相关话题