PulseAugur
实时 21:46:18
English(EN) GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots

GUICrafter: 弱监督GUI智能体减少标注需求

研究人员开发了GUICrafter,一种新颖的弱监督GUI智能体,旨在克服训练此类系统中的数据收集挑战。通过利用海量未标注截图,GUICrafter显著减少了对昂贵人工标注的需求。该智能体采用两阶段课程学习框架,首先从未标注数据中学习视觉基础,然后通过强化学习用少量高质量数据进行校准。实验表明,GUICrafter使用的数据量仅为现有系统(如UI-TARS)的一小部分,即可达到与之相当或更好的性能。 AI

影响 这种方法通过降低数据标注成本,可能显著降低开发复杂GUI智能体的入门门槛。

排序理由 该集群描述了一篇详细介绍新型AI模型及其方法论的研究论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

GUICrafter: 弱监督GUI智能体减少标注需求

报道来源 [3]

  1. arXiv cs.AI TIER_1 English(EN) · Sunqi Fan, Lingshan Chen, Runqi Yin, Qingle Liu, Yongming Rao, Meng-Hao Guo, Shi-Min Hu ·

    GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots

    arXiv:2606.29705v1 Announce Type: new Abstract: Data, as the fundamental substrate of modern intelligence, has greatly driven the development of current foundation models. Naturally, researchers aim to extend this paradigm to the domain of GUI agents, hoping to build strong GUI a…

  2. arXiv cs.CL TIER_1 English(EN) · Shi-Min Hu ·

    GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots

    Data, as the fundamental substrate of modern intelligence, has greatly driven the development of current foundation models. Naturally, researchers aim to extend this paradigm to the domain of GUI agents, hoping to build strong GUI agents through a similar paradigm. However, GUI a…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots

    GUICrafter addresses GUI agent data challenges through a weakly-supervised approach using unannotated screenshots and a two-stage curriculum learning framework for visual grounding and reinforcement learning calibration.