PulseAugur
实时 06:52:49
English(EN) GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

新方法 GUI-CIDER 提升 GUI 代理知识

研究人员开发了 GUI-CIDER,一种新颖的中间训练方法,旨在增强使用多模态大型语言模型构建的 GUI 代理的世界知识。该方法通过因果内化和密度感知示例重选明确内化 GUI 操作知识,解决了传统训练后方法的局限性。GUI-CIDER 合成数据,通过优先考虑因果结构和减少冗余来精炼数据,然后使用这些精炼的数据进行中间训练。实验表明,使用此方法训练的代理在 GUI 理解和任务成功率方面有了显著提高。 AI

影响 该方法可能带来更强大、更可靠的 GUI 代理,从而改善用户与软件的交互。

排序理由 该集群包含一篇详细介绍 AI 代理新训练方法的论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →

报道来源 [5]

  1. arXiv cs.CL TIER_1 English(EN) · Zheng Wu, Chengcheng Han, Zhengxi Lu, Tianjie Ju, Yanyu Chen, Qi Gu, Xunliang Cai, Zhuosheng Zhang ·

    GUI-CIDER:通过因果内化和感知密度样本重选进行中间训练的 GUI 代理

    arXiv:2605.28534v1 Announce Type: new Abstract: Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Exis…

  2. arXiv cs.CL TIER_1 English(EN) · Zhuosheng Zhang ·

    GUI-CIDER:通过因果内化和感知密度样本重选实现中途训练的GUI智能体

    Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Existing solutions typically rely on expensive multi…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    GUI-CIDER:通过因果内化和感知密度样本重选实现中途训练的GUI代理

    Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Existing solutions typically rely on expensive multi…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    GUI-CIDER:通过因果内化和感知密度样本重选进行中间训练的 GUI 代理

    GUI-CIDER is a mid-training method that explicitly incorporates GUI world knowledge through causal internalization and density-aware exemplar reselection to improve GUI agent performance.

  5. arXiv cs.CV TIER_1 English(EN) · Junlong Li, Chao Hao, Lap-Pui Chau, Yi Wang ·

    GUI-C$^2$:通过难度感知强化学习实现粗粒度到细粒度的GUI对齐

    arXiv:2605.30884v1 Announce Type: new Abstract: Existing agentic reinforcement learning methods for GUI grounding have limitations at two levels. At the data level, current approaches typically treat all training samples equally, although their training value to the baseline mode…