PulseAugur
EN
LIVE 06:53:47

New method GUI-CIDER boosts GUI agent knowledge

Researchers have developed GUI-CIDER, a novel mid-training method designed to enhance the world knowledge of GUI agents built with multimodal large language models. This approach explicitly internalizes GUI operational knowledge through causal internalization and density-aware exemplar reselection, addressing limitations of traditional post-training methods. GUI-CIDER synthesizes data, refines it by prioritizing causal structures and reducing redundancy, and then uses this refined data for mid-training. Experiments show significant improvements in GUI understanding and task success rates for agents trained with this method. AI

IMPACT This method could lead to more capable and reliable GUI agents, improving user interaction with software.

RANK_REASON The cluster contains a research paper detailing a new method for training AI agents.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 5 sources. How we write summaries →

COVERAGE [5]

  1. arXiv cs.CL TIER_1 English(EN) · Zheng Wu, Chengcheng Han, Zhengxi Lu, Tianjie Ju, Yanyu Chen, Qi Gu, Xunliang Cai, Zhuosheng Zhang ·

    GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

    arXiv:2605.28534v1 Announce Type: new Abstract: Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Exis…

  2. arXiv cs.CL TIER_1 English(EN) · Zhuosheng Zhang ·

    GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

    Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Existing solutions typically rely on expensive multi…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

    Despite the rapid progress of multimodal large language models in building Graphical User Interface (GUI) agents, their real-world task completion is fundamentally bottlenecked by a lack of world knowledge about GUI operations. Existing solutions typically rely on expensive multi…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

    GUI-CIDER is a mid-training method that explicitly incorporates GUI world knowledge through causal internalization and density-aware exemplar reselection to improve GUI agent performance.

  5. arXiv cs.CV TIER_1 English(EN) · Junlong Li, Chao Hao, Lap-Pui Chau, Yi Wang ·

    GUI-C$^2$: Coarse-to-Fine GUI Grounding via Difficulty-Aware Reinforcement Learning

    arXiv:2605.30884v1 Announce Type: new Abstract: Existing agentic reinforcement learning methods for GUI grounding have limitations at two levels. At the data level, current approaches typically treat all training samples equally, although their training value to the baseline mode…