Graphical User Interface
PulseAugur coverage of Graphical User Interface — every cluster mentioning Graphical User Interface across labs, papers, and developer communities, ranked by signal.
-
ScreenSearch system improves AI agent exploration of desktop GUIs
Researchers have developed ScreenSearch, a novel system designed to improve the exploration of desktop graphical user interface (GUI) states for AI agents. The system addresses the challenge of partial observability, wh…
-
EVE framework launches open-source LLMs for Earth Intelligence
Researchers have developed EVE, an open-source framework for creating specialized Large Language Models (LLMs) focused on Earth Intelligence. The core of EVE is EVE-Instruct, a 24 billion parameter model derived from Mi…
-
GoClick model offers lightweight GUI element grounding for on-device AI agents
Researchers have developed GoClick, a novel lightweight vision-language model designed for precise GUI element grounding on resource-constrained devices. Unlike existing large models, GoClick utilizes an encoder-decoder…
-
Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives
Researchers have explored token pruning strategies for GUI visual agents that utilize Multimodal Large Language Models (MLLMs). Their study revealed that background regions in screenshots, often overlooked, can provide …
-
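Holo1: A new family of GUI automation VLMs powering the GUI agent Surfer-H
Researchers have introduced the A11y-Compressor framework, which converts linearized accessibility trees into structured representations to make GUI agent observations more efficient. The approach significantly reduces input tokens while improving task success rates. Separately, a new benchmark called WindowsWorld was developed to evaluate GUI agents on complex, multi-application professional workflows, revealing that current agents perform poorly in such scenarios. In addition, VLAA-GUI provides a modular framework to address challenges such as early stopping and repetitive loops in autonomous GUI agents…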