PulseAugur
EN
LIVE 12:47:16

New DragOn dataset aims to boost GUI agent drag-and-drop skills

Researchers have introduced DragOn, a new benchmark and dataset designed to improve the performance of GUI agents on drag-based interactions. The dataset includes 286,000 training screenshots and 3.5 million training tasks across four domains: text highlighting, cell selection, element resizing, and slider manipulation. Evaluations on proprietary and open-weight models, including a fine-tuned Qwen VLM, suggest that DragOn can enhance the capabilities of state-of-the-art models for complex computer-use tasks. AI

IMPACT Enhances AI's ability to automate complex GUI interactions, potentially accelerating digital task automation.

RANK_REASON The cluster contains an academic paper introducing a new benchmark and dataset for AI research. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Ronan Riochet ·

    DragOn: A Benchmark and Dataset for Drag-Based GUI Interactions

    GUI agents - vision-based models that control desktops, web browsers, and mobile devices through graphical user interfaces - promise to automate a wide range of digital tasks. While million-scale datasets have enabled substantial progress on click-grounding, drag grounding (e.g. …