tool · [1 source] · 2026-05-22 04:00

New GROW framework boosts VLM agents with adapted GRPO

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 sources

Researchers have introduced GROW, a novel reinforcement learning framework designed to enhance the capabilities of vision-language model (VLM) agents in open-world tasks. Unlike previous methods that relied heavily on supervised fine-tuning, GROW adapts the Group Relative Policy Optimization (GRPO) algorithm by decomposing trajectories into state-action samples. This approach mitigates issues with long contexts and noise inherent in standard GRPO, enabling more effective multi-turn learning. Experiments on over 800 Minecraft tasks demonstrated that GROW achieves state-of-the-art performance, showcasing its potential for advancing VLM agents. AI

Summary written by gemini-2.5-flash-lite from 1 sources. How we write summaries →

IMPACT Enhances VLM agent performance in open-world tasks by improving reinforcement learning efficiency.

RANK_REASON Publication of an academic paper detailing a new AI framework and its experimental results. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

COVERAGE [1]

arXiv cs.AI TIER_1 · Xiongbin Wu, Zhihao Luo, Shanzhe Lei, Lechao Zhang, Xuhong Wang, Jie Yang, Zhonglong Zheng, Yuanjie Zheng, Xin Tan, Wei Liu · 2026-05-22 04:00

GROW: Aligning GRPO with State-Action Modeling for Open-World VLM Agents

arXiv:2605.20246v2 Announce Type: cross Abstract: Recently, vision-language model (VLM) agents have shown promising progress in open-world tasks, where successful task completion often requires multiple turns of visual perception and action execution. However, existing methods st…

COVERAGE [1]

GROW: Aligning GRPO with State-Action Modeling for Open-World VLM Agents

RELATED ENTITIES

RELATED TOPICS