ActionMap improves robot policy learning with voxel heatmap

By PulseAugur Editorial · [2 sources] · 2026-06-05 04:42

Researchers have developed ActionMap, a novel voxel heatmap action head designed to improve robot policy learning in vision-language-action (VLA) models. This new head replaces the traditional action decoder, predicting a heatmap over the action space to better exploit the geometric proximity of actions. In simulations and real-world tests, ActionMap demonstrated superior performance and data efficiency compared to existing methods, suggesting that action representation is a key factor in VLA model effectiveness. AI

IMPACT ActionMap's improved data efficiency and performance could accelerate VLA model development and real-world robot deployment.

RANK_REASON The cluster contains a research paper detailing a new method for robot policy learning.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.CV TIER_1 English(EN) · Pei Yang, Hai Ci, Yanzhe Chen, Qi Lv, Han Cai, Mike Zheng Shou · 2026-06-08 04:00

ActionMap: Robot Policy Learning via Voxel Action Heatmap

arXiv:2606.06904v1 Announce Type: cross Abstract: Vision-language-action (VLA) models have advanced rapidly across backbones, training recipes, and data scale, yet the action decoder, which converts the backbone's hidden state into a continuous control signal, has barely changed …
arXiv cs.CV TIER_1 English(EN) · Mike Zheng Shou · 2026-06-05 04:42

ActionMap: Robot Policy Learning via Voxel Action Heatmap

Vision-language-action (VLA) models have advanced rapidly across backbones, training recipes, and data scale, yet the action decoder, which converts the backbone's hidden state into a continuous control signal, has barely changed and remains a single-point predictor across the ma…

COVERAGE [2]

ActionMap: Robot Policy Learning via Voxel Action Heatmap

ActionMap: Robot Policy Learning via Voxel Action Heatmap

RELATED ENTITIES

RELATED TOPICS