GeneralVLA-2 advances robot planning with improved 3D reconstruction and memory

By PulseAugur Editorial · [2 sources] · 2026-06-16 00:00

Researchers have introduced GeneralVLA-2, a vision-language-action system for robot planning. It adds two components: GeoFuse-MV3D for 3D reconstruction and an upgraded KnowledgeBank for memory management in robotic tasks. GeoFuse-MV3D addresses the limitations of single-view reconstruction by fusing geometry while preserving appearance. The upgraded KnowledgeBank provides governed long-term memory with explicit metadata for quality and confidence.

IMPACT Enhances robot planning capabilities by improving 3D reconstruction and memory management, potentially leading to more sophisticated robotic manipulation and navigation.

RANK_REASON The cluster describes a new research paper detailing advancements in a vision-language-action system for robotics.

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

arXiv cs.CV TIER_1 English(EN) · Haoyu Wang, Guoqing Ma, Zeyu Zhang, Yandong Guo, Boxin Shi, Hao Tang · 2026-06-17 04:00

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

arXiv:2606.17480v1 · Abstract: Generalist vision-language-action systems need object-centric 3D evidence and reusable manipulation experience to plan reliable robot trajectories. GeneralVLA provides a hierarchical interface for converting language and RGB-D obser…

arXiv cs.CV TIER_1 English(EN) · Hao Tang · 2026-06-16 03:45

GeneralVLA-2: Geometry-Aware Reconstruction and Governed Memory for Robot Planning

Generalist vision-language-action systems need object-centric 3D evidence and reusable manipulation experience to plan reliable robot trajectories. GeneralVLA provides a hierarchical interface for converting language and RGB-D observations into 3D end-effector paths, but two bott…
