English(EN) Learning Action Priors for Cross-embodiment Robot Manipulation

机器人操作模型通过两阶段训练获得运动先验 · 跟踪 2 个来源

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-24 17:59

研究人员开发了一种新颖的两阶段训练框架，以改进用于机器人操作的视觉-语言-动作 (VLA) 模型。该方法首先使用无条件动作轨迹预训练具有运动先验的动作模块，然后将其与视觉和语言特征对齐。通过为动作模块提供明确的运动先验，该方法提高了收敛速度、成功率和性能，尤其是在数据有限的现实世界任务中。 AI

影响这种方法可以加速在复杂、现实世界的操作任务中开发和部署更强大、更高效的机器人。

排序理由该集群包含两篇相同的 arXiv 预印本，详细介绍了机器人操作的新研究方法。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Mingyu Ding · 2026-06-24 17:59

为跨体机器人操纵学习动作先验

Most Vision-Language-Action (VLA) models build on a Vision-Language Model (VLM) backbone by attaching an action module and optimizing the full policy jointly. This design inherits strong visual and linguistic priors from the VLM, but leaves the action module to learn physical mot…
arXiv cs.CV TIER_1 English(EN) · Dong Jing, Tianqi Zhang, Jiaqi Liu, Jinman Zhao, Zelong Sun, Li Erran Li, Zhiwu Lu, Mingyu Ding · 2026-06-25 04:00

为跨体机器人操控学习动作先验

arXiv:2606.26095v1 Announce Type: cross Abstract: Most Vision-Language-Action (VLA) models build on a Vision-Language Model (VLM) backbone by attaching an action module and optimizing the full policy jointly. This design inherits strong visual and linguistic priors from the VLM, …