PulseAugur
实时 07:54:21
English(EN) Task Robustness via Re-Labelling Vision-Action Robot Data

新的TREAD框架使用VLM来增强机器人学习数据

研究人员开发了一个名为TREAD的新框架,通过扩充现有数据集来改进机器人学习。该方法使用大型视觉语言模型(VLM)为机器人任务生成更多样化、语言更丰富的指令。通过将演示分解为基于语言-动作的配对,并添加文本目标的变体,TREAD增强了机器人理解新指令和场景并进行泛化的能力。 AI

影响 通过利用VLM的数据增强能力,增强了机器人指令遵循和泛化能力。

排序理由 该集群包含一篇详细介绍机器人学习新框架的研究论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.LG TIER_1 English(EN) · Artur Kuramshin, \"Ozg\"ur Aslan, Cyrus Neary, Glen Berseth ·

    Task Robustness via Re-Labelling Vision-Action Robot Data

    arXiv:2606.10918v1 Announce Type: cross Abstract: The recent trend in scaling models for robot learning has resulted in impressive policies that can perform various manipulation tasks and generalize to novel scenarios. However, these policies continue to struggle with following i…

  2. arXiv cs.LG TIER_1 English(EN) · Glen Berseth ·

    Task Robustness via Re-Labelling Vision-Action Robot Data

    The recent trend in scaling models for robot learning has resulted in impressive policies that can perform various manipulation tasks and generalize to novel scenarios. However, these policies continue to struggle with following instructions, likely due to the limited linguistic …