English(EN) Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

机器人VLA通过新自适应方法从过去的成功中学习

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 07:11

研究人员开发了一个名为Retrieve-then-Steer的新框架，以提高机器人操作任务中视觉-语言-动作（VLA）模型的可靠性。该方法允许一个部分有能力但冻结的VLA模型通过学习其在给定环境中自身成功的过往执行来适应和提高其性能。该系统存储成功的观察-动作片段，检索相关的片段，过滤以确保一致性，并利用这种聚合的经验来指导动作生成过程，从而提高任务成功率和稳定性，尤其是在复杂、长周期的任务中。 AI

影响通过使模型能够在不重新训练的情况下从成功的过往动作中学习，增强了机器人操作的可靠性。

排序理由发布了一篇详细介绍新AI模型自适应方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Yihong Gong · 2026-05-11 07:11

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

Vision-Language-Action (VLA) models show strong potential for general-purpose robotic manipulation, yet their closed-loop reliability often degrades under local deployment conditions. Existing evaluations typically treat test episodes as independent zero-shot trials. However, rea…

报道来源 [1]

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

相关实体

相关话题