English(EN) Temporal Self-Imitation Learning

机器人研究探索新的自监督和SSM用于模仿学习 · 跟踪2个来源

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-19 04:00

两篇新研究论文探讨了改进机器人强化学习的高级技术。第一篇，时间自模仿学习（TSIL），提出一种方法，利用成功机器人轨迹的时间效率作为自监督信号，提高各种操作任务的学习效率和鲁棒性。第二篇论文RoboSSM提出使用状态空间模型（SSMs）代替Transformer进行上下文模仿学习，展示了机器人处理具有有限演示的长时任务的改进的可扩展性和泛化能力。 AI

影响强化学习和模仿学习的这些进步可能导致机器人在复杂、长时任务中更具能力和适应性。

排序理由两篇在arXiv上发表的学术论文，详细介绍了机器人强化学习和模仿学习的新颖方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Yinsen Jia, Boyuan Chen · 2026-06-19 04:00

Temporal Self-Imitation Learning

arXiv:2606.19752v1 Announce Type: cross Abstract: Long-horizon robot manipulation policies trained with reward shaping can still exploit dense rewards through inefficient interaction, while rare efficient behaviors may be forgotten during training. We argue that temporal efficien…
arXiv cs.AI TIER_1 English(EN) · Youngju Yoo, Jiaheng Hu, Yifeng Zhu, Bo Liu, Qiang Liu, Roberto Mart\'in-Mart\'in, Peter Stone · 2026-06-19 04:00

RoboSSM: Scalable In-context Imitation Learning via State-Space Models

arXiv:2509.19658v2 Announce Type: replace-cross Abstract: In-context imitation learning (ICIL) enables robots to learn tasks from prompts consisting of just a handful of demonstrations. By eliminating the need for parameter updates at deployment time, this paradigm supports few-s…

报道来源 [2]

Temporal Self-Imitation Learning

RoboSSM: Scalable In-context Imitation Learning via State-Space Models

相关实体

相关话题