新的NavWAM模型集成了视觉预测用于机器人导航

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-12 04:00

研究人员开发了NavWAM，这是一种新颖的扩散-Transformer策略，旨在增强机器人面向目标的视觉导航。与需要单独规划模块的先前方法不同，NavWAM将未来预测与动作和价值目标直接集成到共享的潜在序列中。这使得视觉预测能够更直接地用于机器人控制，从而提高在模拟和现实世界导航任务中的性能。 AI

排序理由这是一篇描述用于机器人导航的新模型的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

arXiv
NavWAM

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Daichi Azuma, Taiki Miyanishi, Koya Sakamoto, Shuhei Kurita, Yaonan Zhu, Petr Khrapchenkov, Motoaki Kawanabe, Yusuke Iwasawa, Yutaka Matsuo · 2026-06-12 04:00

NavWAM: A Navigation World Action Model for Goal-Conditioned Visual Navigation

arXiv:2606.13494v1 Announce Type: cross Abstract: Goal-conditioned visual navigation requires a robot to act under partial observability by anticipating how its motion will change the future egocentric view and whether that change brings it closer to the goal. Navigation world mo…

报道来源 [1]

NavWAM: A Navigation World Action Model for Goal-Conditioned Visual Navigation

相关实体

相关话题