English(EN) CT-VAM: A Cerebello-Thalamic-Inspired Vision-Action Model for Efficient Visuomotor Control

新AI模型增强机器人视觉运动控制和记忆能力

作者 PulseAugur 编辑部 · [4 个来源] · 2026-06-05 04:00

研究人员开发了新的机器人视觉运动控制模型，专注于高效和预测性协调。CT-VAM是一个受小脑-丘脑启发的模型，采用紧凑的架构进行快速、任务条件化的动作预测，支持云边协同范式。Chameleon通过整合控制索引的前瞻性记忆来解决观察-动作延迟问题，显著提高了在具有挑战性基准上的性能。此外，一个基于扩散的框架通过整合多模态信号来预测人类运动，从而学习预测性的视觉运动协调。 AI

影响视觉运动控制的进步可能加速机器人自主性和人机交互。

排序理由多篇研究论文介绍了用于视觉运动控制的新模型和基准。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.AI TIER_1 English(EN) · Jiacheng Li, Yize Guo, Jiabin Guo, Qingchen Liu, Jiahu Qin · 2026-06-09 04:00

CT-VAM：受小脑-丘脑启发的视觉-动作模型，用于高效的视觉运动控制

arXiv:2606.09572v1 Announce Type: cross Abstract: Vision-language-action models have shown strong promise for robot manipulation, yet raw language is primarily needed to specify task intent rather than to be repeatedly processed during high-frequency low-level execution. Motivate…
arXiv cs.AI TIER_1 English(EN) · Jiahu Qin · 2026-06-08 14:46

CT-VAM：受小脑-丘脑启发的视觉-动作模型，用于高效的视觉运动控制

Vision-language-action models have shown strong promise for robot manipulation, yet raw language is primarily needed to specify task intent rather than to be repeatedly processed during high-frequency low-level execution. Motivated by this separation, we propose a cerebello-thala…
arXiv cs.AI TIER_1 English(EN) · Xinying Guo, Chenxi Jiang, Hyun Bin Kim, Yuhang Han, Ying Sun, Yang Xiao, Jianfei Yang · 2026-06-08 04:00

Chameleon：用于视觉运动操控的控制索引前瞻性记忆

arXiv:2603.24576v2 Announce Type: replace-cross Abstract: Robots often observe information that determines a future action long before that action is executed. In a shell game, for example, a robot first sees which cup hides the ball, watches the cups move, and only later needs t…
arXiv cs.CV TIER_1 English(EN) · Wenqi Jia, Bolin Lai, Miao Liu, Danfei Xu, James M. Rehg · 2026-06-05 04:00

学习预测性视动协调

arXiv:2503.23300v2 Announce Type: replace Abstract: Understanding and predicting human visuomotor coordination is crucial for applications in robotics, human-computer interaction, and assistive technologies. This work introduces a forecasting-based task for visuomotor modeling, w…

报道来源 [4]

CT-VAM：受小脑-丘脑启发的视觉-动作模型，用于高效的视觉运动控制

CT-VAM：受小脑-丘脑启发的视觉-动作模型，用于高效的视觉运动控制

Chameleon：用于视觉运动操控的控制索引前瞻性记忆

学习预测性视动协调

相关实体

相关话题