PulseAugur
实时 11:53:39
English(EN) Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models

具身智能采用鸟瞰图以实现可扩展机器人数据

一种名为 Dexterity-BEV 的新方法正在被引入,通过改编自动驾驶中的鸟瞰图 (BEV) 方法论来解决具身智能中的数据挑战。该方法旨在将包括视觉输入、传感器读数和动作指令在内的异构机器人数据统一到通用的空间参考框架中。这种统一的表示旨在实现更具可扩展性和可转移性的机器人训练,超越简单的数据聚合,为具身人工智能建立基础数据基础设施。 AI

影响 Dexterity-BEVEmbodied-R1.5 等新框架旨在标准化机器人数据并提高泛化能力,有可能加速开发更强大、更适应性强的具身人工智能系统。

排序理由 多篇研究论文介绍了具身智能的新模型和框架。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · 量子位的朋友们 ·

    BEV Enters Embodied Intelligence: Cross-Dimensional Bringing Robot Data onto the Scaling Fast Lane

  2. arXiv cs.AI TIER_1 English(EN) · Xin Zhou, Cong Miao ·

    EWAM: An Enhanced World Action Model for Closed-Loop Online Adaptation in Embodied Intelligence

    arXiv:2606.12690v1 Announce Type: cross Abstract: In this paper, we propose the Enhanced World Action Model (EWAM), a closed-loop online adaptation architecture built upon a pretrained and fully frozen Cosmos3 backbone network. Evaluated entirely under a zero-shot task protocol, …

  3. arXiv cs.AI TIER_1 English(EN) · Yifu Yuan, Yaoting Huang, Xianze Yao, Yutong Li, Shuoheng Zhang, Linqi Han, Pengyi Li, Jiangeng Sun, Wenting Jia, Zhao Zhang, Yuhao Liu, Ruihao Liao, Yucheng Hu, Qiyu Wu, Yuxiao Li, Zibin Dong, Fei Ni, Yan Zheng, Shuyang Gu, Yi Ma, Hongyao Tang, Han Hu, … ·

    Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models

    arXiv:2606.11324v1 Announce Type: cross Abstract: We introduce Embodied-R1.5, a unified Embodied Foundation Model (EFM) that integrates comprehensive embodied reasoning capabilities, spanning embodied cognition, task planning, correction, and pointing, within a single architectur…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models

    Embodied-R1.5 is a unified embodied foundation model that integrates embodied reasoning capabilities and achieves state-of-the-art performance on embodied vision-language benchmarks through a multi-task balanced reinforcement learning approach.