PulseAugur
Chinese (ZH) original: 硬氪 (Hard Kr) Exclusive Interview | BAAI Director Wang Zhongyuan: VLA Will Not Die, but World Models Are the Future

BAAI Director: World Models are the Future for Embodied AI

Wang Zhongyuan, Director of the Beijing Academy of Artificial Intelligence (BAAI), discussed the concept of "World Models" in AI, distinguishing them from current large language models (LLMs) and video generation models. He outlined four existing approaches to World Models: language-centric, pixel-centric, 3D structure-centric, and visual representation-centric. BAAI is exploring a fifth approach that integrates language and vision within a unified latent space representation. Wang emphasized that true World Models must understand physical laws, causality, and temporal consistency, moving beyond mere visual realism or token prediction to predict physical states. He believes World Models are crucial for advancing embodied AI, likening them to the "brain" for robotic "bodies," and anticipates that their development will take several years.

IMPACT World Models are poised to become the next foundational AI, enabling robots to understand and interact with the physical world, moving beyond current LLM and video generation capabilities.

RANK_REASON Interview with a prominent AI researcher discussing future AI development directions and concepts.

Read on 36氪 (36Kr) →

AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

  1. 36氪 (36Kr) TIER_1 中文(ZH) ·

    硬氪 (Hard Kr) Exclusive Interview | BAAI Director Wang Zhongyuan: VLA Will Not Die, but World Models Are the Future

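    Author | Qiu Xiaofen (邱晓芬)

    Editor | Yuan Silai (袁斯来)

    Over the past few months, the "world model" has rapidly swelled from academic jargon into a keyword across the AI and robotics industries.

    Behind this shift in the industry's attention lies real anxiety.

    On the one hand, after two years of unchecked growth, embodied intelligence has exposed the shortcomings of current AI in the physical world: a robot can recognize objects but does not understand that "a pushed cup will fall"; it can follow instructions but cannot anticipate "how much force it takes to twist off a bottle cap." World models are an attempt to close exactly this gap, letting robots learn the regularities and causality of the physical world.

    In other words, the relationship between world models and embodied intelligence is, in essence, …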