Brief · PulseAugur

TOOL · 量子位 (QbitAI) 中文(ZH) · 4d

Feifei Li strikes again, ImageNet for spatial intelligence is here

A new benchmark called ESI-Bench has been released by Fei-Fei Li's team to evaluate embodied spatial intelligence in AI. Unlike previous benchmarks that assumed optimal observation, ESI-Bench requires AI agents to actively take actions to gather information, closing the perception-action loop. Initial tests with leading models like GPT-5 and Gemini revealed that current AI struggles with active exploration and decision-making, exhibiting "action blindness" and metacognitive deficits, indicating that the primary challenge lies in strategic action rather than pure perception. AI

IMPACT Sets a new standard for embodied AI evaluation, highlighting action and metacognition as key challenges.

Gemini
Stanford University
GPT-5
UCLA
Tsinghua University
Fei-Fei Li
OmniGibson
ESI-Bench
Jiajun Wu
Yejin Choi
Yining Hong
BEHAVIOR-1K