Fei-Fei Li strikes again: an ImageNet for spatial intelligence is here
Fei-Fei Li's team has released ESI-Bench, a new benchmark for evaluating embodied spatial intelligence in AI. Unlike previous benchmarks, which assumed optimal observation, ESI-Bench requires AI agents to take actions to gather information, closing the perception-action loop. Initial tests with leading models such as GPT-5 and Gemini showed that current AI struggles with active exploration and decision-making, exhibiting "action blindness" and metacognitive deficits. The results indicate that the primary challenge lies in strategic action rather than pure perception.
IMPACT: Sets a new standard for embodied AI evaluation, highlighting action and metacognition as key challenges.