Brief

last 24h

[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · 量子位 (QbitAI) 中文(ZH) · 4d

Feifei Li strikes again, ImageNet for spatial intelligence is here

A new benchmark called ESI-Bench has been released by Fei-Fei Li's team to evaluate embodied spatial intelligence in AI. Unlike previous benchmarks that assumed optimal observation, ESI-Bench requires AI agents to actively take actions to gather information, closing the perception-action loop. Initial tests with leading models like GPT-5 and Gemini revealed that current AI struggles with active exploration and decision-making, exhibiting "action blindness" and metacognitive deficits, indicating that the primary challenge lies in strategic action rather than pure perception. AI

IMPACT Sets a new standard for embodied AI evaluation, highlighting action and metacognition as key challenges.
- UCLA
- Tsinghua University
- Fei-Fei Li
- OmniGibson
- ESI-Bench
- Jiajun Wu
- Yejin Choi
- Yining Hong
- BEHAVIOR-1K
- Gemini
- Stanford University
- GPT-5
RESEARCH · arXiv cs.AI English(EN) · 1w · [2 sources]

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

Researchers have introduced ESI-Bench, a new benchmark designed to evaluate embodied spatial intelligence in AI agents. This benchmark focuses on the perception-action loop, where agents actively explore their environment to gather information rather than passively processing visual data. Experiments with state-of-the-art multimodal large language models (MLLMs) show that active exploration significantly improves performance compared to passive observation, though failures often stem from poor action choices rather than weak perception. The study also highlights a metacognitive gap in models, as they tend to commit to conclusions prematurely, unlike humans who revise beliefs based on contradictory evidence. AI

IMPACT This benchmark could drive progress in developing AI agents capable of more sophisticated real-world interaction and problem-solving.

Brief

Feifei Li strikes again, ImageNet for spatial intelligence is here

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop