Researchers have introduced TAVIS, a benchmark designed to evaluate active vision in imitation learning for robotics. The benchmark comprises two task suites, TAVIS-Head and TAVIS-Hands, built on humanoid embodiments in IsaacLab. TAVIS offers a paired head-camera-vs-fixed-camera protocol, a novel Gaze-Action Lead Time (GALT) metric for anticipatory gaze, and procedural in-distribution/out-of-distribution splits. Initial experiments with Diffusion Policy and $\pi_0$ indicate that active vision generally improves performance but its benefit is task-dependent, and that multi-task policies struggle under distribution shift.
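The source does not define GALT precisely. As a rough illustration only, one plausible reading (an assumption, not the paper's definition) is the time by which gaze first fixates the task-relevant target before the corresponding action begins; the function name and signature below are hypothetical:

```python
import numpy as np

def gaze_action_lead_time(gaze_on_target, action_onset_step, dt):
    """Hypothetical GALT sketch: seconds by which gaze fixation on the
    target precedes action onset. `gaze_on_target` is a per-step boolean
    array; returns None if gaze never leads the action."""
    fixation_steps = np.flatnonzero(gaze_on_target)
    if fixation_steps.size == 0 or fixation_steps[0] > action_onset_step:
        return None  # gaze was never on target before the action started
    return float(action_onset_step - fixation_steps[0]) * dt

# Example: gaze lands on the target at step 2, the action starts at step 6,
# at 1 s per step the lead time is 4.0 s.
gaze = np.array([0, 0, 1, 1, 1, 1, 1, 1], dtype=bool)
galt = gaze_action_lead_time(gaze, action_onset_step=6, dt=1.0)
```

Larger GALT values would indicate more anticipatory gaze, which is presumably what the metric is meant to capture.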
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Establishes a standardized evaluation framework for active vision in robotics, potentially accelerating progress in imitation learning for complex manipulation tasks.
RANK_REASON The cluster describes a new academic benchmark and evaluation infrastructure for robotics research.