PulseAugur / Brief
EN
LIVE 12:36:54

Brief

last 24h
[1/1] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics

    Researchers have introduced HanDyVQA, a new video question-answering benchmark designed to evaluate fine-grained understanding of hand-object interaction dynamics. The benchmark includes over 11,000 QA pairs across six question types, focusing on manipulation styles, motion, and part-level state changes. Even advanced models like Gemini 2.5 Pro struggled, achieving only 73% average accuracy compared to human performance of 97%, highlighting ongoing challenges in spatial relationship and geometric understanding. AI

    IMPACT Highlights limitations in current video foundation models for understanding complex human-object interactions, guiding future research.