PulseAugur / Brief
EN
LIVE 20:34:25

Brief

last 24h
[1/1] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Enhancing Gaze Reasoning in Vision Foundation Models for Gaze Following

    Researchers have developed new methods to evaluate and improve how vision-language models (VLMs) understand human gaze. One study introduces EyeVLM, a framework to benchmark VLMs on gaze following and social gaze prediction, finding current models lack precise understanding. A separate paper proposes a novel training mechanism using local LoRA and an out-of-cone penalty to enhance gaze reasoning in vision foundation models for gaze following tasks, achieving state-of-the-art results. AI

    IMPACT New benchmarks and training techniques could lead to more sophisticated AI systems capable of understanding human attention and social cues.