PulseAugur
EN
LIVE 11:46:56

New dataset probes VLM concept learning against human generalization

Researchers have introduced the Novel Visual References Dataset (NVRD), comprising over 19,000 images across 90 visual concepts, designed to test how vision-language models (VLMs) learn new concepts, especially when they conflict with pre-existing knowledge. Evaluations of both open- and closed-source models alongside human judgments revealed that VLMs struggle to adapt to novel concepts in-context and tend to overgeneralize learned labels to incorrect stimuli, unlike humans. The NVRD aims to serve as a benchmark for studying visual concept acquisition in both humans and machines. AI

IMPACT Establishes a new benchmark for evaluating VLM concept learning and generalization, highlighting current limitations compared to human capabilities.

RANK_REASON The cluster contains an academic paper detailing a new dataset and benchmark for evaluating vision-language models. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Ada Defne T\"ur, Gaurav Kamath, Joyce Chai, Siva Reddy, Benno Krojer ·

    Would you still call this Dax? Novel Visual References in VLMs and Humans

    arXiv:2606.05409v1 Announce Type: cross Abstract: Vision-language models (VLMs), like human learners, are frequently exposed to new visual concepts, but how they map novel visual references to language after exposure remains largely underexplored, particularly when those referenc…