tool · [1 source] · 2026-05-22 04:00

New MLLM benchmark reveals 'Prejudice Gap' in personality assessment

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have introduced a new task, Grounded Personality Reasoning (GPR), to evaluate whether Multimodal Large Language Models (MLLMs) understand personality beyond superficial pattern matching. To support the task, they built a dataset, MM-OCEAN, containing videos and evidence-grounded trait analyses. Benchmarking 27 MLLMs revealed a significant 'Prejudice Gap': over half of correct personality ratings were not supported by observable evidence, indicating a disconnect between accurate scoring and genuine reasoning.
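As a rough illustration of the kind of quantity the 'Prejudice Gap' finding describes, the sketch below computes the share of correct ratings that lack grounded evidence. This is an assumption-laden toy, not the paper's metric: the `Judgment` record, its two boolean fields, the `ungrounded_share` function, and the sample data are all hypothetical, and the summary does not say how the authors decide that a rating is correct or that evidence is grounded.

```python
from dataclasses import dataclass


@dataclass
class Judgment:
    # Hypothetical fields; the paper's actual criteria are not given in the summary.
    rating_correct: bool      # model's trait rating matched the reference rating
    evidence_grounded: bool   # model's cited evidence matched observable cues


def ungrounded_share(judgments):
    """Share of correct ratings whose supporting evidence was not grounded."""
    correct = [j for j in judgments if j.rating_correct]
    if not correct:
        return 0.0  # no correct ratings, so the share is defined as zero here
    ungrounded = sum(1 for j in correct if not j.evidence_grounded)
    return ungrounded / len(correct)


# Made-up toy data, not results from the benchmark.
sample = [
    Judgment(rating_correct=True, evidence_grounded=True),
    Judgment(rating_correct=True, evidence_grounded=False),
    Judgment(rating_correct=True, evidence_grounded=False),
    Judgment(rating_correct=False, evidence_grounded=False),
]

print(f"Ungrounded share of correct ratings: {ungrounded_share(sample):.2f}")
```

Under this reading, a value above 0.5 would correspond to the reported situation in which more than half of the correct ratings are unsupported by observable evidence.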

IMPACT Highlights a critical limitation in current MLLMs, suggesting a need for models that can ground social cognition in observable evidence.

RANK_REASON The cluster describes a new academic paper introducing a novel task, dataset, and benchmark for evaluating MLLMs.

paper
safety

COVERAGE [1]

arXiv cs.CV TIER_1 · Caixin Kang, Tianyu Yan, Sitong Gong, Mingfang Zhang, Liangyang Ouyang, Ruicong Liu, Bo Zheng, Huchuan Lu, Kaipeng Zhang, Yoichi Sato, Yifei Huang · 2026-05-22 04:00

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

arXiv:2605.22109v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) are increasingly deployed in human-facing roles where personality perception is critical, yet existing benchmarks evaluate this capability solely on numerical Big Five score prediction, lea…
