PulseAugur / Brief
EN
LIVE 21:02:36

Brief

last 24h
[1/1] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

    Researchers have introduced a new benchmark and dataset called MM-OCEAN to evaluate how well multimodal large language models (MLLMs) can reason about personality. The study found that a significant portion of MLLMs, over 51%, provide correct personality assessments without grounding their judgments in observable behavioral evidence. This "Prejudice Gap" highlights a disconnect between accurate predictions and genuine understanding, suggesting a need for more robust evaluation methods for social cognition in AI. AI

    IMPACT Highlights a critical flaw in current MLLM evaluations, potentially impacting their deployment in human-facing roles and guiding future safety research.