New SocialPersona benchmark tests MLLMs' ability to infer user preferences from social media

By PulseAugur Editorial · [2 sources] · 2026-06-25 06:31

Researchers have introduced SocialPersona, a new benchmark designed to evaluate the ability of multimodal large language models (MLLMs) to infer user preferences from social media data. The benchmark utilizes longitudinal timelines from 171 social media users, incorporating text, images, and timestamps, along with human-verified preference tags. SocialPersona supports tasks such as constructing user profiles and generating personalized responses, with experiments indicating that while MLLMs can identify broad interests, they struggle with fine-grained and recent preferences, highlighting a key challenge in cross-modal user modeling. AI

IMPACT This benchmark aims to advance the development of AI assistants that can infer and act on user preferences, potentially leading to more personalized and effective AI interactions.

RANK_REASON The cluster describes a new academic benchmark for evaluating multimodal large language models.

Read on arXiv cs.IR (Information Retrieval) →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New SocialPersona benchmark tests MLLMs' ability to infer user preferences from social media

COVERAGE [2]

arXiv cs.CL TIER_1 English(EN) · Qinkai Zhang, Yanyan Zhao, Xin Lu, Yulin Hu, Pengtao Han, Bing Qin · 2026-06-26 04:00

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

arXiv:2606.26654v1 Announce Type: new Abstract: Personalized language-model assistants are often evaluated through a memory lens: can a model recall preferences users have explicitly stated in dialogue? More comprehensive personalization demands a harder capability -- inferring w…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Bing Qin · 2026-06-25 06:31

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

Personalized language-model assistants are often evaluated through a memory lens: can a model recall preferences users have explicitly stated in dialogue? More comprehensive personalization demands a harder capability -- inferring what users care about from the multimodal traces …

COVERAGE [2]

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

RELATED ENTITIES

RELATED TOPICS