English(EN) SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

新的SocialPersona基准测试了MLLMs从社交媒体推断用户偏好的能力

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-25 06:31

研究人员推出了SocialPersona，这是一个新的基准测试，旨在评估多模态大语言模型（MLLMs）从社交媒体数据中推断用户偏好能力。该基准测试使用了来自171位社交媒体用户的纵向时间线，包含文本、图像和时间戳，以及经过人类验证的偏好标签。SocialPersona支持构建用户画像和生成个性化响应等任务，实验表明，虽然MLLMs可以识别广泛的兴趣，但它们在细粒度和近期偏好方面存在困难，这凸显了跨模态用户建模的一个关键挑战。 AI

影响该基准测试旨在推动能够推断用户偏好并据此采取行动的AI助手的开发，从而可能带来更个性化、更有效的AI交互。

排序理由该集群描述了一个用于评估多模态大语言模型的新学术基准测试。

在 arXiv cs.IR (Information Retrieval) 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Qinkai Zhang, Yanyan Zhao, Xin Lu, Yulin Hu, Pengtao Han, Bing Qin · 2026-06-26 04:00

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

arXiv:2606.26654v1 Announce Type: new Abstract: Personalized language-model assistants are often evaluated through a memory lens: can a model recall preferences users have explicitly stated in dialogue? More comprehensive personalization demands a harder capability -- inferring w…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Bing Qin · 2026-06-25 06:31

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

Personalized language-model assistants are often evaluated through a memory lens: can a model recall preferences users have explicitly stated in dialogue? More comprehensive personalization demands a harder capability -- inferring what users care about from the multimodal traces …

报道来源 [2]

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

SocialPersona: Benchmarking Personalized Profiling and Response with Multimodal Social-Media Context

相关实体

相关话题