New benchmark reveals LLMs pose greater privacy risks in group chats

By PulseAugur Editorial · [1 sources] · 2026-06-22 12:06

Researchers have introduced MuPPET, a new benchmark designed to evaluate the contextual privacy risks of large language model (LLM) assistants in multi-party conversations. Existing privacy benchmarks are limited to single-interlocutor settings, failing to capture the amplified risks present when an LLM handles sensitive data in group chats. Experiments using MuPPET demonstrate that LLMs, including frontier models and smaller open-weights models, leak significantly more private information in multi-party scenarios than previously understood. Current privacy defenses provide only partial protection and can degrade the utility of the LLM. AI

IMPACT Highlights significant privacy vulnerabilities in LLMs when used in group settings, potentially impacting enterprise adoption and data handling policies.

RANK_REASON The cluster describes a new academic paper introducing a novel benchmark for evaluating LLM privacy. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New benchmark reveals LLMs pose greater privacy risks in group chats

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Martin Gubri · 2026-06-22 12:06

MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations

LLM agents are increasingly deployed in multi-party environments, handling sensitive personal data on behalf of individual users, for instance in group chats. When such an agent discloses private information, it reaches every group member at once. This risk is structurally harder…

COVERAGE [1]

MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations

RELATED ENTITIES

RELATED TOPICS