New attacks reveal privacy risks in retrieval-based in-context learning

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed two novel black-box membership inference attacks targeting retrieval-based in-context learning systems for document question answering. These attacks leverage query text prefixes to differentiate between member and non-member inputs, with one method using a reference model and the other employing a weighted-averaging scheme to eliminate the need for a reference model. Empirical evaluations demonstrate that these attacks are resilient to paraphrasing and outperform existing methods, while an adapted ensemble prompting defense effectively mitigates the privacy leakage. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Highlights potential privacy vulnerabilities in retrieval-augmented language models, necessitating stronger defenses for secure deployment.

RANK_REASON This is a research paper detailing novel membership inference attacks against a specific AI technique. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

paper
safety

COVERAGE [1]

arXiv cs.LG TIER_1 · Tejas Kulkarni, Antti Koskela, Laith Zumot · 2026-05-07 04:00

Membership Inference Attacks for Retrieval Based In-Context Learning for Document Question Answering

arXiv:2605.04116v1 Announce Type: cross Abstract: We show that remotely hosted applications employing in-context learning when augmented with a retrieval function to select in-context examples can be vulnerable to membership-inference attacks even when the service provider and us…

COVERAGE [1]

Membership Inference Attacks for Retrieval Based In-Context Learning for Document Question Answering

RELATED ENTITIES

RELATED TOPICS