New metric reveals LLM unlearning methods fail to fully forget sensitive data

By PulseAugur Editorial · [1 sources] · 2026-05-29 04:00

A new research paper introduces \"Leak@k\", a metric designed to evaluate the effectiveness of unlearning methods in large language models (LLMs). The study found that most current unlearning techniques fail to completely remove sensitive information, as it can still be retrieved through probabilistic decoding. To address this, the paper proposes a new algorithm called \"RULE\" (Robust Unlearning under LEak@k metric) which demonstrates improved performance in preventing information leakage on benchmark datasets. AI

IMPACT Current LLM unlearning methods are insufficient for robust data removal, necessitating new techniques like RULE to ensure privacy and compliance.

RANK_REASON Academic paper introducing a new metric and algorithm for evaluating LLM unlearning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New metric reveals LLM unlearning methods fail to fully forget sensitive data

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Hadi Reisizadeh, Jiajun Ruan, Yiwei Chen, Soumyadeep Pal, Sijia Liu, Mingyi Hong · 2026-05-29 04:00

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

arXiv:2511.04934v3 Announce Type: replace Abstract: Unlearning in large language models (LLMs) is critical for regulatory compliance and for building ethical generative AI systems that avoid producing private, toxic, illegal, or copyrighted content. Despite rapid progress, in thi…

COVERAGE [1]

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

RELATED ENTITIES

RELATED TOPICS