New algorithm reconstructs PII from finetuned language models

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-12 15:29

Researchers have developed a new decoding algorithm called COVA to reconstruct personally identifiable information (PII) from supervised finetuned language models. The study focused on sensitive domains like medical and legal settings, demonstrating that an adversary with even partial knowledge of the fine-tuning dataset can infer sensitive user data. The effectiveness of PII reconstruction varied by PII type, highlighting significant privacy risks associated with current fine-tuning practices. AI

影响 Reveals significant privacy risks in LLM fine-tuning, potentially impacting data handling and model deployment strategies.

排序理由 Academic paper detailing a new method for reconstructing PII from finetuned language models. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Alina Oprea · 2026-05-12 15:29

Reconstruction of Personally Identifiable Information from Supervised Finetuned Models

Supervised Finetuning (SFT) has become one of the primary methods for adapting a large language model (LLM) with extensive pre-trained knowledge to domain-specific, instruction-following tasks. SFT datasets, composed of instruction-response pairs, often include user-provided info…

报道来源 [1]

Reconstruction of Personally Identifiable Information from Supervised Finetuned Models

相关实体

相关话题