English(EN) Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure

新研究探索高效且鲁棒的机器学习擦除技术

作者 PulseAugur 编辑部 · [13 个来源] · 2026-05-27 04:00

研究人员正在开发新的机器学习擦除方法，旨在无需完全重新训练即可从训练模型中去除特定数据的影响。多篇论文提出了实现更高效、更鲁棒擦除的新技术。这些方法侧重于在确保遗忘的知识不易被恢复的同时，保持模型的效用，即使在持续训练或对抗性攻击下也是如此。 AI

影响机器学习擦除技术的发展对于确保人工智能的安全、合规和负责任的部署至关重要，尤其是在模型日益集成到敏感应用中的情况下。

排序理由多篇学术论文提出新的机器学习擦除方法。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 13 个来源。我们如何撰写摘要 →

报道来源 [13]

arXiv cs.LG TIER_1 English(EN) · Zizhao Hu, Ameya Godbole, Johnny Tian-Zheng Wei, Mohammad Rostami, Jesse Thomason, Robin Jia · 2026-06-05 04:00

SHRED：通过带Logit降级的自蒸馏实现保留集自由的遗忘

arXiv:2605.07482v2 Announce Type: replace Abstract: Machine unlearning for large language models (LLMs) aims to selectively remove memorized content such as private data, copyrighted text, or hazardous knowledge, without costly full retraining. Most existing methods require a ret…
arXiv cs.LG TIER_1 English(EN) · Hanwei Tan, Wentai Wu, Ligang He, Yijun Quan · 2026-06-03 04:00

Lethe：用于联邦知识遗忘的适配器增强双流更新

arXiv:2601.22601v2 Announce Type: replace Abstract: Federated unlearning (FU) aims to erase designated client-level, class-level, or sample-level knowledge from a global model. Existing studies commonly assume that the collaboration ends with the unlearning operation, overlooking…
arXiv cs.AI TIER_1 English(EN) · Archie Chaudhury · 2026-06-03 04:00

遗忘并非抹除：通过传输密钥恢复潜在知识

arXiv:2606.02860v1 Announce Type: cross Abstract: Catastrophic forgetting is often framed as a representational problem: after sequential training, a model appears to lose the features that supported performance on earlier tasks. We challenge the stronger form of this view. Acros…
arXiv cs.CL TIER_1 English(EN) · Clara Haya Suslik, Or Shafran, Mor Geva · 2026-06-03 04:00

别忘了你的嵌入：通过精确编辑嵌入实现鲁棒知识擦除

arXiv:2606.03695v1 Announce Type: new Abstract: As language models are increasingly deployed in real-world applications, the ability to erase specific knowledge from them becomes critical for safety and compliance. Prominent methods seek persistent removal by updating the model's…
arXiv cs.LG TIER_1 English(EN) · Federico Di Gennaro, Alexander Shevchenko, Fanny Yang · 2026-06-03 04:00

通过边际自校正实现快速大规模遗忘

arXiv:2606.02920v1 Announce Type: new Abstract: Language-model unlearning updates a trained model to behave as if it had not seen selected training examples, while preserving utility and avoiding costly retraining. Existing approaches typically fine-tune the pretrained model with…
arXiv cs.AI TIER_1 English(EN) · Vedant Jawandhia, Daksh Ahuja, Ghufran Alam Siddiqui, Prashant Trivedi, Yash Sinha, Pratik Narang · 2026-06-03 04:00

PURGE：通过保留引导擦除实现的预测性遗忘

arXiv:2606.03808v1 Announce Type: cross Abstract: We propose PURGE, a machine unlearning algorithm built on a simple but an under-exploited observation: continual learning (CL) and machine unlearning (MU) which are fundamentally dual problems. CL tries to learn new tasks without …
arXiv cs.AI TIER_1 English(EN) · Pratik Narang · 2026-06-02 15:53

PURGE：通过保留引导擦除实现的预测性遗忘

We propose PURGE, a machine unlearning algorithm built on a simple but an under-exploited observation: continual learning (CL) and machine unlearning (MU) which are fundamentally dual problems. CL tries to learn new tasks without forgetting old ones; MU tries to erase specific da…
arXiv cs.CL TIER_1 English(EN) · Mor Geva · 2026-06-02 14:15

别忘了你的嵌入：通过精确编辑嵌入实现鲁棒知识擦除

As language models are increasingly deployed in real-world applications, the ability to erase specific knowledge from them becomes critical for safety and compliance. Prominent methods seek persistent removal by updating the model's parameters, yet the target knowledge often can …
arXiv cs.AI TIER_1 English(EN) · Zhiyong Ma, Zhitao Deng, Huan Tang, Jialin Chen, Zhijun Zheng, Zhengping Li, Qingyuan Chuai · 2026-06-02 04:00

PECKER：一种用于扩散模型中机器遗忘的精确高效关键知识擦除方法

arXiv:2604.05634v2 Announce Type: replace Abstract: Machine unlearning (MU) has become a critical technique for GenAI models' safe and compliant operation. While existing MU methods are effective, most impose prohibitive training time and computational overhead. Our analysis sugg…
arXiv cs.LG TIER_1 English(EN) · Polina Dolgova, Sebastian U. Stich · 2026-06-01 04:00

遗忘有邻居：机器学习遗忘中的局部性附带遗忘

arXiv:2605.31317v1 Announce Type: new Abstract: Machine unlearning aims to remove the influence of selected training examples without full retraining. Standard evaluations often summarize unlearning quality with aggregate metrics, such as accuracy- and forgetting-based scores, wh…
arXiv cs.LG TIER_1 English(EN) · Sebastian U. Stich · 2026-05-29 13:48

遗忘有邻居：机器学习遗忘中的局部性附带遗忘

Machine unlearning aims to remove the influence of selected training examples without full retraining. Standard evaluations often summarize unlearning quality with aggregate metrics, such as accuracy- and forgetting-based scores, which can hide localized failures. We study this f…
arXiv cs.LG TIER_1 English(EN) · Antonio Almud\'evar, Alfonso Ortega · 2026-05-29 04:00

表示遗忘：通过信息压缩实现遗忘

arXiv:2601.21564v2 Announce Type: replace Abstract: Machine unlearning seeks to remove the influence of specific training data from a model, a need driven by privacy regulations and robustness concerns. Existing approaches typically modify model parameters, but such updates can b…
arXiv cs.CL TIER_1 English(EN) · Syed Naveed Mahmood, Md. Rezaur Rahman Bhuiyan, Tasfia Zaman, Jareen Tasneem Khondaker, Md. Sameer Sakib, K. M. Shadman Wadith, Nazia Tasnim, Farig Sadeque · 2026-05-27 04:00

通过激活签名实现感知性知识遗忘：从抑制到实体-签名擦除

arXiv:2601.10566v5 Announce Type: replace Abstract: Entity-level unlearning is usually evaluated by what a model says: whether it stops naming the target, refuses a query, or shifts a Truth Ratio distribution. These output-level tests, however, do not show whether a subject's int…

报道来源 [13]

相关实体

相关话题