English(EN)Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure
新研究探索先进的机器学习遗忘技术
作者PulseAugur 编辑部·[5 个来源]·
研究人员正在开发新的机器学习遗忘方法,旨在无需完全重新训练即可从训练好的模型中移除特定数据的影响。一种方法是“局部附带遗忘”,该方法发现遗忘失败可能集中在要移除的数据附近,并提出“局部教师蒸馏”来缓解这种情况。另一个框架“表示遗忘”侧重于转换表示以压缩信息,从而实现更好的效用保留和效率。第三种方法“感知表示遗忘”使用激活签名来抑制内部表示,在对抗性实体恢复方面显示出显著的减少。
AI
arXiv:2604.05634v2 Announce Type: replace Abstract: Machine unlearning (MU) has become a critical technique for GenAI models' safe and compliant operation. While existing MU methods are effective, most impose prohibitive training time and computational overhead. Our analysis sugg…
arXiv cs.LG
TIER_1English(EN)·Polina Dolgova, Sebastian U. Stich·
arXiv:2605.31317v1 Announce Type: new Abstract: Machine unlearning aims to remove the influence of selected training examples without full retraining. Standard evaluations often summarize unlearning quality with aggregate metrics, such as accuracy- and forgetting-based scores, wh…
Machine unlearning aims to remove the influence of selected training examples without full retraining. Standard evaluations often summarize unlearning quality with aggregate metrics, such as accuracy- and forgetting-based scores, which can hide localized failures. We study this f…
arXiv cs.LG
TIER_1English(EN)·Antonio Almud\'evar, Alfonso Ortega·
arXiv:2601.21564v2 Announce Type: replace Abstract: Machine unlearning seeks to remove the influence of specific training data from a model, a need driven by privacy regulations and robustness concerns. Existing approaches typically modify model parameters, but such updates can b…
arXiv:2601.10566v5 Announce Type: replace Abstract: Entity-level unlearning is usually evaluated by what a model says: whether it stops naming the target, refuses a query, or shifts a Truth Ratio distribution. These output-level tests, however, do not show whether a subject's int…