Brief · PulseAugur

TOOL · arXiv cs.AI English (EN) · 8h

InstantForget: Update-Free Backdoor Unlearning with Inference-Time Feature Reset

Researchers have proposed InstantForget, a method for neutralizing backdoor triggers in AI models without retraining or updating model weights. It operates at inference time, identifying anomalous features that indicate a backdoor and resetting them. In tests on CIFAR-10 with ResNet-18 models, InstantForget significantly reduced the average attack success rate across various triggers while maintaining model utility.

IMPACT: Offers a way to remove backdoors from AI models without costly retraining.
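
The brief does not spell out the algorithm, so the following is only a rough sketch of what "identify anomalous features and reset them at inference time" could look like. It assumes a Mahalanobis-style distance (listed among the related terms below), simplified here to a diagonal covariance so that it runs on the Python standard library alone. The function names, the thresholds, and the choice to reset flagged dimensions to the clean mean are all hypothetical and are not taken from the paper.

```python
import math
import statistics


def fit_clean_stats(clean_features):
    """Estimate per-dimension mean and variance from clean feature vectors.

    Assumption: a small set of trusted, trigger-free feature vectors is available.
    """
    dims = list(zip(*clean_features))
    mean = [statistics.fmean(d) for d in dims]
    # Small constant keeps the division stable when a dimension never varies.
    var = [statistics.pvariance(d) + 1e-6 for d in dims]
    return mean, var


def mahalanobis_diag(x, mean, var):
    """Mahalanobis distance under a diagonal covariance (a simplification)."""
    return math.sqrt(sum((xi - m) ** 2 / v for xi, m, v in zip(x, mean, var)))


def reset_if_anomalous(x, mean, var, threshold=3.0, z_limit=3.0):
    """Return (features, was_reset).

    If the whole vector lies farther than `threshold` from the clean
    statistics, every dimension whose z-score exceeds `z_limit` is reset
    to the clean mean. Model weights are never touched.
    """
    if mahalanobis_diag(x, mean, var) <= threshold:
        return list(x), False
    repaired = [
        m if abs(xi - m) / math.sqrt(v) > z_limit else xi
        for xi, m, v in zip(x, mean, var)
    ]
    return repaired, True


# Hypothetical 2-D features standing in for a network's penultimate layer.
clean = [[0.0, 1.0], [0.2, 0.8], [-0.2, 1.2], [0.1, 0.9], [-0.1, 1.1]]
mean, var = fit_clean_stats(clean)

benign, benign_reset = reset_if_anomalous([0.1, 1.0], mean, var)
suspect, suspect_reset = reset_if_anomalous([5.0, 1.0], mean, var)
print(benign_reset, suspect_reset)
```

In a real pipeline the repaired vector would be passed on to the classifier head in place of the original features. A full covariance matrix, per-class statistics, or a different reset target may be closer to what the authors do; the paper itself is the authority on those choices.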

Hugging Face
CIFAR-10
ResNet-18
Mahalanobis distance
InstantForget
BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain
ModelNet10