InstantForget: Update-Free Backdoor Unlearning with Inference-Time Feature Reset
Researchers have developed a new method called InstantForget for removing backdoor triggers from AI models without requiring model retraining. This technique operates at inference time by identifying and resetting anomalous features that indicate a backdoor. In tests on CIFAR-10 with ResNet-18 models, InstantForget significantly reduced the average attack success rate across various triggers while maintaining model utility. AI
IMPACT Offers a novel approach to AI model security by enabling backdoor removal without costly retraining.