Researchers have developed Attribution-Guided Masking (AGM), a novel training technique designed to improve the generalization capabilities of pre-trained Transformer models in sentiment classification tasks. AGM addresses the performance degradation observed when models transfer to out-of-domain data by identifying and penalizing domain-specific spurious tokens during fine-tuning. This method, which does not require target-domain labels, demonstrated competitive performance in zero-shot transfer settings and offers interpretability by highlighting features that drive generalization gaps. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT This method could improve the robustness of NLP models when applied to new domains, reducing the need for extensive re-training.
RANK_REASON This is a research paper detailing a new method for improving model generalization.