SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack
Researchers have introduced SEP-Attack, a novel method for generating adversarial text attacks that are transferable to different models. This approach utilizes Determinantal Point Process to create diverse ensemble weights, improving the representation of submodel transferability. SEP-Attack also employs a new metric for evaluating prediction confidence to better estimate word importance and generate adversarial candidates, outperforming existing methods on multiple datasets and real-world APIs. AI
IMPACT This research introduces a more effective method for generating transferable adversarial text attacks, potentially improving the robustness and security of NLP models.