PulseAugur
实时 15:40:14

New SEP-Attack method enhances transferable adversarial text attacks

Researchers have introduced SEP-Attack, a novel method for generating adversarial text attacks that are transferable to different models. This approach utilizes Determinantal Point Process to create diverse ensemble weights, improving the representation of submodel transferability. SEP-Attack also employs a new metric for evaluating prediction confidence to better estimate word importance and generate adversarial candidates, outperforming existing methods on multiple datasets and real-world APIs. AI

影响 This research introduces a more effective method for generating transferable adversarial text attacks, potentially improving the robustness and security of NLP models.

排序理由 The cluster contains a research paper detailing a new method for adversarial attacks on text. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

报道来源 [1]

  1. arXiv cs.AI TIER_1 English(EN) · Han Liu, Zhi Xu, Xiaotong Zhang, Feng Zhang, Xiaoming Xu, Wei Wang, Fenglong Ma, Hong Yu ·

    SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack

    arXiv:2605.24958v1 Announce Type: cross Abstract: Despite the strong performance of deep neural networks in modern Web and language applications, they remain vulnerable to adversarial attacks, especially transferable attacks that generate adversarial examples using surrogate mode…