Researchers have introduced SEP-Attack, a novel method for generating adversarial text attacks that are transferable to different models. This approach utilizes Determinantal Point Process to create diverse ensemble weights, improving the representation of submodel transferability. SEP-Attack also employs a new metric for evaluating prediction confidence to better estimate word importance and generate adversarial candidates, outperforming existing methods on multiple datasets and real-world APIs. AI
影响 This research introduces a more effective method for generating transferable adversarial text attacks, potentially improving the robustness and security of NLP models.
排序理由 The cluster contains a research paper detailing a new method for adversarial attacks on text. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →