PulseAugur
EN
LIVE 14:27:40

AI Transfer Attacks: "Scissors Effect" Reveals Diversity Hinders Robust Models

Researchers have identified a phenomenon called the "Scissors Effect" in transfer attacks against AI models. This effect demonstrates that while random resizing and padding (Input Diversity or DI) generally improve attack success for standard models, they can significantly hinder attacks against robustly trained models. This counterintuitive finding, observed across various model architectures like CNNs, ViTs, and Swin Transformers on datasets such as ImageNet and CIFAR-10, suggests that DI's effectiveness is highly dependent on the model's training regime. The research attributes this effect to geometric properties of gradients, with resizing identified as a primary contributor to the performance degradation in robust models. A new rule, CG-DI, is proposed to selectively disable diversity measures when gradient consistency indicates potential harm, thereby preserving attack benefits on standard models while mitigating losses on robust ones. AI

IMPACT Reveals a critical trade-off in adversarial attack strategies, impacting model robustness evaluations and defenses.

RANK_REASON Academic paper detailing a novel phenomenon in AI model attacks. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

AI Transfer Attacks: "Scissors Effect" Reveals Diversity Hinders Robust Models

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Xiaojing Chen ·

    The Scissors Effect: When Resize-Based Input Diversity Helps or Hurts Transfer Attacks

    Input Diversity (DI), which applies random resizing and padding at each attack iteration, is a near-default ingredient of transfer-based adversarial attacks, widely assumed to improve transferability. We show this assumption is regime-dependent and, for robustly trained surrogate…