PulseAugur
EN
LIVE 23:04:57

New attack method enhances adversarial transferability in MLLMs

Researchers have developed FRA-Attack, a novel method to improve the transferability of adversarial attacks against multimodal large language models (MLLMs). This technique utilizes frequency-domain regularization to align perturbations with shared visual cues across different models, overcoming limitations of existing spatial-domain approaches. Experiments on 15 MLLMs demonstrate FRA-Attack's superior performance, particularly against models like GPT-5.4, Claude-Opus-4.6, and Gemini-3-flash. AI

IMPACT Enhances understanding of MLLM vulnerabilities and informs security research.

RANK_REASON The cluster contains an academic paper detailing a new research method.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

COVERAGE [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

    Multimodal large language models (MLLMs) remain vulnerable to transfer-based targeted attacks, where perturbations optimized on open-source surrogate encoders can generalize to closed-source MLLMs. A key challenge for improving adversarial transferability is to effectively captur…

  2. arXiv stat.ML TIER_1 English(EN) · Leitao Yuan, Qinghua Mao, Daizong Liu, Kun Wang, Wenjie Wang, Yan Teng, Jing Shao, Dongrui Liu ·

    Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

    arXiv:2605.21541v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) remain vulnerable to transfer-based targeted attacks, where perturbations optimized on open-source surrogate encoders can generalize to closed-source MLLMs. A key challenge for improving ad…

  3. arXiv stat.ML TIER_1 English(EN) · Dongrui Liu ·

    Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

    Multimodal large language models (MLLMs) remain vulnerable to transfer-based targeted attacks, where perturbations optimized on open-source surrogate encoders can generalize to closed-source MLLMs. A key challenge for improving adversarial transferability is to effectively captur…