New attack framework targets AI models with theoretical guarantees

By PulseAugur Editorial · [1 sources] · 2026-05-25 04:00

Researchers have developed a new framework for adversarial attacks on AI models, focusing on hard-label black-box scenarios where only the top prediction is accessible. Their approach introduces a novel zero-query initialization strategy and a Pattern-Driven Optimization algorithm, grounded in theoretical analysis that links existing methods to gradient sign approximation. This method demonstrates superior efficiency and success rates compared to state-of-the-art attacks across various datasets and model types, including commercial APIs and CLIP models, while also showing robustness against data corruption and specialized tasks like segmentation. AI

IMPACT This research introduces a more efficient and theoretically grounded method for adversarial attacks, potentially impacting AI model security and robustness testing.

RANK_REASON The cluster contains an academic paper detailing a new method for adversarial attacks on AI models. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Jun Liu, Leo Yu Zhang, Fengpeng Li, Isao Echizen, Jiantao Zhou · 2026-05-25 04:00

Low-Cost Hard-Label Adversarial Attack with Theoretical Foundations

arXiv:2601.14300v3 Announce Type: replace Abstract: Hard-label black-box attacks, relying solely on top-1 predictions, represent one of the most challenging yet practically threat models. Despite recent progress, existing approaches face two key limitations: (1) they overlook the…

COVERAGE [1]

Low-Cost Hard-Label Adversarial Attack with Theoretical Foundations

RELATED ENTITIES

RELATED TOPICS