New Sparse Backdoor attack hides undetectable compromises in image classifiers

By PulseAugur Editorial Team · [1 source] · 2026-05-07 04:00

Researchers have developed a supply-chain attack called Sparse Backdoor that embeds a provably undetectable backdoor in pre-trained image classifiers, including convolutional networks and Vision Transformers. The method injects a sparse perturbation into fully connected layers and then masks it with a Gaussian dither. The dither creates a clean reference distribution, making it computationally infeasible to distinguish the backdoored model from the original, even with white-box access to the parameters.
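To give a feel for why a sparse change can vanish under Gaussian dither, here is a toy sketch in plain Python. It is not the paper's construction: the helper names (`dithered_copy`, `sparse_perturb`), the layer size, and the values of `sigma`, `k`, and `magnitude` are all assumptions chosen for illustration, and there is no trigger or classifier involved. It only shows that a crude summary statistic (the standard deviation of the weight residual) barely moves when a handful of coordinates are shifted beneath the noise.

```python
import random
import statistics

def dithered_copy(weights, sigma, rng):
    """Return the weights plus i.i.d. Gaussian noise with std `sigma`."""
    return [w + rng.gauss(0.0, sigma) for w in weights]

def sparse_perturb(weights, k, magnitude, rng):
    """Shift k randomly chosen coordinates by +/- magnitude (hypothetical helper)."""
    out = list(weights)
    for i in rng.sample(range(len(out)), k):
        out[i] += magnitude * rng.choice((-1.0, 1.0))
    return out

rng = random.Random(0)
n = 10_000          # stand-in for a flattened fully connected layer (assumed size)
sigma = 0.01        # dither scale (assumed)

original = [rng.gauss(0.0, 0.05) for _ in range(n)]

# "Clean reference": the original weights with dither only.
clean_ref = dithered_copy(original, sigma, rng)

# Toy "perturbed" model: sparse shift first, then the same kind of dither.
backdoored = dithered_copy(
    sparse_perturb(original, k=20, magnitude=0.01, rng=rng), sigma, rng
)

def residual_std(model):
    """Std of (model - original); a deliberately naive detection statistic."""
    return statistics.pstdev([m - o for m, o in zip(model, original)])

print("clean residual std:     ", residual_std(clean_ref))
print("perturbed residual std: ", residual_std(backdoored))
```

With these assumed values, the sparse shift adds only about k/n times magnitude squared to the residual variance, which is well inside the sampling noise of the estimate. The paper's claim is much stronger (indistinguishability against any efficient white-box detector), and this sketch says nothing about that guarantee.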

Impact: Highlights a sophisticated new attack vector for model supply chains, one that calls for stronger security measures around deployed AI systems.

Ranking rationale: Academic paper detailing a new method for embedding undetectable backdoors in image classification models.

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 source. How we write summaries →

Sources [1]

arXiv cs.LG TIER_1 English(EN) · Sarthak Choudhary, Atharv Singh Patlan, Nils Palumbo, Ashish Hooda, Kassem Fawaz, Somesh Jha · 2026-05-07 04:00

Undetectable Backdoors in Model Parameters: Hiding Sparse Secrets in High Dimensions

arXiv:2605.04209v1 Announce Type: cross Abstract: We present Sparse Backdoor, a supply-chain attack that plants a provably undetectable backdoor in pre-trained image classifiers, including convolutional networks and Vision Transformers. The attack injects a structured spar…

