EntropyScan detects LVLM backdoors using visual attention anomalies

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-15 08:01

Researchers have developed EntropyScan, a new method for detecting backdoors in Large Vision-Language Models (LVLMs). This approach is model-level and does not require knowledge of the training data or specific attack triggers. EntropyScan identifies backdoors by analyzing anomalies in the visual attention allocation of LVLMs when processing benign samples, indicating a disruption in cross-modal alignment. The method utilizes Tsallis entropy to quantify these distortions, achieving high accuracy in experiments. AI

影响 Introduces a novel method for detecting security vulnerabilities in vision-language models, crucial for safe deployment.

排序理由 Academic paper introducing a new method for backdoor detection in LVLMs. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Xilin Chen · 2026-05-15 08:01

EntropyScan: Towards Model-level Backdoor Detection in LVLMs via Visual Attention Entropy

Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities across various tasks, yet they remain vulnerable to backdoor attacks. Existing defense methods predominantly focus on sample-level defense, which relies on the knowledge of training data or triggers. H…

报道来源 [1]

EntropyScan: Towards Model-level Backdoor Detection in LVLMs via Visual Attention Entropy

相关实体

相关话题