Researchers have developed EntropyScan, a new method for detecting backdoors in Large Vision-Language Models (LVLMs). This approach is model-level and does not require knowledge of the training data or specific attack triggers. EntropyScan identifies backdoors by analyzing anomalies in the visual attention allocation of LVLMs when processing benign samples, indicating a disruption in cross-modal alignment. The method utilizes Tsallis entropy to quantify these distortions, achieving high accuracy in experiments. AI
影响 Introduces a novel method for detecting security vulnerabilities in vision-language models, crucial for safe deployment.
排序理由 Academic paper introducing a new method for backdoor detection in LVLMs. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →