English(EN) Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

新框架审计CLIP模型跨接口后门漏洞

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-16 11:41

研究人员开发了一个名为DIFE的新框架，用于评估对比语言图像预训练（CLIP）模型在跨不同接口重用时的安全漏洞。研究发现，CLIP模型中的后门并不能保证在新任务上的持续有效性，其暴露程度取决于特定的模型组件。为了解决已识别的差距，引入了一种名为BadTextTower的新方法，该方法为文本编码器中的对抗性行为创建了一个可重用的载体。 AI

影响新的审计框架揭示CLIP模型后门可能无法有效迁移到下游任务，突显了组件特定的风险。

排序理由该集群包含一篇在arXiv上发表的研究论文，详细介绍了一个用于审计AI模型漏洞的新框架和方法。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Kunlan Xiang, Haomiao Yang, Wenbo Jiang · 2026-06-17 04:00

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

arXiv:2606.17815v1 Announce Type: cross Abstract: Contrastive Language-Image Pre-training models are widely reused across downstream interfaces, including feature extraction, retrieval, reranking, and selection. Existing CLIP backdoor, however, usually validate attacks on a small…
arXiv cs.CL TIER_1 English(EN) · Wenbo Jiang · 2026-06-16 11:41

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

Contrastive Language-Image Pre-training models are widely reused across downstream interfaces, including feature extraction, retrieval, reranking, and selection. Existing CLIP backdoor, however, usually validate attacks on a small attack-native task, leaving unclear whether the s…

报道来源 [2]

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

相关实体

相关话题