Researchers have developed a new framework called DIFE to evaluate the security vulnerabilities of Contrastive Language-Image Pre-training (CLIP) models when reused across different interfaces. The study found that backdoors in CLIP models do not guarantee continued effectiveness when applied to new tasks, and exposure is dependent on specific model components. To address a identified gap, a new method called BadTextTower was introduced, which creates a reusable carrier for adversarial behavior in the text encoder. AI
IMPACT New auditing framework reveals that CLIP model backdoors may not transfer effectively to downstream tasks, highlighting component-specific risks.
RANK_REASON The cluster contains a research paper published on arXiv detailing a new framework and method for auditing AI model vulnerabilities.
- arXiv
- BadTextTower
- Hugging Face
- alphaXiv
- CatalyzeX
- Connected Papers
- CORE Recommender
- DagsHub
- Gotit.pub
- Influence Flower
- Litmaps
- ScienceCast
- scite Smart Citations
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →