PulseAugur
EN
LIVE 07:52:37

New defense framework VPA-Guard tackles visual prompt attacks in AI video generation

Researchers have introduced VPA-Guard, a novel defense framework designed to protect image-to-video (I2V) generation models from visual prompt attacks. These attacks exploit visual cues like arrows or sketches to manipulate models into generating harmful content. To address this, the team also developed VVA-Bench, the first benchmark specifically for evaluating I2V safety against these types of attacks. Experiments on VVA-Bench showed that current state-of-the-art models are highly vulnerable, with success rates reaching 100% on some models. VPA-Guard, utilizing retrieval augmentation and self-evolving capabilities, significantly reduces attack success rates and harmfulness scores while preserving model utility. AI

IMPACT Enhances safety protocols for AI video generation, potentially enabling more responsible deployment of multimodal AI systems.

RANK_REASON The cluster contains a research paper detailing a new defense mechanism and benchmark for AI model safety. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

New defense framework VPA-Guard tackles visual prompt attacks in AI video generation

COVERAGE [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks

    Recent advancements in Image-to-Video (I2V) generation have transformed input images from simple appearance references into interactive control interfaces where visual cues such as arrows, sketches, and emojis orchestrate complex video dynamics with unprecedented controllability.…

  2. arXiv cs.CV TIER_1 English(EN) · Yining Sun, Haoyu Kang, Jiajun Wu, Heng Zhang, Danyang Zhang, Zhenjun Zhao, Haochen Han, Fangming Liu, Wai Kin Victor Chan, Alex Jinpeng Wang ·

    VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks

    arXiv:2606.25592v1 Announce Type: new Abstract: Recent advancements in Image-to-Video (I2V) generation have transformed input images from simple appearance references into interactive control interfaces where visual cues such as arrows, sketches, and emojis orchestrate complex vi…

  3. arXiv cs.CV TIER_1 English(EN) · Alex Jinpeng Wang ·

    VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks

    Recent advancements in Image-to-Video (I2V) generation have transformed input images from simple appearance references into interactive control interfaces where visual cues such as arrows, sketches, and emojis orchestrate complex video dynamics with unprecedented controllability.…