New attack manipulates vision-language models via image-only prompts

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed a new cross-modal prompt injection attack called CrossMPI that can manipulate the interpretation of both text and image inputs in large vision-language models (LVLMs) through image-only perturbations. This attack targets the model's hidden state space, which is larger and more influential than the visual embedding space typically used in previous attacks. The method employs a layer selection strategy to focus on critical multimodal integration layers and a distance-decremental budget assignment to constrain image perturbations, outperforming existing approaches in experiments. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT This research highlights a new vulnerability in vision-language models, potentially impacting their secure deployment and requiring new defense mechanisms.

RANK_REASON The cluster contains a new academic paper detailing a novel attack method. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

paper
safety

COVERAGE [1]

arXiv cs.CV TIER_1 · JianFeng Ma · 2026-05-15 15:47

A Cross-Modal Prompt Injection Attack against Large Vision-Language Models with Image-Only Perturbation

Large vision-language models (LVLMs) have emerged as a powerful paradigm for multimodal intelligence, but their growing deployment also expands the attack surface of prompt injection. Despite this growing concern, existing attacks still suffer from a critical limitation: the inje…

COVERAGE [1]

A Cross-Modal Prompt Injection Attack against Large Vision-Language Models with Image-Only Perturbation

RELATED ENTITIES

RELATED TOPICS