New benchmark reveals multilingual safety gaps in vision-language models

By the PulseAugur editorial team · [1 source] · 2026-06-09 04:00

Researchers have developed MLingualFC, a multilingual benchmark for testing the safety vulnerabilities of vision-language models (VLMs). The benchmark uses flowchart images that encode harmful instructions in five languages: Hindi, Punjabi, Spanish, Romanian, and German. Evaluations of models including Qwen2.5-VL, Gemma-4, and Pangea found that visual attacks are highly successful in Latin-script languages, indicating that current safety measures do not generalize well across languages and modalities.

Impact: Highlights the need for more robust, multilingual safety alignment in advanced AI models.

Ranking rationale: The cluster contains an academic paper introducing a new benchmark for evaluating AI model safety.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.AI TIER_1 English(EN) · Rishabh Makwana, Mamta, Deeksha Varshney, Oana Cocarascu · 2026-06-09 04:00

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models

arXiv:2606.07706v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have demonstrated strong performance across multimodal tasks, yet their safety robustness remains an open challenge. While prior work has shown that structured visual prompts such as flowcharts can ef…
