English(EN) Safe Alone, Unsafe Together: Safeguarding Against Implicit Toxicity When Benign Images Combine

新的AI安全挑战：已识别出多图像隐性毒性（MIIT）

作者 PulseAugur 编辑部 · [2 个来源] · 2026-07-01 07:59

研究人员提出了一个名为多图像隐性毒性（MIIT）的新AI安全挑战，其中看似良性的图像组合会产生有害的语义。为解决此问题，他们开发了MIIT数据集并训练了一个名为MiShield的模型。该系统内的MiShield-8B模型在识别MIIT方面表现优于现有的商业审核服务和大型模型，并能对贡献实体进行明确分析。 AI

影响引入了一个新颖的AI安全挑战和一个解决该挑战的模型，有望改进多图像格式的内容审核。

排序理由介绍AI安全新概念和数据集的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Jiaxian Lv, Shiyao Cui, Yingkang Wang, Guoxin Wu, Qingling Zhang, Minlie Huang · 2026-07-02 04:00

Safe Alone, Unsafe Together: Safeguarding Against Implicit Toxicity When Benign Images Combine

arXiv:2607.00576v1 Announce Type: new Abstract: Multi-image content has become an increasingly prevalent form of visual communication in social media, giving rise to a new safety issue, multi-image implicit toxicity (MIIT), where each image appears benign in isolation, but harmfu…
arXiv cs.CL TIER_1 English(EN) · Minlie Huang · 2026-07-01 07:59

单独安全，组合不安全：防范良性图像组合时的隐性毒性

Multi-image content has become an increasingly prevalent form of visual communication in social media, giving rise to a new safety issue, multi-image implicit toxicity (MIIT), where each image appears benign in isolation, but harmful semantics emerge when the images are interpret…

报道来源 [2]

Safe Alone, Unsafe Together: Safeguarding Against Implicit Toxicity When Benign Images Combine

单独安全，组合不安全：防范良性图像组合时的隐性毒性

相关实体

相关话题