PulseAugur
实时 08:05:08
English(EN) Safe Alone, Unsafe Together: Safeguarding Against Implicit Toxicity When Benign Images Combine

新的AI安全挑战:已识别出多图像隐性毒性(MIIT)

研究人员提出了一个名为多图像隐性毒性(MIIT)的新AI安全挑战,其中看似良性的图像组合会产生有害的语义。为解决此问题,他们开发了MIIT数据集并训练了一个名为MiShield的模型。该系统内的MiShield-8B模型在识别MIIT方面表现优于现有的商业审核服务和大型模型,并能对贡献实体进行明确分析。 AI

影响 引入了一个新颖的AI安全挑战和一个解决该挑战的模型,有望改进多图像格式的内容审核。

排序理由 介绍AI安全新概念和数据集的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新的AI安全挑战:已识别出多图像隐性毒性(MIIT)

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Jiaxian Lv, Shiyao Cui, Yingkang Wang, Guoxin Wu, Qingling Zhang, Minlie Huang ·

    Safe Alone, Unsafe Together: Safeguarding Against Implicit Toxicity When Benign Images Combine

    arXiv:2607.00576v1 Announce Type: new Abstract: Multi-image content has become an increasingly prevalent form of visual communication in social media, giving rise to a new safety issue, multi-image implicit toxicity (MIIT), where each image appears benign in isolation, but harmfu…

  2. arXiv cs.CL TIER_1 English(EN) · Minlie Huang ·

    单独安全,组合不安全:防范良性图像组合时的隐性毒性

    Multi-image content has become an increasingly prevalent form of visual communication in social media, giving rise to a new safety issue, multi-image implicit toxicity (MIIT), where each image appears benign in isolation, but harmful semantics emerge when the images are interpret…