BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM

BusterX++ MLLM unifies detection of AI-generated images and video

By the PulseAugur editorial team · [1 source] · 2026-06-17 04:00

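Researchers have developed BusterX++, a multimodal large language model (MLLM) designed to unify the detection and explanation of AI-generated content across images and video. The approach aims to exploit cross-modal synergy to address the growing problem of visual misinformation. The work also introduces a new benchmark, GenBuster-Bench++, to support research in this area. Notably, the study finds that a single-stage reinforcement learning strategy driven by sparse rewards can match or even exceed the conventional pipeline of supervised fine-tuning followed by reinforcement learning, which suggests that the higher policy entropy of pure reinforcement learning helps cross-modal capabilities develop.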
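The terms "sparse reward" and "policy entropy" can be made concrete with a toy calculation. The sketch below is illustrative only and is not taken from the paper: the function names, the "real"/"fake" labels, and the example distributions are all assumptions. It shows a reward that fires only when the final verdict is correct, and the Shannon entropy of a categorical policy, which is higher when probability mass is spread over more options.

```python
import math


def sparse_reward(predicted_label: str, true_label: str) -> float:
    """Hypothetical sparse reward: 1.0 only if the final verdict is correct.

    No intermediate credit is given; this reward shape is an assumption,
    not the paper's actual reward definition.
    """
    return 1.0 if predicted_label == true_label else 0.0


def policy_entropy(probs: list[float]) -> float:
    """Shannon entropy (in nats) of a single categorical distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


# Two made-up next-token distributions over four candidate tokens.
peaked = [0.97, 0.01, 0.01, 0.01]   # low entropy: little exploration
spread = [0.40, 0.30, 0.20, 0.10]   # higher entropy: more exploration

print(sparse_reward("fake", "fake"), sparse_reward("real", "fake"))
print(policy_entropy(peaked) < policy_entropy(spread))
```

A policy with the second kind of distribution samples a wider range of responses, which is the sense in which higher entropy is usually linked to exploration during reinforcement learning.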

Impact: This research may help in building more robust tools to combat AI-generated misinformation across different media types.

Ranking rationale: This cluster covers a new research paper on AI-generated content detection that presents a novel model and benchmark.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.CV · Haiquan Wen, Tianxiao Li, Zhenglin Huang, Yiwei He, Guangliang Cheng · 2026-06-17 04:00

BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM

arXiv:2507.14632v4. Abstract: The rapid advancement of generative AI has substantially improved image and video synthesis, amplifying the risk of multimodal visual misinformation. Recent MLLMs have shown promise for transparent AI-generated content detection…

