RelayFormer框架统一图像和视频篡改定位

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-11 04:00

研究人员推出RelayFormer，一个旨在改进图像和视频中篡改区域定位的新框架。这种统一的方法解决了现有方法在分辨率多样性和图像视频数据单独处理方面的挑战。RelayFormer利用全局局部中继（GLR）令牌和基于中继的注意力机制，在保留细粒度篡改伪影的同时，有效地交换上下文信息。 AI

影响引入了一种统一的视觉篡改定位方法，有望提高检测篡改媒体的效率和准确性。

排序理由这是一篇描述新技术框架的研究论文。

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Wen Huang, Jiarui Yang, Tao Dai, Jiawei Li, Shaoxiong Zhan, Bin Wang, Shu-Tao Xia · 2026-06-11 04:00

RelayFormer: A Unified Local-Global Attention Framework for Scalable Image and Video Manipulation Localization

arXiv:2508.09459v3 Announce Type: replace-cross Abstract: Visual manipulation localization (VML) aims to identify tampered regions in images and videos, a task that has become increasingly challenging with the rise of advanced editing tools. Existing methods face two central issu…