English(EN) SEDiT: Mask-Free Video Subtitle Erasure via One-step Diffusion Transformer

新型扩散模型一步擦除视频字幕

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-14 14:37

研究人员开发了SEDiT，一种新颖的一阶段扩散Transformer模型，用于无遮罩视频字幕擦除。该方法直接移除字幕，无需预先提取遮罩，改进了依赖分割精度的现有两阶段方法。SEDiT利用一步生成过程，并通过Lipschitz连续性进行理论论证，并采用带有第一帧条件约束的混合训练策略，以确保长期的时间一致性。该模型通过其分块流式推理能力，能够高效处理高分辨率和长时视频。 AI

影响为视频编辑任务（如字幕移除）引入了一种更有效的方法。

排序理由发布了一篇详细介绍新型AI模型和方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Yunlong Bai · 2026-05-14 14:37

SEDiT: Mask-Free Video Subtitle Erasure via One-step Diffusion Transformer

Recent breakthroughs in video diffusion models have significantly accelerated the development of video editing techniques. However, existing methods often rely on inpainting video frames based on masked input, which requires extracting the target video mask in advance, and the pr…

报道来源 [1]

SEDiT: Mask-Free Video Subtitle Erasure via One-step Diffusion Transformer

相关实体

相关话题