PulseAugur
Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing

Goku dataset and benchmark advance instruction-based video editing

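Researchers have introduced Goku, a new dataset and benchmark designed for instruction-based video editing. Goku contains 2 million video editing pairs and goes beyond simple appearance edits to cover complex multi-task and structural manipulations. The accompanying Goku-Edit model uses a multimodal large language model for instruction understanding and improves on existing open-source models by 8% on the newly proposed Goku-Bench benchmark.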

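Impact: Improves the capabilities of instruction-based video editing and may lead to more sophisticated and creative video processing tools.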

Ranking rationale: This cluster describes an academic paper introducing a video editing dataset, benchmark, and model.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing

    A large-scale video editing dataset and model are introduced that support multi-task and structural manipulations through advanced data synthesis and network architectures.

  2. arXiv cs.CV TIER_1 English(EN) · Sen Liang, Cong Wang, Zhentao Yu, Fengbin Guan, Zhengguang Zhou, Teng Hu, Youliang Zhang, Yuan Zhou, Xin Li, Qinglin Lu, Zhibo Chen ·

    Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing

    arXiv:2606.30599v1. Abstract: Existing instruction-based video editing datasets commonly focus on single-task appearance editing, failing to meet the complex creative demands of real-world scenarios. To bridge this gap, we present Goku, a large-scale dataset fea…

  3. arXiv cs.CV TIER_1 English(EN) · Zhibo Chen ·

    Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing

    Existing instruction-based video editing datasets commonly focus on single-task appearance editing, failing to meet the complex creative demands of real-world scenarios. To bridge this gap, we present Goku, a large-scale dataset featuring 2 million high-quality, instruction-align…