PulseAugur

KFC-W model generates 3D-consistent videos from unposed internet photos

Researchers have developed KFC-W, a self-supervised method for generating 3D-consistent videos from unposed internet photos. The approach uses the inherent consistency of videos and the variability of multi-view images to train a 3D-aware video model without explicit 3D annotations. The summary reports better geometric and appearance consistency than existing baselines, and the method shows potential for applications that require camera control, such as 3D Gaussian Splatting.

Impact: Introduces a new method for generating 3D-consistent videos from unposed images, potentially improving scene understanding and applications like 3D reconstruction.

Ranking rationale: This is a research paper detailing a new method for video generation from images.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

  1. arXiv cs.CV · English · Gene Chou, Kai Zhang, Sai Bi, Hao Tan, Zexiang Xu, Fujun Luan, Bharath Hariharan, Noah Snavely

    KFC-W: Generating 3D-Consistent Videos from Unposed Internet Photos

    arXiv:2411.13549v2 Announce Type: replace Abstract: We address the problem of generating videos from unposed internet photos. A handful of input images serve as keyframes, and our model interpolates between them to simulate a path moving between the cameras. Given random images, …