Track2View: 4D-Consistent Camera-Controlled Video Generation via Paired 3D Point Tracks
Researchers have developed Track2View, a novel method for generating videos from new camera viewpoints. This approach utilizes 3D point tracks to establish explicit spatiotemporal correspondences, ensuring temporal continuity and improving visual quality. Track2View conditions a video diffusion transformer with these paired 3D point tracks, enabling it to generalize to various camera trajectories without memorizing specific motions. The system has demonstrated state-of-the-art performance on a benchmark of 400 videos, significantly reducing rotation and translation errors compared to existing methods. AI
IMPACT Enables more accurate and visually consistent video re-rendering from novel camera viewpoints.