Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · Hugging Face Daily Papers English(EN) · 4d · [2 sources]

Track2View: 4D-Consistent Camera-Controlled Video Generation via Paired 3D Point Tracks

Researchers have developed Track2View, a novel method for generating videos from new camera viewpoints. This approach utilizes 3D point tracks to establish explicit spatiotemporal correspondences, ensuring temporal continuity and improving visual quality. Track2View conditions a video diffusion transformer with these paired 3D point tracks, enabling it to generalize to various camera trajectories without memorizing specific motions. The system has demonstrated state-of-the-art performance on a benchmark of 400 videos, significantly reducing rotation and translation errors compared to existing methods. AI

IMPACT Enables more accurate and visually consistent video re-rendering from novel camera viewpoints.
RESEARCH · arXiv cs.CV English(EN) · 3w · [6 sources]

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

Researchers have introduced several new frameworks for generating realistic and controllable driving scenes, crucial for training autonomous vehicles. DriveWAM adapts video diffusion transformers to create autoregressive action policies, incorporating scene understanding and memory for long-horizon planning. AnyScene offers a unified occupancy-centric approach, enabling precise control from arbitrary BEV layouts and generating temporally consistent multi-view videos. DriveGen3D combines efficient video diffusion with 3D scene reconstruction for high-quality, controllable dynamic scenes, supporting long driving videos and 3D representations. Additionally, a new dataset, Nuplan-Occ, has been curated to facilitate large-scale generative modeling and downstream applications in autonomous driving. AI

IMPACT These advancements in synthetic data generation could accelerate the development and testing of autonomous driving systems by providing more realistic and controllable training environments.

Brief

Track2View: 4D-Consistent Camera-Controlled Video Generation via Paired 3D Point Tracks

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond