Brief · PulseAugur

RESEARCH · Hugging Face Daily Papers Italiano(IT) · 1w · [3 sources]

Avatar V: Scaling Video-Reference Avatar Video Generation

Researchers have introduced Avatar V, a new framework for generating highly realistic avatar videos that are behaviorally recognizable. Unlike previous methods that relied on static images, Avatar V conditions on full video references to capture dynamic traits like talking rhythm and gestures. The system utilizes sparse attention mechanisms and a dedicated motion stream to achieve high-fidelity results, outperforming existing models such as Seedance 2.0 and Kling O3 Pro. AI

IMPACT Sets a new standard for avatar video generation by conditioning on full video references for behavioral realism.

Veo 3.1
Seedance 2.0
Kling O3 Pro
OmniHuman 1.5