SANA-WM model generates minute-long 720p videos

By PulseAugur Editorial · [1 sources] · 2026-05-18 11:14

Researchers have released SANA-WM, an open-source world model capable of generating minute-long videos at 720p resolution. This diffusion transformer model utilizes a hybrid linear attention mechanism and a dual-branch architecture for precise camera control. The model also incorporates a two-stage generation pipeline with a refiner for enhanced quality and temporal consistency, and it was trained using a robust annotation pipeline with metric-scale 6-DoF camera poses. AI

IMPACT Enables creation of longer, high-fidelity videos with precise camera control, potentially impacting content generation and simulation.

RANK_REASON The cluster describes the release of a new open-source model with a corresponding paper, fitting the research bucket. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Trending Models →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

SANA-WM model generates minute-long 720p videos

COVERAGE [1]

Hugging Face Trending Models TIER_1 English(EN) · Efficient-Large-Model · 2026-05-18 11:14

Efficient-Large-Model/SANA-WM_bidirectional

image-to-video · 0 downloads · 98 likes

COVERAGE [1]

Efficient-Large-Model/SANA-WM_bidirectional

RELATED ENTITIES

RELATED TOPICS