NVIDIA has released its SANA-WM world model as open source, according to a post by Victor M (@victormustar). It is described as a camera-conditional world model with 2.6B parameters that runs on a single GPU and generates 60 seconds of 720p video in 34 seconds. Topics: world models, simulation, agents.
NVIDIA has open-sourced its SANA-WM world model, which can generate a minute of 720p video from a single image. The camera-conditional model runs on a single GPU and produces 60 seconds of video in 34 seconds, which is faster than real time. The release is considered a practical advance for research in world models, simulation, and agents.
IMPACT Enables researchers and developers to experiment with and build upon a new world model for video generation.
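
To put the reported speed in perspective, the short sketch below works out the real-time factor implied by the two figures quoted above (60 seconds of video, 34 seconds of generation). It assumes the 34 seconds is end-to-end wall-clock time; the source does not name the GPU or the frame rate, so neither appears here. The variable and function names are illustrative only.

```python
# Figures quoted in the announcement.
VIDEO_SECONDS = 60.0       # length of the generated 720p clip
GENERATION_SECONDS = 34.0  # reported time to generate it on a single GPU


def real_time_factor(video_s: float, generation_s: float) -> float:
    """Seconds of video produced per second of compute (above 1 is faster than real time)."""
    return video_s / generation_s


def compute_per_video_second(video_s: float, generation_s: float) -> float:
    """Seconds of compute needed for each second of video."""
    return generation_s / video_s


rtf = real_time_factor(VIDEO_SECONDS, GENERATION_SECONDS)
cost = compute_per_video_second(VIDEO_SECONDS, GENERATION_SECONDS)

print(f"Real-time factor: {rtf:.2f}x")                    # 1.76x
print(f"Compute per second of video: {cost:.2f} s")       # 0.57 s
```

By these numbers the model would produce video roughly 1.76 times faster than playback speed, assuming the quoted timing covers the whole generation run.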