FineVideo is a new open-source text-to-video diffusion model that can generate high-quality videos from text prompts. It is based on the Stable Diffusion architecture and trained on a large dataset of video clips. FineVideo is capable of generating videos with a resolution of up to 512x512 pixels and a frame rate of up to 30 frames per second. The model is available for download on Hugging Face. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
RANK_REASON Open-source model release from a non-frontier lab.