Stability AI has released its Stable Audio 3 family of models, including small and medium versions, designed for efficient variable-length audio generation and editing. These latent diffusion models operate on a novel semantic-acoustic autoencoder and utilize adversarial post-training to enhance speed and quality. Trained on licensed and Creative Commons data, the models can produce music and sounds in seconds, with the small and medium versions capable of running on consumer hardware. AI
IMPACT Accelerates AI-powered audio creation and editing for both consumers and professionals.
RANK_REASON Model release from a frontier lab (Stability AI) with model weights and inference pipeline. [lever_c_demoted from frontier_release: ic=2 ai=1.0]
Read on Hugging Face Trending Models →
- Gemma
- Stability AI
- Stable Audio 3 Medium
- T5Gemma
- stabilityai/stable-audio-3-medium
- stabilityai/stable-audio-3-small-music
- Stable Audio 3
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →