PulseAugur
EN
LIVE 02:40:34

New SAME audio autoencoder offers high compression, open weights

Researchers have developed SAME, a new autoencoder for stereo music and general audio that achieves a high temporal compression ratio while preserving reconstruction quality. This model combines a transformer backbone with semantic regularization, phase-aware losses, and improved discriminator designs. SAME offers significant computational cost benefits and is released in open-weights with two variants: SAME-L and a CPU-deployable SAME-S. AI

IMPACT New open-weight audio autoencoder could reduce computational costs for generative audio tasks.

RANK_REASON The cluster contains a new academic paper detailing a novel model architecture and its release. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New SAME audio autoencoder offers high compression, open weights

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Jordi Pons ·

    SAME: A Semantically-Aligned Music Autoencoder

    Latent representations are at the heart of the majority of modern generative models. In the audio domain they are typically produced by a neural-audio-codec autoencoder. In this work we introduce SAME (Semantically-Aligned Music autoEncoder), an autoencoder for stereo music and g…