PulseAugur
实时 03:30:29

New SAME audio autoencoder offers high compression, open weights

Researchers have developed SAME, a new autoencoder for stereo music and general audio that achieves a high temporal compression ratio while preserving reconstruction quality. This model combines a transformer backbone with semantic regularization, phase-aware losses, and improved discriminator designs. SAME offers significant computational cost benefits and is released in open-weights with two variants: SAME-L and a CPU-deployable SAME-S. AI

影响 New open-weight audio autoencoder could reduce computational costs for generative audio tasks.

排序理由 The cluster contains a new academic paper detailing a novel model architecture and its release. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New SAME audio autoencoder offers high compression, open weights

报道来源 [1]

  1. arXiv cs.AI TIER_1 English(EN) · Jordi Pons ·

    SAME: A Semantically-Aligned Music Autoencoder

    Latent representations are at the heart of the majority of modern generative models. In the audio domain they are typically produced by a neural-audio-codec autoencoder. In this work we introduce SAME (Semantically-Aligned Music autoEncoder), an autoencoder for stereo music and g…