New SAME audio autoencoder offers high compression, open weights

By PulseAugur Editorial · [1 source] · 2026-05-18 16:23

Researchers have developed SAME, a new autoencoder for stereo music and general audio that achieves a high temporal compression ratio while preserving reconstruction quality. The model combines a transformer backbone with semantic regularization, phase-aware losses, and improved discriminator designs. SAME offers significant computational cost savings and is released with open weights in two variants: SAME-L and a CPU-deployable SAME-S.

Impact: The new open-weight audio autoencoder could reduce computational costs for generative audio tasks.

Ranking rationale: The cluster contains a new academic paper detailing a novel model architecture and its release.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.AI TIER_1 English(EN) · Jordi Pons · 2026-05-18 16:23

SAME: A Semantically-Aligned Music Autoencoder

Latent representations are at the heart of the majority of modern generative models. In the audio domain they are typically produced by a neural-audio-codec autoencoder. In this work we introduce SAME (Semantically-Aligned Music autoEncoder), an autoencoder for stereo music and g…

