PulseAugur
research · [1 source] ·

Making Transformers Sing - with Mikey Shulman of Suno

Suno, a company founded by former Kensho employees who are also musicians, builds AI models for audio generation that go beyond traditional text-to-speech. Its first open-source model, Bark, generated speech, music, and sound effects by training on broad audio data rather than on narrow text-to-speech datasets. Suno's subsequent product, which gained significant attention in December 2023, aims to democratize music creation so that anyone can make music.


RANK_REASON The article discusses the development and capabilities of Suno's AI models for audio and music generation, including their open-source model Bark, which is a significant research advancement in the field.


COVERAGE [1]

  1. Latent Space Podcast TIER_1 · Latent.Space ·

    Making Transformers Sing - with Mikey Shulman of Suno

    Giving computers a voice has always been at the center of sci-fi movies; “I’m sorry Dave, I’m afraid I can’t do that” (https://www.youtube.com/watch?v=qDrDUmuUBTo) wouldn’t hit as hard if it just appeared on screen as a terminal output, after all. T…