PulseAugur
EN
LIVE 21:33:18

User seeks help implementing Calm TTS paper, facing voice cloning issues

A user is seeking assistance with implementing the Calm text-to-speech model described in a research paper. They have encountered difficulties in replicating the model's performance, experiencing issues with generating meaningful text and achieving accurate voice cloning. The user has tried various techniques, including scheduled sampling and adjusting data conditions, but has faced challenges such as exploding gradients and a trade-off between text quality and voice fidelity. They are asking for advice on how to proceed, whether to re-examine the paper, increase the dataset size, or address potential system design flaws. AI

IMPACT This cluster highlights challenges in replicating advanced TTS models, indicating potential areas for improvement in open-source implementations and research reproducibility.

RANK_REASON User is seeking help implementing a research paper, not announcing a new finding. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

User seeks help implementing Calm TTS paper, facing voice cloning issues

COVERAGE [1]

  1. r/MachineLearning TIER_1 English(EN) · /u/No-Motor-6274 ·

    I'm trying to implement CALM paper, and I have some questions. [P]

    <table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1uix556/im_trying_to_implement_calm_paper_and_i_have_some/"> <img alt="I'm trying to implement CALM paper, and I have some questions. [P]" src="https://preview.redd.it/kr4u22yfx8ah1.png?width=140&amp;heig…