PulseAugur
EN
LIVE 14:56:36

New framework CORTIS enables continual speaker unlearning in TTS models

Researchers have developed a new framework called Cumulative ORThogonal Identity Suppression (CORTIS) to address the challenge of continually unlearning speaker identities from zero-shot text-to-speech (ZS-TTS) models. Existing methods fail when unlearning requests are sequential, as they can revive previously unlearned speakers. CORTIS, however, uses Fisher-information-based parameter masking and orthogonal projection to ensure that once a speaker identity is unlearned, it remains forgotten even with subsequent unlearning requests, without needing access to the previously unlearned data. This approach was demonstrated to be effective with the VoiceBox model, outperforming sequential applications of prior methods. AI

IMPACT This research addresses a critical privacy concern in generative audio models, enabling more robust and sequential unlearning of sensitive data.

RANK_REASON The cluster contains an academic paper detailing a new method for machine unlearning in the context of ZS-TTS models.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New framework CORTIS enables continual speaker unlearning in TTS models

COVERAGE [2]

  1. arXiv cs.AI TIER_1 English(EN) · Jinju Kim, Yunsung Kang, Gyeong-Moon Park, Jong Hwan Ko ·

    Continual Speaker Identity Unlearning with Minimal Interference

    arXiv:2605.25962v1 Announce Type: cross Abstract: Machine unlearning removes designated concepts or knowledge from pre-trained models. Recent work has extended this paradigm to speaker identity unlearning in zero-shot text-to-speech (ZS-TTS), the task of selectively erasing a mod…

  2. arXiv cs.AI TIER_1 English(EN) · Jong Hwan Ko ·

    Continual Speaker Identity Unlearning with Minimal Interference

    Machine unlearning removes designated concepts or knowledge from pre-trained models. Recent work has extended this paradigm to speaker identity unlearning in zero-shot text-to-speech (ZS-TTS), the task of selectively erasing a model's ability to replicate a speaker's voice. Exist…