Researchers have developed Phonikud, an open-source system designed to improve text-to-speech (TTS) for Modern Hebrew by addressing phonetic underspecification. This framework includes a grapheme-to-phoneme system that outputs detailed International Phonetic Alphabet (IPA) transcriptions, a new corpus named ILSpeech with annotated Hebrew audio and text, and models for automatic TTS evaluation. The system demonstrates improved phoneme prediction compared to previous methods, with small TTS models utilizing Phonikud's phonetic input achieving performance comparable to larger proprietary systems. AI
IMPACT Enhances TTS capabilities for under-resourced languages by providing a more accurate phonetic representation.
RANK_REASON The cluster contains an academic paper detailing a new system and dataset for a specific NLP task. [lever_c_demoted from research: ic=1 ai=1.0]
- arXiv
- Hebrew
- Hugging Face
- ILSpeech
- International Phonetic Alphabet
- Morris Alper
- Phonikud
- text-to-speech
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →