Together AI has announced the release of Cartesia Sonic 3.5, a new text-to-speech (TTS) model designed for real-time applications. The model offers sub-90 ms latency and supports 42 languages, with context-aware pronunciation and accurate transcript following. Developers can access more than 150 Cartesia Sonic 3.5 voices through Together AI's voice finder tool to compare and select voices before deployment.
IMPACT Enhances real-time TTS capabilities with low latency and broad language support, potentially improving voice agent interactions.
RANK_REASON Model release announcement from a frontier lab.
AI-generated summary · Google Gemini · from 3 sources.