Google AI has released Gemini 3.1 TTS and Gemini 3.1 Flash TTS, their newest text-to-speech models. These models offer enhanced expressiveness and control, introducing audio tags to guide vocal style, pace, and delivery through natural language commands. The audio tags are designed to be an intuitive way for users to shape the output of the text-to-speech models. AI
Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →
RANK_REASON Release of a new model version by a major AI lab, but not a frontier model release.