OpenAI has released new, advanced audio models through its API, enhancing capabilities for voice agents. The updated speech-to-text models, including gpt-4o-transcribe and gpt-4o-mini-transcribe, offer improved accuracy and reliability, particularly in challenging audio conditions. Additionally, a new text-to-speech model, gpt-4o-mini-tts, allows developers to customize vocal delivery for more expressive and tailored applications. AI
Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →
RANK_REASON OpenAI released new generation audio models with improved performance benchmarks and new steerability features.