Gradium has launched two new real-time speech translation models, stt-translate and s2s-translate, which aim to outperform existing solutions like GPT-Realtime-Translate and Gemini 3.5 Live Translate. These models support five languages and 20 language pairs, collapsing the traditional three-model pipeline into a more efficient two-model process. Gradium claims superior accuracy on BLEU and MetricX benchmarks compared to Gemini 3.5 Live Translate and better BLEU scores than GPT-Realtime-Translate, while offering advanced voice control features not found in GPT-Realtime-Translate. AI
IMPACT Offers improved accuracy and latency for real-time speech translation, potentially impacting applications requiring live multilingual communication.
RANK_REASON The launch of new speech translation models by a company, competing with established players.
- Gemini 3.5 Live Translate
- GPT-Realtime-Translate
- Gradium
- Hibiki-Zero
- Juraska et al.
- Papineni et al.
- s2s-translate
- stt-translate
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →