PulseAugur

Gradium launches real-time speech translation models, challenging GPT and Gemini

Gradium has launched two real-time speech translation models, stt-translate and s2s-translate, positioned against GPT-Realtime-Translate and Gemini 3.5 Live Translate. The models cover five languages (English, French, German, Spanish, and Portuguese) across 20 language pairs, and collapse the traditional three-model cascade into two models. Gradium claims better results than Gemini 3.5 Live Translate on both BLEU and MetricX and better BLEU scores than GPT-Realtime-Translate, along with voice control features that GPT-Realtime-Translate lacks.

IMPACT Offers improved accuracy and latency for real-time speech translation, potentially impacting applications requiring live multilingual communication.

RANK_REASON The launch of new speech translation models by a company, competing with established players.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. MarkTechPost TIER_1 English(EN) · Asif Razzaq

    Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency

    Gradium released two real-time speech translation models, stt-translate and s2s-translate, covering English, French, German, Spanish, and Portuguese across 20 language pairs. The models collapse the standard three-model cascade into two, pairing single-pass transcription-and-t…