PulseAugur

OpenAI launches GPT-5-level voice models for real-time translation and agents

OpenAI has released three new real-time voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. GPT-Realtime-2 brings GPT-5-level reasoning to voice interactions and supports a 128K context window and parallel tool calls. GPT-Realtime-Translate offers low-cost, real-time simultaneous interpretation across numerous languages, drastically undercutting the cost of human interpreters. GPT-Realtime-Whisper provides low-latency, streaming speech-to-text transcription.

Summary written by gemini-2.5-flash-lite from 1 source.

IMPACT These models lower the cost and broaden the availability of real-time voice translation and AI-powered voice agents, potentially disrupting the simultaneous interpretation industry and enabling more natural human-computer interaction.

RANK_REASON OpenAI announced three new voice models with advanced capabilities, including GPT-5-level reasoning and real-time translation.


COVERAGE [1]

  1. 量子位 (QbitAI) · TIER_1 · 中文(ZH) · 听雨

    GPT-5-level reasoning packed into a voice model: OpenAI slashes simultaneous interpretation costs to rock bottom.

    OpenAI launches three new real-time voice models