PulseAugur
LIVE 04:13:08
frontier release · [45 sources] ·
0
frontier release

OpenAI ships GPT-5-class voice models for real-time reasoning, translation, and transcription

OpenAI has released three new real-time voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. These models offer enhanced reasoning capabilities, live speech translation for over 70 languages, and low-latency transcription. GPT-Realtime-2, in particular, is described as having "GPT-5-class reasoning" and features a significantly expanded context window of 128K tokens, alongside improved handling of interruptions and tool usage. AI

Summary written by gemini-2.5-flash-lite from 45 sources. How we write summaries →

IMPACT Enhances real-time voice agent capabilities with improved reasoning, translation, and transcription, potentially accelerating adoption of voice-first interfaces.

RANK_REASON OpenAI released new real-time voice models with GPT-5-class reasoning capabilities.

Read on OpenAI News →

COVERAGE [45]

  1. OpenAI News TIER_1 ·

    Advancing voice intelligence with new models in the API

    Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.

  2. OpenAI News TIER_1 ·

    How OpenAI delivers low-latency voice AI at scale

    How OpenAI rebuilt its WebRTC stack to power real-time Voice AI with low latency, global scale, and seamless conversational turn-taking.

  3. Latent Space (swyx) TIER_1 ·

    [AINews] Thinking Machines' Native Interaction Models - TML-Interaction-Small 276B-A12B - advances SOTA Realtime Voice and kills standard VAD

    well done, Team Thinky.

  4. Latent Space (swyx) TIER_1 ·

    [AINews] GPT-Realtime-2, -Translate, and -Whisper: new SOTA realtime voice APIs

    OpenAI continues deploying GPT-5 everywhere

  5. Smol AINews TIER_1 ·

    GPT-Realtime-2, -Translate, and -Whisper: new SOTA realtime voice APIs

    **OpenAI** released **GPT-Realtime-2**, a voice model with **GPT-5-class reasoning**, tool use, interruption handling, and extended context windows up to **128K tokens**, achieving top scores on **Big Bench Audio** and **Conversational Dynamics** benchmarks. They also launched a …

  6. The Decoder TIER_1 · Matthias Bastian ·

    OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations

    <p><img alt="" class="attachment-full size-full wp-post-image" height="1152" src="https://the-decoder.com/wp-content/uploads/2026/05/openai_audio-1.png" style="height: auto; margin-bottom: 10px;" width="2048" /></p> <p> OpenAI is shipping three new voice models—GPT-Realtime-2, GP…

  7. Hacker News — AI stories ≥50 points TIER_1 · Sean-Der ·

    How OpenAI delivers low-latency voice AI at scale

  8. MarkTechPost TIER_1 · Asif Razzaq ·

    OpenAI Releases Three Realtime Audio Models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper in the Realtime API

    <p>Three purpose-built audio models expand what developers can build with live voice: reasoning agents, speech translation across 70+ languages, and streaming transcription.</p> <p>The post <a href="https://www.marktechpost.com/2026/05/08/openai-releases-three-realtime-audio-mode…

  9. MarkTechPost TIER_1 · Asif Razzaq ·

    Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk

    <p>The Inworld AI's new model conditions on full audio context, not just transcripts — a meaningful architectural shift for voice-first AI agents</p> <p>The post <a href="https://www.marktechpost.com/2026/05/05/inworld-ai-launches-realtime-tts-2-a-closed-loop-voice-model-that-ada…

  10. Email — Mindstream TIER_1 · bounces+35008234-749c-ns3evnpcff6928077d7u=kill-the-newsletter.com@em5320.mindstream.news (bounces+35008234-749c-ns3evnpcff6928077d7u=kill-the-newsletter.com@em5320.mindstream.news) ·

    ChatGPT voice can now do much more

    <!--[if !mso]><!--><!--<![endif]-->ChatGPT voice can now do much more<!--[if mso]><xml><o:OfficeDocumentSettings><o:AllowPNG></o:AllowPNG><o:PixelsPerInch>96</o:PixelsPerInch></o:OfficeDocumentSettings></xml><![endif]--><!--[if mso]><style type="text/css"> h1, h2, h3, h4, h5, h6 …

  11. Mastodon — sigmoid.social TIER_1 · [email protected] ·

    New voice models from # OpenAI are quite impressive. Demo: https:// x.com/OpenAI/status/2052438194 625593804 # AI

    New voice models from # OpenAI are quite impressive. Demo: https:// x.com/OpenAI/status/2052438194 625593804 # AI

  12. Mastodon — sigmoid.social TIER_1 · [email protected] ·

    OpenAI's new Realtime API trio: GPT-Realtime-2 brings GPT-5-class reasoning to live voice with 128K context, parallel tool calls, and configurable reasoning tie

    OpenAI's new Realtime API trio: GPT-Realtime-2 brings GPT-5-class reasoning to live voice with 128K context, parallel tool calls, and configurable reasoning tiers. Zillow achieved 95% call success (up from 69%) on adversarial benchmarks. Plus real-time translation in 70+ language…

  13. Mastodon — sigmoid.social TIER_1 Suomi(FI) · [email protected] ·

    OpenAI released three new, real-time voice models Language models for real-time speech translation, transcription, and conversation are available

    OpenAI julkaisi kolme uutta, reaaliaikaista äänimallia Reaaliaikaiseen puheen kääntämiseen, litterointiin ja keskustelemiseen tarkoitetut kielimallit ovat saatavilla sovelluskehittäjille välittömästi. https:// dawn.fi/uutiset/2026/05/08/ope nai-reaaliaikaiset-aanimallit # OpenAI …

  14. Email — The Rundown AI TIER_1 · bounces+31366032-637c-8d9utci1mq15fs7p9a4h=kill-the-newsletter.com@em8370.daily.therundown.ai (bounces+31366032-637c-8d9utci1mq15fs7p9a4h=kill-the-newsletter.com@em8370.daily.therundown.ai) ·

    🗣️ OpenAI closes reasoning gap in voice agents

    <!--[if !mso]><!--><!--<![endif]-->🗣️ OpenAI closes reasoning gap in voice agents<!--[if mso]><xml><o:OfficeDocumentSettings><o:AllowPNG></o:AllowPNG><o:PixelsPerInch>96</o:PixelsPerInch></o:OfficeDocumentSettings></xml><![endif]--><!--[if mso]><style type="text/css"> h1, h2, h3,…

  15. Mastodon — sigmoid.social TIER_1 Türkçe(TR) · [email protected] ·

    OpenAI has upgraded its real-time voice and translation capabilities with the new GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper. Multilingual support and high d

    OpenAI yeni GPT‑Realtime‑2, GPT‑Realtime‑Translate ve GPT‑Realtime‑Whisper ile gerçek‑süre sesli ve çeviri yeteneklerini yükseltti. Çok dilli destek ve yüksek doğrulukta ses tanıma sayesinde tüm dili tek bir platformda topluyor. API’lerde hali hazırda erişilebilir. 🚩 # AI # OpenA…

  16. TechCrunch AI TIER_1 · Lucas Ropek ·

    OpenAI launches new voice intelligence features in its API

    The new features could be handy for customer service systems, but OpenAI says they have applications that work across a variety of other fields, including education and creator platforms.

  17. Mastodon — sigmoid.social TIER_1 · [email protected] ·

    OpenAI has unveiled three new voice models, including GPT-Realtime-2 with GPT-5-class reasoning and GPT-Realtime-Translate supporting over 70 languages. The com

    OpenAI has unveiled three new voice models, including GPT-Realtime-2 with GPT-5-class reasoning and GPT-Realtime-Translate supporting over 70 languages. The company says it is responding to viral videos highlighting its voice technologys shortcomings. https:// gizmodo.com/openai-…

  18. Mastodon — sigmoid.social TIER_1 · [email protected] ·

    Inworld AI has launched Realtime TTS-2, a closed-loop voice model that hears the full audio of conversation turns to adapt its delivery to users actual tone and

    Inworld AI has launched Realtime TTS-2, a closed-loop voice model that hears the full audio of conversation turns to adapt its delivery to users actual tone and emotional state. The model uses plain-language prompts like "[speak sadly, as if something bad just happened]" to steer…

  19. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    https:// winbuzzer.com/2026/05/10/opena i-brings-gpt-5-class-reasoning-to-real-time-v-xcxwbn/ OpenAI has launched a three-model real-time voice lineup that sepa

    https:// winbuzzer.com/2026/05/10/opena i-brings-gpt-5-class-reasoning-to-real-time-v-xcxwbn/ OpenAI has launched a three-model real-time voice lineup that separates reasoning, translation, and transcription instead of treating voice as one bundled chat feature. # AI # OpenAI # G…

  20. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    OpenAI has new voice models that reason, translate, and transcribe as you speak OpenAI has just released three new realtime voice models that it says will “unlo

    OpenAI has new voice models that reason, translate, and transcribe as you speak OpenAI has just released three new realtime voice models that it says will “unlock a new class of voice apps for developers.” Each new voice intelligence model has a unique speciality for different pu…

  21. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    OpenAI is pushing deeper into voice. The company just launched three new realtime audio models in its API. GPT-Realtime-2 for conversational reasoning, GPT-Real

    OpenAI is pushing deeper into voice. The company just launched three new realtime audio models in its API. GPT-Realtime-2 for conversational reasoning, GPT-Realtime-Translate for live multilingual translation, and GPT-Realtime-Whisper for streaming speech transcription. https:// …

  22. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    OpenAI has released new voice intelligence features in its API, aimed at customer service, education and creator platforms. The tools enable real-time spoken in

    OpenAI has released new voice intelligence features in its API, aimed at customer service, education and creator platforms. The tools enable real-time spoken interaction beyond basic transcription. https:// techcrunch.com/2026/05/07/open ai-launches-new-voice-intelligence-feature…

  23. Mastodon — mastodon.social TIER_1 · rhodzy ·

    New blog post: When AI Finally Starts Talking Back (Properly) OpenAI's move to low-latency voice AI isn't just a tech upgrade; it's a fundamental shift that mak

    New blog post: When AI Finally Starts Talking Back (Properly) OpenAI's move to low-latency voice AI isn't just a tech upgrade; it's a fundamental shift that makes truly conversational AI a reality, with massive implications for everything from health tech to gaming. https:// rhod…

  24. Mastodon — mastodon.social TIER_1 · [email protected] ·

    Advancing voice intelligence with new models in the API openai.com/index/advancin… #AI #voice #translation #OpenAI

    Advancing voice intelligence with new models in the API openai.com/index/advancin… #AI #voice #translation #OpenAI

  25. Mastodon — mastodon.social TIER_1 · [email protected] ·

    Tracking the AI Evolution in Research Libraries: Findings from ARL’s 2026 AI Quick Poll — Association of Research Libraries www.arl.org/blog/tracking-… #AI #lib

    Tracking the AI Evolution in Research Libraries: Findings from ARL’s 2026 AI Quick Poll — Association of Research Libraries www.arl.org/blog/tracking-… #AI #libraries

  26. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 Realtime Audio Models 2026: OpenAI Unveils GPT-Realtime-2, Translate & Whisper OpenAI has introduced three new realtime audio models—GPT-Realtime-2, GPT-Realt

    📰 Realtime Audio Models 2026: OpenAI Unveils GPT-Realtime-2, Translate & Whisper OpenAI has introduced three new realtime audio models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—designed to transform live voice applications with reasoning, translation, and l…

  27. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 OpenAI Redefines Conversational AI with Real-Time Voice Models in 2026: GPT-4o, Whisper ... OpenAI revolutionizes real-time voice capabilities

    📰 OpenAI 2026'da Gerçek Zamanlı Ses Modelleriyle Konuşan AI'yi Yeniden Tanımlıyor: GPT-4o, Whisper ... OpenAI, gerçek zamanlı konuşma yeteneklerinde devrim yaratacak üç yeni ses modelini duyurdu: GPT-Realtime-2, GPT-Realtime-Translate ve GPT-Realtime-Whisper. Bu modeller, yapay z…

  28. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 GPT-Realtime-2 Launches in 2026: 15.2% Faster Voice AI with Realtime Translation & Transcription OpenAI has unveiled GPT-Realtime-2, -Translate, and -Whisper

    📰 GPT-Realtime-2 Launches in 2026: 15.2% Faster Voice AI with Realtime Translation & Transcription OpenAI has unveiled GPT-Realtime-2, -Translate, and -Whisper — a suite of next-generation realtime voice APIs that set new state-of-the-art benchmarks in speech understanding and tr…

  29. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 GPT-Realtime-2 Set a New Standard in Real-Time Voice APIs in 2026 OpenAI, with GPT-Realtime-2, -Translate, and -Whisper, in real-time voice processing

    📰 GPT-Realtime-2 ile 2026'da Gerçek Zamanlı Ses API'leri Yeni Standartı Yarattı OpenAI, GPT-Realtime-2, -Translate ve -Whisper ile gerçek zamanlı ses işlemede devrim yarattı. Bu yeni API'ler, sesli diyalogların doğallığını ve hızını tamamen yeniden tanımlıyor.... # YapayZekaAraçl…

  30. Mastodon — mastodon.social TIER_1 日本語(JA) · [email protected] ·

    OpenAI's new voice model "GPT-Realtime-2" enables instant translation and low-latency transcription https://www.watch.impress.co.jp/docs/news/12107115.html #watch_impress #ChatGPT #Tech #AI

    OpenAI、新音声モデル「GPT-Realtime-2」 即時翻訳や低遅延文字起こしも https://www. watch.impress.co.jp/docs/news/ 2107115.html # watch_impress # ChatGPT # テック # AI

  31. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 OpenAI Voice Models 2026: GPT-Realtime-2, Whisper & Translate Revolutionize Real-Time Transcription OpenAI has introduced groundbreaking voice models includin

    📰 OpenAI Voice Models 2026: GPT-Realtime-2, Whisper & Translate Revolutionize Real-Time Transcription OpenAI has introduced groundbreaking voice models including GPT-Realtime-2, Translate, and Whisper, revolutionizing real-time speech processing. These models enhance transcriptio…

  32. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 Voice Communication Revolution in 2026: Real-time Transcription with GPT-Realtime-2, Whisper, and Translate... OpenAI Creates a Milestone in Voice AI

    📰 2026'da Sesli İletişim Devrimi: GPT-Realtime-2, Whisper ve Translate ile Gerçek Zamanlı Transkrip... OpenAI, sesli yapay zeka alanında bir dönüm noktası yarattı: GPT-Realtime-2, Translate ve Whisper ile gerçek zamanlı konuşma, çeviri ve transkripsiyon artık insan diline çok dah…

  33. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 AlphaEvolve: How Google DeepMind’s Gemini AI Transforms Algorithm Design in 2026 AlphaEvolve, a Gemini-powered coding agent developed by Google DeepMind, is r

    📰 AlphaEvolve: How Google DeepMind’s Gemini AI Transforms Algorithm Design in 2026 AlphaEvolve, a Gemini-powered coding agent developed by Google DeepMind, is reshaping how advanced algorithms are designed across scientific and engineering domains. By autonomously generating and …

  34. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 AlphaEvolve Generates Code with Gemini AI: Automating Scientific Discoveries 2026 Google DeepMind, Automatically Designs Algorithms Using Gemini AI

    📰 Gemini AI ile Kod Üreten AlphaEvolve: Bilimsel Keşifleri Otomatikleştiriyor 2026 Google DeepMind, Gemini AI’sini kullanarak algoritmaları otomatik tasarlayan AlphaEvolve’u tanıttı. Bu sistem, sadece kod üretmekle kalmıyor, bilimsel problemleri çözme kapasitesiyle disiplinler ar…

  35. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 2026’s Best Real-Time Speech Processing APIs: OpenAI Whisper, GPT-4o & More OpenAI has unveiled a next-generation voice API suite capable of real-time speech

    📰 2026’s Best Real-Time Speech Processing APIs: OpenAI Whisper, GPT-4o & More OpenAI has unveiled a next-generation voice API suite capable of real-time speech processing, integrating advanced inference, translation, and transcription. This innovation aims to redefine human-AI in…

  36. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 OpenAI Voice APIs 2026: AI Revolution with Real-Time Voice Processing OpenAI announced three new APIs redefining the future of voice interactions: GPT-R

    📰 OpenAI Ses API'leri 2026: Gerçek Zamanlı Ses İşleme ile AI Devrimi OpenAI, sesli interaksiyonların geleceğini yeniden tanımlayan üç yeni API'yi duyurdu: GPT-Realtime-2, GPT-Realtime-Translate ve GPT-Realtime-Whisper. Bu teknolojiler, yapay zekânın sesle iletişim kurma kapasites…

  37. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 Audio Models in API 2026: Build Voice Apps 70% Faster with OpenAI’s New Tools OpenAI has introduced three new audio models in its API, empowering developers t

    📰 Audio Models in API 2026: Build Voice Apps 70% Faster with OpenAI’s New Tools OpenAI has introduced three new audio models in its API, empowering developers to build advanced voice applications. This move aligns with broader industry efforts to standardize AI transparency and b…

  38. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 New Voice Models 2026: Revolution in Voice API and AI Voice Applications for Developers The future of voice-based applications for developers will be changed by three

    📰 Yeni Ses Modelleri 2026: Geliştiriciler İçin Ses API ve AI Ses Uygulamaları Devrimi Geliştiriciler için ses tabanlı uygulamaların geleceğini değiştirecek üç yeni ses modeli duyuruldu. Bu adım, yapay zekanın sesle etkileşime geçme kapasitesini derinlemesine yeniden tanımlıyor...…

  39. Mastodon — mastodon.social TIER_1 · aihaberleri ·

    📰 How OpenAI Codex Got Me Banned from Reddit (2026) Codex usage in development is sparking debate after a builder was banned from Reddit for disclosing AI tool

    📰 How OpenAI Codex Got Me Banned from Reddit (2026) Codex usage in development is sparking debate after a builder was banned from Reddit for disclosing AI tool use. The incident highlights growing tensions between AI efficiency and community transparency norms.... # AINews # AI #…

  40. Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri ·

    📰 Was OpenAI Codex Usage Banned on Reddit in 2026? Critical Warning for Developers OpenAI's Codex tools are very popular among developers

    📰 OpenAI Codex Kullanımı 2026'da Reddit'te Yasaklandı mı? Geliştiriciler İçin Kritik Uyarı OpenAI'nin Codex araçları geliştiriciler arasında büyük popülerlik kazanırken, Reddit'te kullanımı nedeniyle hesap yasaklamaları yaşanmaya başlandı. Bu durum sadece bir teknik sorun değil, …

  41. Mastodon — mastodon.social TIER_1 Svenska(SV) · redaktionen ·

    OpenAI's New Voice Intelligence: A Revolution for Customer Service and Education https://redaktionen.net/artikel/985 # ai # svtech

    OpenAI:s Nya Röstintelligens: En Revolution för Kundtjänst och Utbildning https:// redaktionen.net/artikel/985 # ai # svtech

  42. Mastodon — mastodon.social TIER_1 · sagalinked ·

    📰 OpenAI has introduced new voice intelligence features to its API, which could be beneficial for customer service systems and have applications across various

    📰 OpenAI has introduced new voice intelligence features to its API, which could be beneficial for customer service systems and have applications across various fields such as education and creator platforms. 🔗 https:// techcrunch.com/2026/05/07/open ai-launches-new-voice-intellig…

  43. Mastodon — mastodon.social TIER_1 · [email protected] ·

    Inworld AI has launched Realtime TTS-2, a closed-loop voice model that adapts to how users actually talk. Unlike traditional text-to-speech systems, TTS-2 hears

    Inworld AI has launched Realtime TTS-2, a closed-loop voice model that adapts to how users actually talk. Unlike traditional text-to-speech systems, TTS-2 hears the full audio context of each conversation - not just transcripts - allowing it to detect tone, pacing and emotion. ht…

  44. Mastodon — mastodon.social TIER_1 · [email protected] ·

    OpenAI serves 900M weekly users with voice AI, but traditional pipelines caused unacceptable latency. This deep dive reveals their ingenious solution: an "audio

    OpenAI serves 900M weekly users with voice AI, but traditional pipelines caused unacceptable latency. This deep dive reveals their ingenious solution: an "audio-native" architecture built on a re-engineered WebRTC stack. They tackled "one-port-per-session" and stateful protocol i…

  45. Mastodon — mastodon.social TIER_1 Svenska(SV) · redaktionen ·

    OpenAI's New Voice AI: Faster Than Ever with Low Latency

    OpenAI:s Nya Röst-AI: Snabbare Än Någonsin med Låg Latens https:// redaktionen.net/artikel/881 # ai # svtech