OpenAI ships GPT-5-class voice models for real-time reasoning, translation, and transcription

OpenAI News TIER_1 English(EN) · 2026-05-07 10:00

Advancing voice intelligence with new models in the API

Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.

OpenAI News TIER_1 English(EN) · 2026-05-04 00:00

How OpenAI delivers low-latency voice AI at scale

How OpenAI rebuilt its WebRTC stack to power real-time Voice AI with low latency, global scale, and seamless conversational turn-taking.

Latent Space (swyx) TIER_1 English(EN) · 2026-05-12 04:33

[AINews] Thinking Machines' Native Interaction Models - TML-Interaction-Small 276B-A12B - advances SOTA Realtime Voice and kills standard VAD

well done, Team Thinky.

Latent Space (swyx) TIER_1 English(EN) · 2026-05-08 07:11

[AINews] GPT-Realtime-2, -Translate, and -Whisper: new SOTA realtime voice APIs

OpenAI continues deploying GPT-5 everywhere

Smol AINews TIER_1 English(EN) · 2026-05-07 05:44

GPT-Realtime-2, -Translate, and -Whisper: new SOTA realtime voice APIs

**OpenAI** released **GPT-Realtime-2**, a voice model with **GPT-5-class reasoning**, tool use, interruption handling, and extended context windows up to **128K tokens**, achieving top scores on **Big Bench Audio** and **Conversational Dynamics** benchmarks. They also launched a …

The Decoder TIER_1 English(EN) · Matthias Bastian · 2026-05-07 18:44

OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations

OpenAI is shipping three new voice models: GPT-Realtime-2, GP…

Hacker News — AI stories ≥50 points TIER_1 English(EN) · Sean-Der · 2026-05-04 19:42

How OpenAI delivers low-latency voice AI at scale

MarkTechPost TIER_1 English(EN) · Asif Razzaq · 2026-05-08 07:05

OpenAI Releases Three Realtime Audio Models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper in the Realtime API

Three purpose-built audio models expand what developers can build with live voice: reasoning agents, speech translation across 70+ languages, and streaming transcription.

MarkTechPost TIER_1 English(EN) · Asif Razzaq · 2026-05-06 00:34

Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk

Inworld AI's new model conditions on full audio context, not just transcripts, a meaningful architectural shift for voice-first AI agents.

Email — Mindstream TIER_1 English(EN) · 2026-05-12 15:06

ChatGPT voice can now do much more

Mastodon — sigmoid.social TIER_1 English(EN) · 2026-05-11 22:17

New voice models from #OpenAI are quite impressive. Demo: https://x.com/OpenAI/status/2052438194625593804 #AI

Mastodon — sigmoid.social TIER_1 English(EN) · 2026-05-08 16:11

OpenAI's new Realtime API trio: GPT-Realtime-2 brings GPT-5-class reasoning to live voice with 128K context, parallel tool calls, and configurable reasoning tiers. Zillow achieved 95% call success (up from 69%) on adversarial benchmarks. Plus real-time translation in 70+ language…

LINKS aintelligencehub.com/…/openai-realtime-2-…

Mastodon — sigmoid.social TIER_1 Suomi(FI) · 2026-05-08 15:55

OpenAI released three new real-time voice models. Language models intended for real-time speech translation, transcription, and conversation are available to application developers immediately. https://dawn.fi/uutiset/2026/05/08/openai-reaaliaikaiset-aanimallit #OpenAI …

LINKS dawn.fi/…/openai-reaaliaikaiset-aanimallit

Email — The Rundown AI TIER_1 English(EN) · 2026-05-08 10:09

🗣️ OpenAI closes reasoning gap in voice agents

Mastodon — sigmoid.social TIER_1 Türkçe(TR) · 2026-05-08 10:08

OpenAI has upgraded its real-time voice and translation capabilities with the new GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper. With multilingual support and high-accuracy speech recognition, it brings every language together on a single platform. Already available in the APIs. 🚩 #AI #OpenA…

TechCrunch AI TIER_1 English(EN) · Lucas Ropek · 2026-05-07 22:24

OpenAI launches new voice intelligence features in its API

The new features could be handy for customer service systems, but OpenAI says they have applications that work across a variety of other fields, including education and creator platforms.

Mastodon — sigmoid.social TIER_1 English(EN) · 2026-05-07 20:53

OpenAI has unveiled three new voice models, including GPT-Realtime-2 with GPT-5-class reasoning and GPT-Realtime-Translate supporting over 70 languages. The company says it is responding to viral videos highlighting its voice technology's shortcomings. https://gizmodo.com/openai-…

LINKS gizmodo.com/openai-is-tired-of-seeing-all…

Mastodon — sigmoid.social TIER_1 English(EN) · 2026-05-06 05:51

Inworld AI has launched Realtime TTS-2, a closed-loop voice model that hears the full audio of conversation turns to adapt its delivery to users' actual tone and emotional state. The model uses plain-language prompts like "[speak sadly, as if something bad just happened]" to steer…

LINKS marktechpost.com/…/inworld-ai-launches-re…

Mastodon — fosstodon.org TIER_1 English(EN) · 2026-05-10 19:30

https://winbuzzer.com/2026/05/10/openai-brings-gpt-5-class-reasoning-to-real-time-v-xcxwbn/ OpenAI has launched a three-model real-time voice lineup that separates reasoning, translation, and transcription instead of treating voice as one bundled chat feature. #AI #OpenAI #G…

LINKS winbuzzer.com/…/openai-brings-gpt-5-class… winbuzzer.com/…/openagi-unveils-lux-agent…

Mastodon — fosstodon.org TIER_1 English(EN) · 2026-05-10 08:33

OpenAI has new voice models that reason, translate, and transcribe as you speak OpenAI has just released three new realtime voice models that it says will “unlock a new class of voice apps for developers.” Each new voice intelligence model has a unique speciality for different pu…

LINKS 9to5mac.com/…/openai-has-new-voice-models…

Mastodon — fosstodon.org TIER_1 English(EN) · 2026-05-08 12:34

OpenAI is pushing deeper into voice. The company just launched three new realtime audio models in its API: GPT-Realtime-2 for conversational reasoning, GPT-Realtime-Translate for live multilingual translation, and GPT-Realtime-Whisper for streaming speech transcription. https:// …

LINKS firethering.com/openai-new-voice-models-r…

Mastodon — fosstodon.org TIER_1 English(EN) · 2026-05-08 02:51

OpenAI has released new voice intelligence features in its API, aimed at customer service, education and creator platforms. The tools enable real-time spoken interaction beyond basic transcription. https://techcrunch.com/2026/05/07/openai-launches-new-voice-intelligence-feature…

LINKS techcrunch.com/…/openai-launches-new-voic…

Mastodon — mastodon.social TIER_1 English(EN) · rhodzy · 2026-05-10 01:00

New blog post: "When AI Finally Starts Talking Back (Properly)". OpenAI's move to low-latency voice AI isn't just a tech upgrade; it's a fundamental shift that makes truly conversational AI a reality, with massive implications for everything from health tech to gaming. https://rhod…

LINKS rhodzy.com/…/when-ai-finally-starts-talki…

Mastodon — mastodon.social TIER_1 English(EN) · 2026-05-08 23:10

Advancing voice intelligence with new models in the API openai.com/index/advancin… #AI #voice #translation #OpenAI

Mastodon — mastodon.social TIER_1 English(EN) · 2026-05-08 23:07

Tracking the AI Evolution in Research Libraries: Findings from ARL’s 2026 AI Quick Poll — Association of Research Libraries www.arl.org/blog/tracking-… #AI #libraries

LINKS arl.org/…/tracking-the-ai-evolution-in-re…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 08:12

📰 Realtime Audio Models 2026: OpenAI Unveils GPT-Realtime-2, Translate & Whisper OpenAI has introduced three new realtime audio models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—designed to transform live voice applications with reasoning, translation, and l…

LINKS aihaberleri.org/…/realtime-audio-models-2…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 08:11

📰 OpenAI Redefines Conversational AI with Real-Time Voice Models in 2026: GPT-4o, Whisper ... OpenAI has announced three new voice models set to revolutionize real-time conversational capabilities: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. These models …

LINKS aihaberleri.org/…/openai-2026da-gercek-za…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 07:33

📰 GPT-Realtime-2 Launches in 2026: 15.2% Faster Voice AI with Realtime Translation & Transcription OpenAI has unveiled GPT-Realtime-2, -Translate, and -Whisper — a suite of next-generation realtime voice APIs that set new state-of-the-art benchmarks in speech understanding and tr…

LINKS aihaberleri.org/…/gpt-realtime-2-launches…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 07:32

📰 GPT-Realtime-2 Sets a New Standard for Real-Time Voice APIs in 2026. OpenAI has revolutionized real-time voice processing with GPT-Realtime-2, -Translate, and -Whisper. These new APIs completely redefine the naturalness and speed of spoken dialogue.... #YapayZekaAraçl…

LINKS aihaberleri.org/…/gpt-realtime-2-ile-2026…

Mastodon — mastodon.social TIER_1 日本語(JA) · 2026-05-08 04:51

OpenAI's new voice model "GPT-Realtime-2" also offers instant translation and low-latency transcription https://www.watch.impress.co.jp/docs/news/2107115.html #watch_impress #ChatGPT #Tech #AI

LINKS watch.impress.co.jp/…/2107115.html

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 01:46

📰 OpenAI Voice Models 2026: GPT-Realtime-2, Whisper & Translate Revolutionize Real-Time Transcription OpenAI has introduced groundbreaking voice models including GPT-Realtime-2, Translate, and Whisper, revolutionizing real-time speech processing. These models enhance transcriptio…

LINKS aihaberleri.org/…/openai-voice-models-202…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 01:46

📰 Voice Communication Revolution in 2026: Real-Time Transcrip... with GPT-Realtime-2, Whisper, and Translate. OpenAI has reached a milestone in voice AI: with GPT-Realtime-2, Translate, and Whisper, real-time conversation, translation, and transcription are now, with respect to human language, far more…

LINKS aihaberleri.org/…/2026da-sesli-iletisim-d…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 01:45

📰 AlphaEvolve: How Google DeepMind’s Gemini AI Transforms Algorithm Design in 2026 AlphaEvolve, a Gemini-powered coding agent developed by Google DeepMind, is reshaping how advanced algorithms are designed across scientific and engineering domains. By autonomously generating and …

LINKS aihaberleri.org/…/alphaevolve-how-google-…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 01:45

📰 AlphaEvolve Generates Code with Gemini AI: Automating Scientific Discovery (2026). Google DeepMind has introduced AlphaEvolve, which uses its Gemini AI to design algorithms automatically. The system does not stop at generating code; with its capacity to solve scientific problems, it …

LINKS aihaberleri.org/…/gemini-ai-ile-kod-urete…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 01:19

📰 2026’s Best Real-Time Speech Processing APIs: OpenAI Whisper, GPT-4o & More OpenAI has unveiled a next-generation voice API suite capable of real-time speech processing, integrating advanced inference, translation, and transcription. This innovation aims to redefine human-AI in…

LINKS aihaberleri.org/…/2026s-best-real-time-sp…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 01:18

📰 OpenAI Voice APIs 2026: An AI Revolution with Real-Time Voice Processing. OpenAI has announced three new APIs that redefine the future of voice interaction: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. These technologies … AI's capacity to communicate by voice…

LINKS aihaberleri.org/…/openai-ses-apileri-2026…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 00:12

📰 Audio Models in API 2026: Build Voice Apps 70% Faster with OpenAI’s New Tools OpenAI has introduced three new audio models in its API, empowering developers to build advanced voice applications. This move aligns with broader industry efforts to standardize AI transparency and b…

LINKS aihaberleri.org/…/audio-models-in-api-202…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 00:12

📰 New Voice Models 2026: A Revolution in Voice APIs and AI Voice Applications for Developers. Three new voice models that will change the future of voice-based applications for developers have been announced. This step fundamentally redefines AI's capacity to interact through voice...…

LINKS aihaberleri.org/…/yeni-ses-modelleri-2026…

Mastodon — mastodon.social TIER_1 English(EN) · aihaberleri · 2026-05-08 00:11

📰 How OpenAI Codex Got Me Banned from Reddit (2026) Codex usage in development is sparking debate after a builder was banned from Reddit for disclosing AI tool use. The incident highlights growing tensions between AI efficiency and community transparency norms.... #AINews #AI #…

LINKS aihaberleri.org/…/how-openai-codex-got-me…

Mastodon — mastodon.social TIER_1 Türkçe(TR) · aihaberleri · 2026-05-08 00:11

📰 Was OpenAI Codex Use Banned on Reddit in 2026? A Critical Warning for Developers. While OpenAI's Codex tools have become hugely popular among developers, account bans over their use have started to occur on Reddit. This is not just a technical problem, …

LINKS aihaberleri.org/…/openai-codex-kullanimi-…

Mastodon — mastodon.social TIER_1 Svenska(SV) · redaktionen · 2026-05-07 23:14

OpenAI's New Voice Intelligence: A Revolution for Customer Service and Education https://redaktionen.net/artikel/985 #ai #svtech

LINKS redaktionen.net/…/985

Mastodon — mastodon.social TIER_1 English(EN) · sagalinked · 2026-05-07 22:46

📰 OpenAI has introduced new voice intelligence features to its API, which could be beneficial for customer service systems and have applications across various fields such as education and creator platforms. 🔗 https://techcrunch.com/2026/05/07/openai-launches-new-voice-intellig…

LINKS techcrunch.com/…/openai-launches-new-voic…

Mastodon — mastodon.social TIER_1 English(EN) · 2026-05-06 02:52

Inworld AI has launched Realtime TTS-2, a closed-loop voice model that adapts to how users actually talk. Unlike traditional text-to-speech systems, TTS-2 hears the full audio context of each conversation - not just transcripts - allowing it to detect tone, pacing and emotion. ht…

LINKS marktechpost.com/…/inworld-ai-launches-re…

Mastodon — mastodon.social TIER_1 English(EN) · 2026-05-05 08:22

OpenAI serves 900M weekly users with voice AI, but traditional pipelines caused unacceptable latency. This deep dive reveals their ingenious solution: an "audio-native" architecture built on a re-engineered WebRTC stack. They tackled "one-port-per-session" and stateful protocol i…

LINKS thepixelspulse.com/…/openai-low-latency-v…

Mastodon — mastodon.social TIER_1 Svenska(SV) · redaktionen · 2026-05-04 21:03

OpenAI's New Voice AI: Faster Than Ever with Low Latency https://redaktionen.net/artikel/881 #ai #svtech

LINKS redaktionen.net/…/881

COVERAGE [45]
