Deepgram
PulseAugur coverage of Deepgram — every cluster mentioning Deepgram across labs, papers, and developer communities, ranked by signal.
4 day(s) with sentiment data
-
AssemblyAI enhances medical transcription accuracy with new 'Medical Mode'
AssemblyAI has introduced a new "Medical Mode" for its Universal-3 Pro and Universal-3.5 Pro Realtime speech-to-text models. This feature, activated by a single configuration parameter, aims to reduce missed medical ent…
-
AssemblyAI claims medical transcription accuracy edge over Deepgram
AssemblyAI has released a new blog post comparing its medical transcription capabilities against Deepgram's. The post highlights AssemblyAI's Universal-3 Pro model with Medical Mode, claiming superior accuracy on comple…
-
Top 5 Speechmatics Alternatives for Advanced Voice AI in 2026
This guide compares five alternatives to Speechmatics for speech-to-text services, highlighting AssemblyAI, Deepgram, Google Cloud Speech-to-Text, OpenAI Whisper, and AWS Transcribe. The market for speech-based Natural …
-
AssemblyAI Compares Top 5 Deepgram Speech-to-Text API Alternatives
This article compares five alternatives to Deepgram's speech-to-text API, including AssemblyAI, Google Cloud Speech-to-Text, AWS Transcribe, and OpenAI Whisper. The comparison focuses on key factors such as accuracy, pr…
-
AssemblyAI claims Universal-3 Pro beats Deepgram Nova-3 on critical speech-to-text accuracy
AssemblyAI has published a comparison of its Universal-3 Pro model against Deepgram's Nova-3 for speech-to-text services. The comparison emphasizes "missed-entity rate" over traditional Word Error Rate (WER), arguing th…
-
AssemblyAI's STT model favored by AI coding assistants for voice agents
AssemblyAI is highlighting its Universal-3 Pro Streaming model as a key component for building effective AI voice agents. The company's blog posts demonstrate how developers can use "vibe coding" with tools like ChatGPT…
-
AssemblyAI: Hidden costs of speech-to-text outweigh base rates
AssemblyAI argues that the advertised per-hour cost of speech-to-text APIs is misleading, as hidden expenses like human correction labor and downstream failures can multiply the actual cost. The company emphasizes that …
-
Voice AI latency benchmark: End-to-end models beat cascades
A recent benchmark of five voice AI stacks revealed that only two consistently responded under the critical 300ms latency threshold. The author found that voice-to-voice end-to-end models, which collapse STT, LLM, and T…
-
Developer builds privacy-first AI app using local audio capture
The developer built a privacy-focused AI application called Plan AI that avoids intrusive meeting bots by capturing system audio locally. This application uses Electron for the desktop interface and a distributed pipeli…
-
Together AI launches Voice Finder for 600+ TTS voices
Together AI has launched Voice Finder, a new tool designed to help developers quickly select the most suitable voice for their applications from a catalog of over 600 options. The tool allows users to search for voices …
-
Curated learning path guides developers in building real-time voice AI agents
A new GitHub repository, "Voice-AI-for-Beginners," offers a structured learning path for developers to build real-time voice AI agents. The guide covers the entire process from initial speech-to-text calls to scaling pr…
-
Speech models fail on street names, especially for non-native speakers
Researchers at Together AI have found that current state-of-the-art speech recognition models exhibit a significant failure rate, averaging 39% error in transcribing street names, particularly for non-native English spe…
-
Rowboat launches open-source AI coworker that builds knowledge graphs
Rowboat, an open-source AI coworker, has been released, allowing users to create a personal knowledge graph from their work data. This tool connects to email and meeting notes to build a persistent, local knowledge base…
-
New benchmarks and platforms advance voice agent evaluation and development
New research introduces EVA-Bench, a comprehensive framework for evaluating voice agents, addressing challenges in simulating realistic conversations and measuring performance across various failure modes. Simultaneousl…
-
April launches voice AI assistant for email and calendar management
April, a new voice-controlled AI assistant, has launched on the App Store to manage emails and calendars. The application allows users to dictate replies, summarize messages, and reschedule meetings hands-free. It utili…
-
Together AI integrates Deepgram voice models, launches fast Whisper STT
Together AI has launched new speech-to-text (STT) and text-to-speech (TTS) capabilities, integrating Deepgram's advanced voice models and its own high-performance Whisper V3 API. This move aims to streamline the develop…