Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · r/LocalLLaMA English(EN) · 2d

Tried to benchmark Google’s new on-device dictation models (Eloquent) and basically couldn’t

A user attempting to benchmark Google's new on-device dictation app, Eloquent, found it to be largely unusable due to frequent failures to transcribe audio. While the app's accuracy was competitive when it did function, it often returned incomplete or no transcriptions, leading the user to suspect it behaves like a chat-style AI that sometimes responds about the audio rather than transcribing it. The user also tested Google's open model Gemma 3n, which exhibited similar issues, suggesting Eloquent might be hiding these problems. AI

IMPACT The unreliability of Google's Eloquent dictation app highlights challenges in deploying on-device AI for speech recognition, potentially slowing adoption.
COMMENTARY · r/MachineLearning English(EN) · 3d

What will be the next breakthrough in ASR? [D]

The field of Automatic Speech Recognition (ASR) is seeing rapid advancements driven by two primary factors: the increasing availability of pseudo-labeled data and the emergence of new model architectures. While models like Whisper-large-v3 and Nvidia Parakeet v3 demonstrate the power of large-scale supervised training, the discussion questions whether self-supervised learning approaches will be phased out for ASR tasks. This contrasts with computer vision, where self-supervised methods like Dinov3 are highly performant, prompting speculation about a similar breakthrough in speech processing. AI

IMPACT Discussion explores the potential shift from self-supervised to supervised learning in ASR, impacting future model development and research focus.

Brief

Tried to benchmark Google’s new on-device dictation models (Eloquent) and basically couldn’t

What will be the next breakthrough in ASR? [D]