PulseAugur

Whisper.cpp user reports hallucination and repetition issues

A user on the r/LocalLLaMA subreddit expressed disappointment with whisper.cpp, a C/C++ implementation of OpenAI's Whisper speech-to-text model that runs locally. Using the ggml-large-v3 model, the user found that after about 20 minutes of transcription the output hallucinates a sentence and then repeats it until the end of the file. They asked whether they were missing a setting or should split the audio into shorter segments.
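The segmenting workaround the user mentions could look roughly like the sketch below. It is not taken from the post or from whisper.cpp: the function name `split_wav`, the `chunk_NNN.wav` naming, and the 10-minute default are illustrative assumptions, and it only handles uncompressed WAV input using Python's standard `wave` module.

```python
import wave
from pathlib import Path


def split_wav(src, out_dir, chunk_seconds=600):
    """Split a WAV file into consecutive chunks of at most chunk_seconds each.

    Hypothetical helper for illustration; returns the chunk paths in order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunk_paths = []

    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        # Number of audio frames that fit in one chunk at this sample rate.
        frames_per_chunk = int(reader.getframerate() * chunk_seconds)
        index = 0
        while True:
            frames = reader.readframes(frames_per_chunk)
            if not frames:
                break  # end of input reached
            chunk_path = out_dir / f"chunk_{index:03d}.wav"
            with wave.open(str(chunk_path), "wb") as writer:
                # Copy channels, sample width, and rate from the source;
                # the frame count in the header is corrected on close.
                writer.setparams(params)
                writer.writeframes(frames)
            chunk_paths.append(chunk_path)
            index += 1

    return chunk_paths
```

Each chunk can then be transcribed on its own and the texts concatenated in order. Fixed-length cuts may fall in the middle of a word, so cutting at pauses or overlapping chunks slightly would likely give cleaner results; whether segmenting actually avoids the repetition reported here is not confirmed by the source.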

IMPACT A single user report describes a repetition failure in long whisper.cpp transcriptions, suggesting that long-audio handling in local speech-to-text tools may need further tuning or workarounds.

RANK_REASON User-generated feedback on the performance of a specific piece of software.


AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 Deutsch(DE) · /u/Larkonath ·

    Whisper.cpp is underwhelming

    Hi, I'm running whisper.cpp with the best model I could find (ggml-large-v3) but after about 20 min of transcription it hallucinates a sentence that it will repeat endlessly until the end.

    Is there something I'm missing or should I cut my …