Brief · PulseAugur

COMMENTARY · r/LocalLLaMA Deutsch(DE) · 2h

Whisper.cpp is underwhelming

A user on the r/LocalLLaMA subreddit expressed disappointment with the performance of whisper.cpp, a local speech-to-text model. Despite using the ggml-large-v3 model, the user experienced persistent hallucinations and repetitive sentences after approximately 20 minutes of transcription. They are seeking advice on potential solutions or workarounds, such as segmenting audio files. AI

IMPACT User reports indicate potential issues with local speech-to-text models, suggesting a need for further development or optimization.

r/LocalLLaMA
whisper.cpp
ggml-large-v3