PulseAugur
EN
LIVE 04:07:32

Omi Health releases open-weight medical ASR model

Omi Health founder has released Omi Med STT v1, a fine-tuned version of NVIDIA's Parakeet TDT 0.6B model for medical Automatic Speech Recognition (ASR). This open-weight model is designed to run locally on devices, ensuring patient audio privacy. Benchmarked against other models, Omi Med STT v1 demonstrates competitive performance in medical word error rate (M-WER) while being significantly smaller and faster than larger models. AI

IMPACT Enables local, private transcription of medical audio, potentially improving patient data security and workflow efficiency for smaller clinics.

RANK_REASON This is a fine-tuned open-source model release, not a frontier model release from a major lab. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Omi Health releases open-weight medical ASR model

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/MajesticAd2862 ·

    I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1u0q5h9/i_finetuned_parakeet_06b_for_medical_asr_open/"> <img alt="I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU" src="https://preview.redd.it/qpcsb19ll36h1.png?width=140&amp;he…