PulseAugur / Brief
EN
LIVE 12:18:28

Brief

last 24h
[1/1] 223 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Whisfusion: Parallel ASR Decoding with Masked Diffusion

    Researchers have developed Whisfusion, a novel non-autoregressive system for automatic speech recognition (ASR) that utilizes masked diffusion models. This approach aims to match the accuracy of traditional autoregressive models while significantly improving inference speed. Whisfusion achieves this by training a diffusion decoder on top of frozen Whisper-large-v3 audio embeddings, enabling parallel decoding and outperforming existing models in both speed and accuracy across multiple languages. AI

    IMPACT Establishes masked diffusion as a viable, high-throughput alternative for multilingual ASR, potentially accelerating real-time transcription applications.