Researchers have developed FormalASR, a novel end-to-end system designed to convert spoken Chinese directly into formal written text. This approach bypasses the need for a separate post-editing step by an LLM, reducing latency and computational costs. The system utilizes two models, 0.6B and 1.7B parameters, fine-tuned from Qwen3-ASR, and is trained on newly created large-scale datasets, WenetSpeech-Formal and Speechio-Formal. AI
IMPACT Offers a more efficient and direct method for transcribing spoken language into formal text, potentially improving downstream NLP applications.
RANK_REASON This is a research paper describing a new model and dataset. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →