PulseAugur
实时 23:13:02
English(EN) Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training

Transcoda系统实现零样本光学音乐识别

研究人员开发了Transcoda,一种用于光学音乐识别(OMR)的新型系统,可以将乐谱转录为文本格式。该系统通过采用先进的合成数据生成管道和基于语法的解码方法,解决了带注释数据集稀缺的问题。Transcoda拥有紧凑的5900万参数模型,实现了最先进的性能,超越了更大的模型,并显著降低了历史音乐扫描的错误率。 AI

影响 推进OMR能力,可能为音乐分析和数字化提供新工具。

排序理由 发表了一篇详细介绍新系统及其在基准测试上性能的学术论文。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Transcoda系统实现零样本光学音乐识别

报道来源 [2]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training

    Optical Music Recognition (OMR), the task of transcribing sheet music into a structured textual representation, is currently bottlenecked by a lack of large-scale, annotated datasets of real scans. This forces models to rely on either few-shot transfer or synthetic training pipel…

  2. arXiv cs.CV TIER_1 English(EN) · Paul Swoboda ·

    Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training

    Optical Music Recognition (OMR), the task of transcribing sheet music into a structured textual representation, is currently bottlenecked by a lack of large-scale, annotated datasets of real scans. This forces models to rely on either few-shot transfer or synthetic training pipel…