PulseAugur
实时 11:39:20

New PureDocBench benchmark reveals document parsing is far from solved

Researchers have introduced PureDocBench, a new benchmark for document parsing that addresses issues with the existing OmniDocBench dataset, which suffers from annotation errors and potential contamination. PureDocBench is programmatically generated and source-traceable, offering a more reliable evaluation across clean, digitally degraded, and real-world document settings. Initial evaluations on 40 models reveal that document parsing is far from solved, with significant performance gaps between models and a shared bottleneck in formula recognition. AI

影响 PureDocBench provides a more reliable evaluation for document parsing models, highlighting current limitations and guiding future research.

排序理由 The cluster describes a new benchmark for evaluating document parsing models, along with findings from its initial application. [lever_c_demoted from research: ic=1 ai=1.0]

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New PureDocBench benchmark reveals document parsing is far from solved

报道来源 [1]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    How Far Is Document Parsing from Solved? PureDocBench: A Source-TraceableBenchmark across Clean, Degraded, and Real-World Settings

    The past year has seen over 20 open-source document parsing models, yet thefield still benchmarks almost exclusively on OmniDocBench, a 1,355-pagemanually annotated dataset whose top scores have saturated above 90%. Athree-stage audit pipeline we run on OmniDocBench screens its 2…