PulseAugur
实时 19:00:06
English(EN) DTBench: A Synthetic Benchmark for Document-to-Table Extraction

新基准和模型推动文档解析和表格提取发展

研究人员推出了新的基准和改进的模型,用于文档解析和表格提取。Dr. DocBench 专注于专家级别的文档解析,包括化学式和音乐符号等复杂结构,突出了当前模型的局限性。DTBench 提供了一个用于文档到表格提取的合成基准,评估 LLM 的推理和冲突解决能力。此外,PaddleOCR-VL-1.6 通过区域感知优化和渐进式后训练得到了增强,在 OmniDocBench v1.6 上取得了最先进的结果。 AI

影响 文档和表格提取基准和模型的进步将提高 AI 处理和分析复杂文档和数据的能力。

排序理由 多篇研究论文介绍了用于文档解析和表格提取的新基准和模型改进。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 6 个来源。 我们如何撰写摘要 →

报道来源 [6]

  1. arXiv cs.AI TIER_1 English(EN) · Minglai Yang, Xinyan Velocity Yu, Pengyuan Li, Xinyu Guo, Zhenting Qi, Konwoo Kim, Longtian Ye, Xiaolong Luo, Jinhe Bi, Henry Zhang, Haris Riaz, Xuan Zhang, Yunze Xiao, Bangya Liu, Tom Tang, Yunfei Zhao, Qunshu Lin, Zihan Wang, Minghao Liu, Michael Lingz… ·

    Dr. DocBench:专家级和困难文档解析的综合基准测试

    arXiv:2606.01393v1 Announce Type: cross Abstract: Document parsing and recognition are fundamental capabilities for vision-language models (VLMs) and document processing systems. However, existing Optical Character Recognition (OCR) and document parsing benchmarks are increasingl…

  2. arXiv cs.AI TIER_1 English(EN) · Pius Horn, Janis Keuper ·

    超越字符串匹配:PDF表格提取的语义评估

    arXiv:2603.18652v2 Announce Type: replace-cross Abstract: Reliably extracting tables from PDFs is essential for large-scale scientific data mining and knowledge base construction, yet existing evaluation approaches rely on rule-based metrics that fail to capture semantic equivale…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    PaddleOCR-VL-1.6:通过欠优化区域精炼和渐进式后训练拓展文档解析前沿

    PaddleOCR-VL-1.6 enhances document parsing performance through targeted data optimization and progressive post-training techniques, achieving state-of-the-art results on OmniDocBench v1.6.

  4. arXiv cs.AI TIER_1 English(EN) · Yuxiang Guo, Zhuoran Du, Nan Tang, Kezheng Tang, Congcong Ge, Yunjun Gao ·

    DTBench:文档到表格提取的合成基准测试

    arXiv:2602.13812v3 Announce Type: replace-cross Abstract: Document-to-table (Doc2Table) extraction derives structured tables from unstructured documents under a target schema, enabling reliable and verifiable SQL-based data analytics. Although large language models (LLMs) have sh…

  5. arXiv cs.CV TIER_1 English(EN) · Zelun Zhang, Hongen Liu, Suyin Liang, Yubo Zhang, Yiqing Xiang, Jiaxuan Liu, Ting Sun, Manhui Lin, Yue Zhang, Changda Zhou, Tingquan Gao, Cheng Cui, Yi Liu, Dianhai Yu, Yanjun Ma ·

    PaddleOCR-VL-1.6:通过欠优化区域精炼和渐进式后训练拓展文档解析前沿

    arXiv:2606.03264v1 Announce Type: new Abstract: We introduce PaddleOCR-VL-1.6, an upgraded compact document parsing model built upon PaddleOCR-VL-1.5. Although PaddleOCR-VL-1.5 establishes a strong 0.9B baseline, its remaining errors concentrate in under-optimized regions where m…

  6. arXiv cs.CV TIER_1 English(EN) · Brandon Smock, Valerie Faucon-Morin, Max Sokolov, Libin Liang, Tayyibah Khanam, Amrit Ramesh, Maury Courtland ·

    PubTables-v2:用于整页和多页表格提取的新型大规模数据集

    arXiv:2512.10888v3 Announce Type: replace Abstract: Table extraction (TE) is a key challenge in document understanding. Traditional approaches detect tables first, then recognize their structure. Recently, interest has surged in developing methods, such as vision-language models …