PulseAugur
Chinese (ZH): Baidu Wenxin releases PaddleOCR-VL-1.6: accuracy exceeds 96.33%, setting a new document-parsing SOTA

Baidu's PaddleOCR-VL-1.6 sets new SOTA in document parsing

Baidu's Wenxin has released PaddleOCR-VL-1.6, a new version of its open-source OCR tool. The update reports accuracy above 96.33% on the OmniDocBench v1.6 benchmark, surpassing major models such as Gemini-3-Pro and GPT-5.2. The model shows significant improvements in parsing difficult inputs, including scanned papers, bent documents, and screen captures, which positions it as a leading option for document digitization.

IMPACT Sets new SOTA on document parsing benchmarks, potentially accelerating enterprise adoption of advanced OCR solutions.

RANK_REASON New version of a specialized OCR model released by a major tech company, achieving state-of-the-art results on industry benchmarks.

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 Chinese (ZH) · Friends of QbitAI (量子位的朋友们)

    Baidu Wenxin Releases PaddleOCR-VL-1.6: Accuracy Exceeds 96.33%, Setting a New Document Parsing SOTA

    Now live on the official PaddleOCR website, with support for both web and API access.