Baidu's Wenxin has released PaddleOCR-VL-1.6, a new version of its open-source OCR tool. This update achieves over 96.33% accuracy on the OmniDocBench v1.6 benchmark, surpassing major models like Gemini-3-Pro and GPT-5.2. The model demonstrates significant improvements in understanding complex documents, including scanned papers, bent documents, and screen captures, making it a leading solution for document digitization.
AI IMPACT Sets new SOTA on document parsing benchmarks, potentially accelerating enterprise adoption of advanced OCR solutions.
RANK_REASON New version of a specialized OCR model released by a major tech company, achieving state-of-the-art results on industry benchmarks.
- Baidu
- Gemini-3-Pro
- GLM-OCR
- GPT-5.2
- MinerU-2.5-Pro
- OmniDocBench v1.6
- PaddleOCR-VL-1.5
- PaddleOCR-VL-1.6
- Real5-OmniDocBench
AI-generated summary · Google Gemini · from 1 source.