PulseAugur
EN
LIVE 16:05:58
中文(ZH) 97毫秒!百度 PP-OCRv6 把 OCR 带进毫秒时代

Baidu's PP-OCRv6 achieves 97ms inference, leads global OCR benchmarks

Baidu's Wenxin officially released the new OCR model PP-OCRv6, offering Tiny, Small, and Medium versions that support over 50 languages and are deployable across various scenarios from browsers to servers. The Tiny model, weighing just 1.5MB, can perform OCR in as little as 97 milliseconds directly within a browser, enhancing privacy and reducing deployment barriers. PP-OCRv6 has set new benchmarks in OCR performance, outperforming major multimodal models in specialized OCR tasks and solidifying PaddleOCR's position as a leading open-source OCR project. AI

IMPACT Sets new SOTA for browser-based OCR, potentially accelerating AI agent capabilities in edge and privacy-sensitive applications.

RANK_REASON New OCR model release with performance benchmarks and deployment details. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 雷峰网 (Leiphone) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Baidu's PP-OCRv6 achieves 97ms inference, leads global OCR benchmarks

COVERAGE [1]

  1. 雷峰网 (Leiphone) TIER_1 中文(ZH) ·

    97 milliseconds! Baidu PP-OCRv6 brings OCR into the millisecond era

    <p>近日,百度文心正式发布新一代OCR模型PP-OCRv6,一次性推出Tiny、Small、Medium三档模型,支持&nbsp;50&nbsp;多种语言,覆盖浏览器端、嵌入式设备到服务器等主流场景。公开结果显示,PP-OCRv6再次刷新OCR领域评测纪录,综合性能位居全球第一。</p><p>其中,PP-OCRv6 Tiny的尺寸仅1.5MB,可直接部署于本地浏览器环境,单图预测最快仅需&nbsp;97&nbsp;毫秒。用户数据无需上传云端即可完成OCR处理,在保障隐私安全的同时,大幅降低部署门槛。有开发者评价,PP-OCRv6可能是全球唯一可在浏览器…