Tesseract
PulseAugur coverage of Tesseract — every cluster mentioning Tesseract across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
University seeks on-premise document parsing tools for data governance
A university IT department is seeking an on-premise document processing solution to index and search administrative PDFs, class schedules, and meeting notes. Due to data governance policies, cloud-based APIs are not an …
-
Mamba models offer faster OCR but lag Transformer accuracy on historical texts
Researchers have benchmarked State-Space Models (SSMs), specifically Mamba, against Transformers and BiLSTMs for Optical Character Recognition (OCR) on historical newspapers. The studies indicate that while Mamba-based …
-
Baidu's PP-OCRv6 achieves 97ms inference, leads global OCR benchmarks
Baidu's Wenxin officially released the new OCR model PP-OCRv6, offering Tiny, Small, and Medium versions that support over 50 languages and are deployable across various scenarios from browsers to servers. The Tiny mode…
-
AI agent built to safely summarize patient discharge data
This article details the creation of an AI agent designed to summarize patient discharge information from PDF documents. The agent focuses on extracting structured data like diagnoses, medications, and allergies, priori…
-
Baidu's PaddleOCR-VL-1.6 sets new SOTA in document parsing
Baidu's Wenxin has released PaddleOCR-VL-1.6, a new version of its open-source OCR tool. This update achieves over 96.33% accuracy on the OmniDocBench v1.6 benchmark, surpassing major models like Gemini-3-Pro and GPT-5.…
-
Local Document AI Needs OCR, RAG, and Local Inference
Building a fully local document AI system requires more than just running a language model on a local machine. It necessitates a complete pipeline that includes Optical Character Recognition (OCR) for document parsing, …
-
New OCR pipeline enhances retail bill digitization with adaptive enhancement
Researchers have developed and benchmarked an adaptive Optical Character Recognition (OCR) pipeline specifically designed for digitizing diverse retail bills. This system incorporates a CNN-based enhancement module, an …
-
New OCR pipeline enhances retail bill digitization with adaptive enhancement
Researchers have developed and benchmarked an adaptive Optical Character Recognition (OCR) pipeline designed for digitizing retail bills across various commercial sectors. The system incorporates a CNN-based image enhan…