Tesseract

ENTITY Tesseract

Tesseract

PulseAugur coverage of Tesseract — every cluster mentioning Tesseract across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

8

8 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

4

4 over 90d

TIER MIX · 90D

research 4
tool 3
commentary 1

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL

TOOL · CL_105874 · Jun 23 · 13:33

University seeks on-premise document parsing tools for data governance

A university IT department is seeking an on-premise document processing solution to index and search administrative PDFs, class schedules, and meeting notes. Due to data governance policies, cloud-based APIs are not an …
RESEARCH · CL_105258 · Jun 22 · 16:07

Mamba models offer faster OCR but lag Transformer accuracy on historical texts

Researchers have benchmarked State-Space Models (SSMs), specifically Mamba, against Transformers and BiLSTMs for Optical Character Recognition (OCR) on historical newspapers. The studies indicate that while Mamba-based …
SIGNIFICANT · CL_91830 · Jun 15 · 08:51

Baidu's PP-OCRv6 achieves 97ms inference, leads global OCR benchmarks

Baidu's Wenxin officially released the new OCR model PP-OCRv6, offering Tiny, Small, and Medium versions that support over 50 languages and are deployable across various scenarios from browsers to servers. The Tiny mode…
TOOL · CL_74626 · Jun 6 · 08:01

AI agent built to safely summarize patient discharge data

This article details the creation of an AI agent designed to summarize patient discharge information from PDF documents. The agent focuses on extracting structured data like diagnoses, medications, and allergies, priori…
SIGNIFICANT · CL_66398 · Jun 2 · 07:47

Baidu's PaddleOCR-VL-1.6 sets new SOTA in document parsing

Baidu's Wenxin has released PaddleOCR-VL-1.6, a new version of its open-source OCR tool. This update achieves over 96.33% accuracy on the OmniDocBench v1.6 benchmark, surpassing major models like Gemini-3-Pro and GPT-5.…
COMMENTARY · CL_26679 · May 11 · 13:38

Local Document AI Needs OCR, RAG, and Local Inference

Building a fully local document AI system requires more than just running a language model on a local machine. It necessitates a complete pipeline that includes Optical Character Recognition (OCR) for document parsing, …
RESEARCH · CL_13537 · Apr 28 · 03:31

New OCR pipeline enhances retail bill digitization with adaptive enhancement

Researchers have developed and benchmarked an adaptive Optical Character Recognition (OCR) pipeline specifically designed for digitizing diverse retail bills. This system incorporates a CNN-based enhancement module, an …
RESEARCH · CL_08224 · Apr 28 · 03:31

New OCR pipeline enhances retail bill digitization with adaptive enhancement

Researchers have developed and benchmarked an adaptive Optical Character Recognition (OCR) pipeline designed for digitizing retail bills across various commercial sectors. The system incorporates a CNN-based image enhan…