PulseAugur
LIVE 00:09:41
ENTITY optical character recognition

optical character recognition

PulseAugur coverage of optical character recognition — every cluster mentioning optical character recognition across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 11 TOTAL
  1. TOOL · CL_20775 ·

    Consensus Entropy improves VLM OCR accuracy by measuring inter-model agreement

    Researchers have developed a new metric called Consensus Entropy (CE) to assess the reliability of Optical Character Recognition (OCR) outputs from Vision-Language Models (VLMs). CE measures the agreement between multip…

  2. RESEARCH · CL_18242 ·

    New CC-OCR V2 benchmark reveals LMMs fall short in real-world document processing

    A new benchmark, CC-OCR V2, has been released to evaluate Large Multimodal Models (LMMs) on real-world document processing tasks. The benchmark includes 7,093 challenging samples across five OCR-centric tracks, addressi…

  3. TOOL · CL_15586 ·

    New OCR benchmark reveals accuracy doesn't guarantee RAG performance

    A new benchmark has been developed to evaluate the robustness of Optical Character Recognition (OCR) systems specifically for Retrieval-Augmented Generation (RAG) applications. Current OCR benchmarks using character-lev…

  4. TOOL · CL_15804 ·

    AI classifies historical document pages for tailored content processing

    Researchers have developed an AI-powered image classification system to automatically categorize pages from historical documents. This system aims to streamline the processing of digitized archives by identifying differ…

  5. TOOL · CL_10878 ·

    Sun Finance boosts ID verification accuracy with generative AI on AWS

    Sun Finance, a Latvian fintech company, has successfully automated its identity document extraction and fraud detection processes using generative AI on Amazon Web Services (AWS). The new system, developed in partnershi…

  6. RESEARCH · CL_08205 ·

    Researchers release dataset of AI-generated images from GPT-Image-2's first week

    Researchers have released a dataset of over 10,000 images generated by OpenAI's GPT-Image-2, collected in the first week following its April 21, 2026 release. The dataset, sourced from Twitter/X, was curated using a mul…

  7. RESEARCH · CL_06544 ·

    iWatchRoad system uses YOLO to detect and map potholes for smart cities

    Researchers have developed iWatchRoad, an end-to-end system designed for the scalable detection and geospatial visualization of potholes. The system utilizes a fine-tuned YOLO model for real-time pothole identification …

  8. RESEARCH · CL_06492 ·

    New dataset and methods tackle low-light scene text recognition challenges

    Researchers have introduced LSTR, a large-scale dataset for low-light scene text recognition, and ESTR, a smaller evaluation set of real nighttime street scenes. They explored two approaches: fine-tuning existing OCR mo…

  9. RESEARCH · CL_06398 ·

    HalalBench benchmark tackles OCR challenges for multilingual food packaging ingredient extraction

    Researchers have introduced HalalBench, a new multilingual benchmark designed to evaluate Optical Character Recognition (OCR) performance specifically on food packaging ingredient labels. The benchmark addresses the uni…

  10. RESEARCH · CL_03553 ·

    Older, cheaper LLMs often match premium OCR accuracy at lower cost

    Researchers have open-sourced a new benchmark and framework for evaluating Optical Character Recognition (OCR) performance across 18 different large language models (LLMs). Their analysis, involving over 7,500 calls, re…

  11. TOOL · CL_17567 ·

    Lume enables macOS VMs for AI agents and CI/CD on Apple Silicon

    Lume is an open-source command-line tool that enables the creation and management of macOS and Linux virtual machines on Apple Silicon hardware. It leverages Apple's Virtualization Framework for near-native performance …