ENTITY OmniDocBench

OmniDocBench

PulseAugur coverage of OmniDocBench — every cluster mentioning OmniDocBench across labs, papers, and developer communities, ranked by signal.

Total · 30d

7

7 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

4

4 over 90d

TIER MIX · 90D

frontier release 1
research 4
tool 2

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL

SIGNIFICANT · CL_114231 · Jun 28 · 06:04

Baidu releases Unlimited OCR, challenging long-context AI memory mechanisms · 1 source tracked

Baidu has open-sourced a new OCR model called Unlimited OCR, which excels at processing long documents by mimicking human reading habits. Unlike traditional OCR systems that process documents page by page and then stitc…
TOOL · CL_108999 · Jun 24 · 16:26

Open-source OCR models and benchmarks consolidated on Papers with Code

A new resource has been created to track open-source optical character recognition (OCR) models, consolidating information on top-performing models, benchmarks, and links to their papers and code. This initiative highli…
FRONTIER RELEASE · CL_103597 · Jun 19 · 09:40

Baidu releases Unlimited OCR with constant KV cache for long documents

Baidu has released Unlimited OCR, a 3-billion-parameter Mixture-of-Experts model designed for efficient long-document parsing. The model utilizes Reference Sliding Window Attention (R-SWA) to maintain a constant KV cach…
RESEARCH · CL_56512 · May 27 · 05:16

ABot-OCR model transcribes pages directly to Markdown

Researchers have introduced ABot-OCR, a novel end-to-end vision-language model designed for direct transcription of page images into Markdown. This approach bypasses the need for complex modular systems by processing th…
RESEARCH · CL_40912 · May 19 · 13:58

New method enhances VLM document layout understanding

Researchers have developed a new method to improve how Vision-Language Models (VLMs) understand document layouts, particularly for documents with structures not seen during training. The approach pre-resolves layout inf…
TOOL · CL_26975 · May 8 · 09:30

New PureDocBench benchmark reveals document parsing is far from solved

Researchers have introduced PureDocBench, a new benchmark for document parsing that addresses issues with the existing OmniDocBench dataset, which suffers from annotation errors and potential contamination. PureDocBench…
RESEARCH · CL_14088 · May 1 · 04:30

RTPrune boosts DeepSeek-OCR inference speed by 1.23x with novel token pruning

Researchers have developed RTPrune, a novel two-stage token pruning method designed to enhance the efficiency of DeepSeek-OCR inference. This method mimics the model's two-stage reading process, first prioritizing high-…