PulseAugur
EN
LIVE 11:02:11
ENTITY OmniDocBench

OmniDocBench

PulseAugur coverage of OmniDocBench — every cluster mentioning OmniDocBench across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
7
7 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
4
4 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL
  1. SIGNIFICANT · CL_114231 ·

    Baidu releases Unlimited OCR, challenging long-context AI memory mechanisms · 1 source tracked

    Baidu has open-sourced a new OCR model called Unlimited OCR, which excels at processing long documents by mimicking human reading habits. Unlike traditional OCR systems that process documents page by page and then stitc…

  2. TOOL · CL_108999 ·

    Open-source OCR models and benchmarks consolidated on Papers with Code

    A new resource has been created to track open-source optical character recognition (OCR) models, consolidating information on top-performing models, benchmarks, and links to their papers and code. This initiative highli…

  3. FRONTIER RELEASE · CL_103597 ·

    Baidu releases Unlimited OCR with constant KV cache for long documents

    Baidu has released Unlimited OCR, a 3-billion-parameter Mixture-of-Experts model designed for efficient long-document parsing. The model utilizes Reference Sliding Window Attention (R-SWA) to maintain a constant KV cach…

  4. RESEARCH · CL_56512 ·

    ABot-OCR model transcribes pages directly to Markdown

    Researchers have introduced ABot-OCR, a novel end-to-end vision-language model designed for direct transcription of page images into Markdown. This approach bypasses the need for complex modular systems by processing th…

  5. RESEARCH · CL_40912 ·

    New method enhances VLM document layout understanding

    Researchers have developed a new method to improve how Vision-Language Models (VLMs) understand document layouts, particularly for documents with structures not seen during training. The approach pre-resolves layout inf…

  6. TOOL · CL_26975 ·

    New PureDocBench benchmark reveals document parsing is far from solved

    Researchers have introduced PureDocBench, a new benchmark for document parsing that addresses issues with the existing OmniDocBench dataset, which suffers from annotation errors and potential contamination. PureDocBench…

  7. RESEARCH · CL_14088 ·

    RTPrune boosts DeepSeek-OCR inference speed by 1.23x with novel token pruning

    Researchers have developed RTPrune, a novel two-stage token pruning method designed to enhance the efficiency of DeepSeek-OCR inference. This method mimics the model's two-stage reading process, first prioritizing high-…