PulseAugur
EN
LIVE 09:46:43

New dataset targets text detection in historical astronomical diagrams

Researchers have introduced a new dataset for text detection in historical astronomical diagrams, addressing a gap in document analysis. The dataset, comprising 948 diagrams from the 8th to 18th centuries, features over 10,000 annotated text regions with precise polygonal delineations and reading direction encoding. Several baseline models were evaluated, with Poly-DETR, an extension of DINO-DETR, showing strong performance on existing benchmarks and serving as a solid baseline for this new dataset. The dataset and code are publicly available. AI

RANK_REASON The cluster describes a new academic paper introducing a dataset and evaluating baseline models for a specific computer vision task. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Zeynep Sonat Baltac{\i}, Rapha\"el Baena, Fei Meng, Somk\'eo Norindr, Florence Somer, Matthieu Husson, Mathieu Aubry ·

    Text region detection in historical astronomical diagrams

    arXiv:2606.15886v1 Announce Type: new Abstract: Text detection is a crucial task in the analysis of historical documents. While datasets and benchmarks exist for text detection in manuscripts and maps, the study of text in mathematical diagrams has received little attention. To a…