PulseAugur
EN
LIVE 10:04:14

Orli model streamlines text line detection and ordering

Researchers have developed Orli, an end-to-end model for text line detection and ordering in historical documents. Orli treats the entire process as a single image-to-sequence problem, directly generating text lines in reading order from a page image. Trained on a large, diverse corpus, Orli achieves state-of-the-art performance on line detection and reading order benchmarks, even without dataset-specific training, and can adapt to specialized layouts with fine-tuning. The model's code and weights are publicly available. AI

IMPACT This model could improve automated analysis of historical documents by handling complex layouts and marginalia more effectively.

RANK_REASON The cluster contains a new academic paper detailing a novel model and its performance. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Benjamin Kiessling (ALMAnaCH) ·

    End-to-End Text Line Detection and Ordering

    arXiv:2606.04166v1 Announce Type: new Abstract: Practical text-recognition pipelines for historical documents typically decompose layout analysis into line detection followed by a separate reading-order step, with the latter most often handled by a hand-coded geometric heuristic …