Researchers have developed Orli, an end-to-end model for text line detection and ordering in historical documents. Orli treats the entire process as a single image-to-sequence problem, directly generating text lines in reading order from a page image. Trained on a large, diverse corpus, Orli achieves state-of-the-art performance on line detection and reading order benchmarks, even without dataset-specific training, and can adapt to specialized layouts with fine-tuning. The model's code and weights are publicly available. AI
影响 This model could improve automated analysis of historical documents by handling complex layouts and marginalia more effectively.
排序理由 The cluster contains a new academic paper detailing a novel model and its performance. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →