Urdu Katib Handwritten Dataset: A Historical Document Dataset for Offline Urdu Handwritten Text Recognition with CRNN-Based Baseline Evaluation
Researchers have introduced the Urdu Katib Handwritten Dataset (UKHD), the first offline dataset of historical Urdu handwritten text lines. This dataset aims to address the scarcity of resources for Urdu Handwritten Text Recognition (UHTR). The study also evaluated various CRNN-based models, identifying CNN-BGRU-CTC as the most effective for Urdu Katib Handwriting Recognition, achieving low character and word error rates. AI
IMPACT This dataset and model evaluation could spur further development in recognizing historical Urdu script, aiding in the preservation of cultural heritage.