Researchers have introduced the Urdu Katib Handwritten Dataset (UKHD), the first offline dataset of historical Urdu handwritten text lines. This dataset aims to address the scarcity of resources for Urdu Handwritten Text Recognition (UHTR). The study also evaluated various CRNN-based models, identifying CNN-BGRU-CTC as the most effective for Urdu Katib Handwriting Recognition, achieving low character and word error rates. AI
IMPACT This dataset and model evaluation could spur further development in recognizing historical Urdu script, aiding in the preservation of cultural heritage.
RANK_REASON The cluster describes a new academic dataset and evaluation of models for a specific recognition task, fitting the research bucket.
- arXiv
- CNN-BGRU-CTC
- computer science
- Computer vision and pattern recognition
- Convolutional Recurrent Neural Network
- Nastalique
- Urdu Handwritten Text Recognition
- Urdu Katib Handwritten Dataset
- Character Error Rate
- Urdu Katib Handwriting Recognition
- Word Error Rate
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →