1. CALText: Contextual Attention Localization for Offline Handwritten Text.
- Author
-
Anjum, Tayaba and Khan, Nazar
- Subjects
CONVOLUTIONAL neural networks ,OPTICAL character recognition ,WORD recognition ,TEXT recognition - Abstract
Recognition of Arabic-like scripts such as Persian and Urdu is more challenging than Latin-based scripts. This is due to the presence of a two-dimensional structure, context-dependent character shapes, spaces and overlaps, and placement of diacritics. We present an attention based encoder-decoder model that learns to read handwritten text in context. A novel localization penalty is introduced to encourage the model to attend only one location at a time when recognizing the next character. In addition, we comprehensively refine the only complete and publicly available handwritten Urdu dataset in terms of ground-truth annotations. We evaluate the model on both Urdu and Arabic datasets. For Urdu, contextual attention localization achieves 82.06 % character recognition rate and 51.97 % word recognition rate which represent more than 2 × improvement over existing bi-directional LSTM models. For Arabic, the model outperforms multi-directional LSTM models with 77.47 % character recognition rate and 37.66 % word recognition rate without performing any slant or skew correction. Code and pre-trained models for this work are available at https://github.com/nazar-khan/CALText. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF