Start Over

Handwritten word recognition using Web resources and recurrent neural networks

Authors :: Laurence Likforman-Sulem
Chafic Mokbel
Cristina Oprean
Adrian Popescu
Laboratoire Traitement et Communication de l'Information (LTCI)
Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Département Intelligence Ambiante et Systèmes Interactifs (DIASI)
Laboratoire d'Intégration des Systèmes et des Technologies (LIST)
Direction de Recherche Technologique (CEA) (DRT (CEA))
Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Direction de Recherche Technologique (CEA) (DRT (CEA))
Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay
University of Balamand - UOB (LIBAN)
Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA))
University of Balamand [Liban] (UOB)
Source :: International Journal on Document Analysis and Recognition, International Journal on Document Analysis and Recognition, 2015, 18 (4), pp.287-301. ⟨10.1007/s10032-015-0251-1⟩
Publication Year :: 2015
Publisher :: HAL CCSD, 2015.
Abstract: International audience; Handwriting recognition systems usually rely on static dictionaries and language models. Full coverage of these dictionaries is generally not achieved when dealing with unrestricted document corpora due to the presence of Out-Of-Vocabulary (OOV) words. We propose an approach which uses the World Wide Web as a corpus to improve dictionary coverage. We exploit the very large and freely available Wikipedia corpus in order to obtain dynamic dictionaries on the fly. We rely on recurrent neural network (RNN) recognizers, with and without linguistic resources, to detect words that are non-reliably recognized within a word sequence. Such words are labeled as non-anchor words (NAWs) and include OOVs and In-Vocabulary words recognized with low confidence. To recognize a non-anchor word, a dynamic dictionary is built by selecting words from the Web resource based on their string similarity with the NAW image, and their linguistic relevance in the NAW context. Similarity is evaluated by computing the edit distance between the sequence of characters generated by the RNN recognizer exploited as a filler model, and the Wikipedia words. Linguistic relevance is based on an N-gram language model estimated from the Wikipedia corpus. Experiments conducted on aword-segmented version of the publicly available RIMES database show that the proposed approach can improve recognition accuracy compared to systems based on static dictionaries only. The proposed approach shows even better behavior as the proportion of OOVs increases, in terms of both accuracy and dictionary coverage.

Details

Language :: English
Database :: OpenAIRE
Journal :: International Journal on Document Analysis and Recognition, International Journal on Document Analysis and Recognition, 2015, 18 (4), pp.287-301. ⟨10.1007/s10032-015-0251-1⟩
Accession number :: edsair.doi.dedup.....76cc1aa2dbbfb2852f99b22acb9e590e
Full Text :: https://doi.org/10.1007/s10032-015-0251-1⟩