Back to Search
Start Over
Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks
- Source :
- Journal of Imaging, Journal of Imaging, 2018, 4 (1), pp.22, Journal of Imaging, Vol 4, Iss 1, p 15 (2018), Journal of Imaging; Volume 4; Issue 1; Pages: 15, RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia, instname
- Publication Year :
- 2018
- Publisher :
- MDPI AG, 2018.
-
Abstract
- [EN] The digitization of historical handwritten document images is important for the preservation of cultural heritage. Moreover, the transcription of text images obtained from digitization is necessary to provide efficient information access to the content of these documents. Handwritten Text Recognition (HTR) has become an important research topic in the areas of image and computational language processing that allows us to obtain transcriptions from text images. State-of-the-art HTR systems are, however, far from perfect. One difficulty is that they have to cope with image noise and handwriting variability. Another difficulty is the presence of a large amount of Out-Of-Vocabulary (OOV) words in ancient historical texts. A solution to this problem is to use external lexical resources, but such resources might be scarce or unavailable given the nature and the age of such documents. This work proposes a solution to avoid this limitation. It consists of associating a powerful optical recognition system that will cope with image noise and variability, with a language model based on sub-lexical units that will model OOV words. Such a language modeling approach reduces the size of the lexicon while increasing the lexicon coverage. Experiments are first conducted on the publicly available Rodrigo dataset, which contains the digitization of an ancient Spanish manuscript, with a recognizer based on Hidden Markov Models (HMMs). They show that sub-lexical units outperform word units in terms of Word Error Rate (WER), Character Error Rate (CER) and OOV word accuracy rate. This approach is then applied to deep net classifiers, namely Bi-directional Long-Short Term Memory (BLSTMs) and Convolutional Recurrent Neural Nets (CRNNs). Results show that CRNNs outperform HMMs and BLSTMs, reaching the lowest WER and CER for this image dataset and significantly improving OOV recognition.<br />Work partially supported by projects READ: Recognition and Enrichment of Archival Documents - 674943 (European Union's H2020) and CoMUN-HaT: Context, Multimodality and User Collaboration in Handwritten Text Processing - TIN2015-70924-C2-1-R (MINECO/FEDER), and a DGA-MRIS (Direction Generale de l'Armement - Mission pour la Recherche et l'Innovation Scientifique) scholarship.
- Subjects :
- Out-of-vocabulary word recognition
Computer science
Information access
Word error rate
02 engineering and technology
computer.software_genre
Lexicon
lcsh:Computer applications to medicine. Medical informatics
lcsh:QA75.5-76.95
Transcription (linguistics)
Handwriting
[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]
0202 electrical engineering, electronic engineering, information engineering
historical handwritten transcription
out-of-vocabulary word recognition
character-level language model
word structure retrieval
Radiology, Nuclear Medicine and imaging
lcsh:Photography
Electrical and Electronic Engineering
Hidden Markov model
Character-level language model
[MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]
Digitization
business.industry
Historical handwritten transcription
020207 software engineering
lcsh:TR1-1050
Computer Graphics and Computer-Aided Design
Word structure retrieval
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
lcsh:R858-859.7
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Language model
Artificial intelligence
lcsh:Electronic computers. Computer science
business
computer
LENGUAJES Y SISTEMAS INFORMATICOS
Natural language processing
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Journal of Imaging, Journal of Imaging, 2018, 4 (1), pp.22, Journal of Imaging, Vol 4, Iss 1, p 15 (2018), Journal of Imaging; Volume 4; Issue 1; Pages: 15, RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia, instname
- Accession number :
- edsair.doi.dedup.....aed7d2b739d54994f58f3e9d2e1ad299
- Full Text :
- https://doi.org/10.3390/jimaging4010015