1. Improving offline handwritten text recognition with hybrid HMM/ANN models
- Author
-
Maria Jose Castro-Bleda, Francisco Zamora-Martínez, J. Gorbe-Moya, and Salvador España-Boquera
- Subjects
Handwriting recognition ,Handwriting ,Computer science ,Character recognition ,Speech recognition ,Pattern Recognition, Automated ,Hidden Markov models ,Hidden Markov model ,Image normalization ,Contextual image classification ,Artificial neural network ,Applied Mathematics ,Pattern recognition systems ,Hybrid HMM/ANN ,Markov Chains ,Computational Theory and Mathematics ,Multilayer perceptron ,symbols ,Feedforward neural network ,Computer Vision and Pattern Recognition ,Neural networks ,Electric loads ,Algorithms ,Normalization (statistics) ,Multilayer neural networks ,Computer Science::Neural and Evolutionary Computation ,Markov process ,symbols.namesake ,Artificial Intelligence ,Humans ,Optical multilayers ,HMM ,Electronic Data Processing ,Feedforward neural networks ,Markov chain ,Offline handwriting ,business.industry ,Supervised learning ,Reproducibility of Results ,Pattern recognition ,Image segmentation ,Perceptron ,ComputingMethodologies_PATTERNRECOGNITION ,Multilayers ,Reading ,Computer Science::Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,LENGUAJES Y SISTEMAS INFORMATICOS ,Software - Abstract
This paper proposes the use of hybrid Hidden Markov Model (HMM)/Artificial Neural Network (ANN) models for recognizing unconstrained offline handwritten texts. The structural part of the optical models has been modeled with Markov chains, and a Multilayer Perceptron is used to estimate the emission probabilities. This paper also presents new techniques to remove slope and slant from handwritten text and to normalize the size of text images with supervised learning methods. Slope correction and size normalization are achieved by classifying local extrema of text contours with Multilayer Perceptrons. Slant is also removed in a nonuniform way by using Artificial Neural Networks. Experiments have been conducted on offline handwritten text lines from the IAM database, and the recognition rates achieved, in comparison to the ones reported in the literature, are among the best for the same task. © 2006 IEEE., The authors acknowledge the valuable help provided by Moises Pastor, Juan Miguel Vilar, Alex Graves, and Marcus Liwicki. Thanks are also due to the reviewers and the Editor-in-Chief for their many valuable comments and suggestions. This work has been partially supported by the Spanish Ministerio de Educacion y Ciencia (TIN2006-12767) and by the BPFI 06/250 Scholarship from the Conselleria d'Empresa, Universitat i Ciencia, Generalitat Valenciana.
- Published
- 2010