Recurrent Convolutional Neural Network MSER-Based Approach for Payable Document Processing.

Authors :
Aladhadh, Suliman
Ur Rehman, Hidayat
Qamar, Ali Mustafa
Khan, Rehan Ullah
Source :
Computers, Materials & Continua; 2021, Vol. 69 Issue 3, p3399-3411, 13p
Publication Year :
2021

Abstract

Corporations generate a tremendous volume of vendor invoices. Automating manual data entry for payable documents requires highly accurate Optical Character Recognition (OCR). This paper proposes an end-to-end OCR system that performs both localization and recognition as a single unit to automate the processing of payable documents such as cheques and cash disbursements. For text localization, maximally stable extremal regions (MSER) are used to extract word or digit chunks from an invoice. Each chunk is then passed to a deep learning model that performs text recognition. The model combines a convolutional neural network (CNN) with long short-term memory (LSTM): the convolutional layers extract features, which are fed to the LSTM. The model integrates feature extraction, sequence modeling, and transcription into a unified network. It handles sequences of unconstrained length without requiring character segmentation or horizontal scale normalization. Furthermore, it applies to both lexicon-free and lexicon-based text recognition, and it yields a comparatively small model that can be deployed in practical applications. The superior overall performance in the experimental evaluation demonstrates the usefulness of the proposed model. The model is generic and can be used in other similar recognition scenarios. [ABSTRACT FROM AUTHOR]
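
Illustrative note (not from the paper): the abstract does not say how the transcription stage turns per-frame LSTM outputs into text of unconstrained length. A common choice in CNN plus LSTM recognizers is CTC-style greedy decoding, and the sketch below assumes that choice. The names `ctc_greedy_decode` and `BLANK`, the blank index of 0, and the digit alphabet are all hypothetical and serve only to show how a sequence can be read out without character segmentation.

```python
from itertools import groupby

BLANK = 0  # assumption: class index 0 is reserved for the CTC "blank" symbol


def ctc_greedy_decode(frame_scores, alphabet):
    """Turn per-frame class scores into a string (assumed CTC-style readout).

    frame_scores: list of per-time-step score lists, one score per class,
                  where class 0 is the blank and class k maps to alphabet[k - 1].
    """
    # Take the best-scoring class at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    # Merge consecutive repeats, then drop blanks.
    collapsed = [idx for idx, _ in groupby(best_path) if idx != BLANK]
    return "".join(alphabet[idx - 1] for idx in collapsed)


if __name__ == "__main__":
    digits = "0123456789"

    def one_hot(i, n=len(digits) + 1):
        # Toy stand-in for a row of network outputs.
        return [1.0 if j == i else 0.0 for j in range(n)]

    # Frame-wise best classes: 2, 2, blank, 2, 6, blank
    frames = [one_hot(i) for i in (2, 2, 0, 2, 6, 0)]
    print(ctc_greedy_decode(frames, digits))  # prints "115"
```

The blank between the two runs of class 2 is what lets a repeated character ("11") survive the merge step, so the output length is not tied to the number of input frames.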

Details

Language :
English
ISSN :
15462218
Volume :
69
Issue :
3
Database :
Complementary Index
Journal :
Computers, Materials & Continua
Publication Type :
Academic Journal
Accession number :
152050657
Full Text :
https://doi.org/10.32604/cmc.2021.018724