Back to Search
Start Over
Coarse-to-fine document localization in natural scene image with regional attention and recursive corner refinement
- Source :
- International Journal on Document Analysis and Recognition; 20240101, Issue: Preprints p1-10, 10p
- Publication Year :
- 2024
-
Abstract
- Document localization is a promising step for document-based optical character recognition. This task gains difficulty when documents are located in complex natural scene images. In this paper, we propose a coarse-to-fine document localization approach to detect the four corner points of the document in natural scene images. In the first stage, the four corners are roughly predicted through a deep neural networks-based Joint Corner Detector (JCD) with an attention mechanism, which roughly localize the document region via an attentional map. As a key to produce accurate inference of corners, the JCD module suppresses the interference from background in convolutional features substantially. In the second stage, a corner-specific refiner module is designed to refine the previously predicted corners. Considering the different characteristics of the four document corners, the patches cropped around the predicted corners are input into four different corner-specified CNN models, to search the accurate corner locations recursively. Three datasets (ICDAR 2015 SmartDoc competition 1 dataset, SEECS-NUSF dataset and a self-collected dataset) are used to evaluate the performance of our method. The experimental results demonstrate the superiority of the proposed method in localizing the document in natural images, especially in those with complex background. Compared with the state-of-the-art works, our method outperforms most of them.
Details
- Language :
- English
- ISSN :
- 14332833 and 14332825
- Issue :
- Preprints
- Database :
- Supplemental Index
- Journal :
- International Journal on Document Analysis and Recognition
- Publication Type :
- Periodical
- Accession number :
- ejs50760094
- Full Text :
- https://doi.org/10.1007/s10032-019-00341-0