Back to Search Start Over

Document Image Binarization With Stroke Boundary Feature Guided Network

Authors :
Quang-Vinh Dang
Guee-Sang Lee
Source :
IEEE Access, Vol 9, Pp 36924-36936 (2021)
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

Text is the most crucial element in a document image but is often disconnected in document image binarization. Most of the previous methods based on deep learning do not focus on structure information such as stroke boundary, leading to disconnected strokes when the stroke is ambiguous or weak. In this paper, we propose a multi-task learning with an auxiliary task for learning stroke boundary features in an adversarial manner. The learned boundary features are integrated into the main task for the binarization. Specifically, in the first step, in addition to using shared global location features with the main task, the auxiliary task leverages additional local edges to obtain stroke boundary features. In the second step, we use adversarial loss based on boundary ground truth to supervise the obtained stroke boundary feature in the auxiliary task. The adversarial training is to embed expert knowledge, especially structure information, in the model. In the third step, the learned boundary feature from the auxiliary task supports the main task directly. The fusion module of the main task refines the final binarized image. Experiments show that our method achieves better-preserved stroke and better performance than existing methods on benchmark H-DBCO and DIBCO datasets.

Details

Language :
English
ISSN :
21693536
Volume :
9
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.059434efc074bc399be6e28e7e929d4
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2021.3062904