Back to Search
Start Over
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers
- Source :
- ICPR
- Publication Year :
- 2020
-
Abstract
- One important and particularly challenging step in the optical character recognition (OCR) of historical documents with complex layouts, such as newspapers, is the separation of text from non-text content (e.g. page borders or illustrations). This step is commonly referred to as page segmentation. While various rule-based algorithms have been proposed, the applicability of Deep Neural Networks (DNNs) for this task recently has gained a lot of attention. In this paper, we perform a systematic evaluation of 11 different published DNN backbone architectures and 9 different tiling and scaling configurations for separating text, tables or table column lines. We also show the influence of the number of labels and the number of training pages on the segmentation quality, which we measure using the Matthews Correlation Coefficient. Our results show that (depending on the task) Inception-ResNet-v2 and EfficientNet backbones work best, vertical tiling is generally preferable to other tiling approaches, and training data that comprises 30 to 40 pages will be sufficient most of the time.<br />Evaluation of deep neural networks for the segmentation of pages of historical newspapers; 21 pages total (incl. references and appendix), 7 figures, 5 tables
- Subjects :
- FOS: Computer and information sciences
Information retrieval
Computer science
business.industry
Computer Vision and Pattern Recognition (cs.CV)
I.4.6
Computer Science - Computer Vision and Pattern Recognition
02 engineering and technology
Optical character recognition
010501 environmental sciences
computer.software_genre
Matthews correlation coefficient
01 natural sciences
Newspaper
Task (computing)
Column (typography)
0202 electrical engineering, electronic engineering, information engineering
Table (database)
020201 artificial intelligence & image processing
Segmentation
Artificial intelligence
business
Transfer of learning
computer
0105 earth and related environmental sciences
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- ICPR
- Accession number :
- edsair.doi.dedup.....8ca4c68f3384adf0d0ae39368e52451c