Back to Search Start Over

TabCellNet: Deep learning-based tabular cell structure detection.

Authors :
Jiang, JiChu
Simsek, Murat
Kantarci, Burak
Khan, Shahzad
Source :
Neurocomputing. Jun2021, Vol. 440, p12-23. 12p.
Publication Year :
2021

Abstract

There is an increasing demand for automated document processing techniques as the volume of electronic component documents increase. This is most prevalent in the supply chain optimization sector where vast amount of documents need to be processed and is time consuming and prone to error. Detection of tables and table structures serves as a crucial step to automate document processing. While table detection is a well investigated problem, tabular structure detection is more complex, and requires further improvements. To address this, this study proposes a deep learning model that focuses on high precision tabular cell structure detection. The proposed model creates a benchmark for the ICDAR2013 dataset cell structure with comparison to the previous state of the art table detection models as well as proposing alternative models. Our methodology approaches improving table structure detection through the detection of cells instead of row and columns for better generalization capabilities for heterogeneous table structures. Our proposed model advances prior models by improving major parts of the detection pipeline, mainly the two-stage detector, backbone, backbone architecture, and non-maximum-suppression (NMS). TabCellNet consists of Hybrid Task Cascade (HTC) with Combinational Backbone Network (CBNet), dual ResNeXt101 and Soft-NMS to achieve a precision of 89.2% and recall of 98.7% on the hand annotated ICDAR2013 cell structure dataset. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
440
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
149919656
Full Text :
https://doi.org/10.1016/j.neucom.2021.01.103