Back to Search Start Over

An Expert-driven Computer-aided Classification for Database Construction: Its Impact to Predict Polyp Sub-types via Computed Tomographic Colonography

Authors :
Marc J. Pomeroy
Zhengrong Liang
Weiguo Cao
Yongfeng Gao
Luhao Wang
Kenneth Ng
Source :
2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC).
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Data preparation for any machine learning process is of utmost importance to produce consistent and robust results. The challenges faced with identifying the colon polyps in computed tomographic colonography (CTC) images are that some polyps do not have a definite outline, some are coated by oral tag material due to poor preparation, and the intra-variation when multiple contributors work on the same dataset. This study aims to relieve the challenge by an iterative user-driven procedure, which starts by an expert to draw the initial borders of the colon polyps (or VOIs), followed by using computer aided classification (CAC) on an adequately grouped VOIs to find outliers. Then the expert examines the outliers for refining the VOIs and the CAC is repeated on the refined VOIs. This iterative procedure is repeated until a threshold is satisfied. The expert-driven CAC procedure was validated by experiments using three datasets. One small dataset containing 87 large polyp masses, and two large datasets containing 726 and 563 polyp masses varying in size from medium and small. Of the dataset with 87 polyps, 63 VOIs were constructed previously by three experts as the baseline, including 31 benign and 32 malignant. The remaining 24 (12 benign and 12 malignant) were added after going through the expert-driven CAC procedure (i.e. only one expert). The two large datasets had multiple contributors and each dataset could be split into several subgroups and cross validated using the highest performing subgroup as the baseline. The cross validation was performed using the grey-level co-occurrence measures of the VOIs, two-fold validation, and random forest classifier. The AUC score on the large polyp dataset remained the same as that of the baseline when the 24 new VOIs were added using the expert-driven CAC procedure, while varied by 4% if the procedure was not used. The AUC score on the medium and small polyp datasets had nominal increases up to 2% after expert-driven CAC procedure. Upon further examination on the-up-to 2% variation, the causes include flat small polyps and small polyps being submerged and/or surrounded by oral tagging materials. These causes of up to 2% variation are CTC data specific and acceptable. In conclusion, expert-driven CAC is important for large database construction. Key words: ML, Database, Colon polyps, CTC, CAC.

Details

Database :
OpenAIRE
Journal :
2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)
Accession number :
edsair.doi...........d3be2067191c38e7d25bac12e4443576