1. A new optical density granulometry-based descriptor for the classification of prostate histological images using shallow and deep Gaussian processes
- Author
-
Valery Naranjo, Adrián Colomer, María A. Sales, Rafael Molina, Miguel López-Pérez, and Ángel E. Esteban
- Subjects
Male ,Databases, Factual ,Granulometries ,Normal Distribution ,Color ,Health Informatics ,Variational Inference ,Deep Gaussian Processes ,Optical density ,030218 nuclear medicine & medical imaging ,Machine Learning ,03 medical and health sciences ,symbols.namesake ,0302 clinical medicine ,TEORIA DE LA SEÑAL Y COMUNICACIONES ,Image Processing, Computer-Assisted ,Humans ,False Positive Reactions ,Diagnosis, Computer-Assisted ,Gaussian Processes ,Gaussian process ,Remote sensing ,Probability ,Prostate cancer ,Prostate ,Prostatic Neoplasms ,Hospitals ,Computer Science Applications ,ROC Curve ,Granulometry ,Area Under Curve ,symbols ,Titan (rocket family) ,Histopathological Images ,030217 neurology & neurosurgery ,Software ,Geology ,Algorithms - Abstract
[EN] Background and objective Prostate cancer is one of the most common male tumors. The increasing use of whole slide digital scanners has led to an enormous interest in the application of machine learning techniques to histopathological image classification. Here we introduce a novel family of morphological descriptors which, extracted in the appropriate image space and combined with shallow and deep Gaussian process based classifiers, improves early prostate cancer diagnosis. Method We decompose the acquired RGB image in its RGB and optical density hematoxylin and eosin components. Then, we define two novel granulometry-based descriptors which work in both, RGB and optical density, spaces but perform better when used on the latter. In this space they clearly encapsulate knowledge used by pathologists to identify cancer lesions. The obtained features become the inputs to shallow and deep Gaussian process classifiers which achieve an accurate prediction of cancer. Results We have used a real and unique dataset. The dataset is composed of 60 Whole Slide Images. For a five fold cross validation, shallow and deep Gaussian Processes obtain area under ROC curve values higher than 0.98. They outperform current state of the art patch based shallow classifiers and are very competitive to the best performing deep learning method. Models were also compared on 17 Whole Slide test Images using the FROC curve. With the cost of one false positive, the best performing method, the one layer Gaussian process, identifies 83.87% (sensitivity) of all annotated cancer in the Whole Slide Image. This result corroborates the quality of the extracted features, no more than a layer is needed to achieve excellent generalization results. Conclusion Two new descriptors to extract morphological features from histological images have been proposed. They collect very relevant information for cancer detection. From these descriptors, shallow and deep Gaussian Processes are capable of extracting the complex structure of prostate histological images. The new space/descriptor/classifier paradigm outperforms state-of-art shallow classifiers. Furthermore, despite being much simpler, it is competitive to state-of-art CNN architectures both on the proposed SICAPv1 database and on an external database, This work was supported by the Ministerio de Economia y Competitividad through project DPI2016-77869. The Titan V used for this research was donated by the NVIDIA Corporation
- Published
- 2019