Back to Search Start Over

Self-Learning for Weakly Supervised Gleason Grading of Local Patterns

Authors :
Jose Dolz
Valery Naranjo
Julio Silva-Rodríguez
Adrián Colomer
Source :
RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia, instname
Publication Year :
2021
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2021.

Abstract

[EN] Prostate cancer is one of the main diseases affecting men worldwide. The gold standard for diagnosis and prognosis is the Gleason grading system. In this process, pathologists manually analyze prostate histology slides under microscope, in a high time-consuming and subjective task. In the last years, computer-aided-diagnosis (CAD) systems have emerged as a promising tool that could support pathologists in the daily clinical practice. Nevertheless, these systems are usually trained using tedious and prone-to-error pixel-level annotations of Gleason grades in the tissue. To alleviate the need of manual pixel-wise labeling, just a handful of works have been presented in the literature. Furthermore, despite the promising results achieved on global scoring the location of cancerous patterns in the tissue is only qualitatively addressed. These heatmaps of tumor regions, however, are crucial to the reliability of CAD systems as they provide explainability to the system's output and give confidence to pathologists that the model is focusing on medical relevant features. Motivated by this, we propose a novel weakly-supervised deeplearning model, based on self-learning CNNs, that leverages only the global Gleason score of gigapixel whole slide images during training to accurately perform both, grading of patch-level patterns and biopsy-level scoring. To evaluate the performance of the proposed method, we perform extensive experiments on three different external datasets for the patch-level Gleason grading, and on two different test sets for global Grade Group prediction. We empirically demonstrate that our approach outperforms its supervised counterpart on patch-level Gleason grading by a large margin, as well as state-of-the-art methods on global biopsylevel scoring. Particularly, the proposed model brings an average improvement on the Cohen's quadratic kappa (kappa) score of nearly 18% compared to full-supervision for the patch-level Gleason grading task. This suggests that the absence of the annotator's bias in our approach and the capability of using large weakly labeled datasets during training leads to higher performing and more robust models. Furthermore, raw features obtained from the patchlevel classifier showed to generalize better than previous approaches in the literature to the subjective global biopsylevel scoring.<br />This work was supported by the Spanish Ministry of Economy and Competitiveness through Projects DPI2016-77869 and PID2019-105142RB-C21.

Details

ISSN :
21682208 and 21682194
Volume :
25
Database :
OpenAIRE
Journal :
IEEE Journal of Biomedical and Health Informatics
Accession number :
edsair.doi.dedup.....51cbe01b9858f9d35773aa0ba9bba8f2
Full Text :
https://doi.org/10.1109/jbhi.2021.3061457