Back to Search Start Over

Convolutional Neural Networks for Automated Classification of Prostate Multiparametric Magnetic Resonance Imaging Based on Image Quality

Authors :
Lorenzo Farina
Stefano Cipollari
Emanuele Messina
Valerio Guarrasi
Paola Paci
Carlo Catalano
Martina Pecoraro
Valeria Panebianco
Marco Bicchetti
Source :
Journal of Magnetic Resonance Imaging. 55:480-490
Publication Year :
2021
Publisher :
Wiley, 2021.

Abstract

BACKGROUND Prostate magnetic resonance imaging (MRI) is technically demanding, requiring high image quality to reach its full diagnostic potential. An automated method to identify diagnostically inadequate images could help optimize image quality. PURPOSE To develop a convolutional neural networks (CNNs) based analysis pipeline for the classification of prostate MRI image quality. STUDY TYPE Retrospective. SUBJECTS Three hundred sixteen prostate mpMRI scans and 312 men (median age 67). FIELD STRENGTH/SEQUENCE A 3 T; fast spin echo T2WI, echo planar imaging DWI, ADC, gradient-echo dynamic contrast enhanced (DCE). ASSESSMENT MRI scans were reviewed by three genitourinary radiologists (V.P., M.D.M., S.C.) with 21, 12, and 5 years of experience, respectively. Sequences were labeled as high quality (Q1) or low quality (Q0) and used as the reference standard for all analyses. STATISTICAL TESTS Sequences were split into training, validation, and testing sets (869, 250, and 120 sequences, respectively). Inter-reader agreement was assessed with the Fleiss kappa. Following preprocessing and data augmentation, 28 CNNs were trained on MRI slices for each sequence. Model performance was assessed on both a per-slice and a per-sequence basis. A pairwise t-test was performed to compare performances of the classifiers. RESULTS The number of sequences labeled as Q0 or Q1 was 38 vs. 278 for T2WI, 43 vs. 273 for DWI, 41 vs. 275 for ADC, and 38 vs. 253 for DCE. Inter-reader agreement was almost perfect for T2WI and DCE and substantial for DWI and ADC. On the per-slice analysis, accuracy was 89.95% ± 0.02% for T2WI, 79.83% ± 0.04% for DWI, 76.64% ± 0.04% for ADC, 96.62% ± 0.01% for DCE. On the per-sequence analysis, accuracy was 100% ± 0.00% for T2WI, DWI, and DCE, and 92.31% ± 0.00% for ADC. The three best algorithms performed significantly better than the remaining ones on every sequence (P-value

Details

ISSN :
15222586 and 10531807
Volume :
55
Database :
OpenAIRE
Journal :
Journal of Magnetic Resonance Imaging
Accession number :
edsair.doi.dedup.....ba0aad52773a5a7c4049f8950fbb38b5