251. Abstract 186: Gleason grade group predictions from mp-MRI of prostate cancer patients using auto deep learning
- Author
-
Zhen Sun, Ali Dabaja, Ning Wen, Aharon M Feldman, Joon Sang Lee, Lanyu Xu, Weiwei Zong, and Eric Carver
- Subjects
Cancer Research ,business.industry ,Feature extraction ,Gold standard (test) ,medicine.disease ,Prostate cancer ,medicine.anatomical_structure ,Oncology ,Binary classification ,Prostate ,Feature (computer vision) ,Region of interest ,Medicine ,business ,Precision and recall ,Nuclear medicine - Abstract
Gleason Grade Group Predictions from mp-MRI of Prostate Cancer Patients using Automated Deep Learning Though histopathology remains the gold standard, there have been significant interests in predicting Gleason Grade using noninvasive imaging such as mp-MRI. Most studies simplify the task into binary classification for the high uncertainty at each group. Handcrafted radiomic features were heavily investigated but prone to errors from the definition of region of interest, feature extraction variations, etc. We proposed an automated deep learning framework (auto-Keras) to predict the group directly based on the 3D data of the whole prostate gland. The training cohort A consisted of 96 PCa patients from SPIE-AAPM-NCI Challenge. The number of patients in each Group was 30, 35, 18, 7, and 6. The testing cohort B consisted of 34 PCa patients from our institute (10, 14, 4, 3, 3). We resampled and rigidly registered ADC and T2WI. N4-bias correction was applied to correct the non-uniformity. For each slice, we performed Gaussian blurring followed by prostate cropping from contour delineated by two clinicians.We tested five scenarios, including input of T2WI, ADC, both, two parallel inputs followed by feature concatenation, and prediction ensemble. The search space of augmentation included translation, flip, rotation, zooming, and contrast. The search space of the architectures had vanilla, ResNet, and Xception. With ADC alone, the model detected 75% of patients in Group 3. Using T2WI and ADC as input, 46% of Group 2 and 40% of Group 1 were identified. Since GG 2 is less aggressive and has a favorable outcome, we further studied the performance of classifying 1 VS. 2-5 and 1-2 VS. 3-5. The models' precision and recall were 91% and 72% for 1-2, 60% and 24% for 3-5. We separated 1 VS. 2-5, with a 96% precision and 73% recall for 2-5. The model had a better performance to predict lower GG when the input contained both T2WI and AD, and better at higher GG when the features were concatenated at the output level. Table 1.Performance of Precision and recall for Gleason Grade Group on the testing cohort.1 VS. 2 VS. 3 VS. 4 VS. 51-2 VS. 3-51 VS. 2-5ADC-onlyGroup 1Group 2Group 3Group 4Group 5Group 1-2Group 3-5Group 1Group 2-5Precision0.100.230.75000.300.500.300.50Recall0.250.380.14--0.580.240.580.24Input MergePrecision0.400.460.25000.910.200.400.61Recall0.310.380.25--0.720.500.310.70Feature MergePrecision0.200.080.250.3300.130.600.200.96Recall0.670.250.250.0600.430.230.670.73PredictionEnsemblePrecision0.200.080.750.3300.170.600.200.91Recall0.500.250.200.10-0.500.240.500.72 Citation Format: Weiwei Zong, Eric Carver, Aharon Feldman, Joon Lee, Zhen Sun, Lanyu Xu, Ali Dabaja, Ning Wen. Gleason grade group predictions from mp-MRI of prostate cancer patients using auto deep learning [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 186.
- Published
- 2021