1. External validation of a convolutional neural network artificial intelligence tool to predict malignancy in pulmonary nodules
- Author
-
Paul Holland, Victoria Ashford-Turner, Alan Exell, Hazel Spence, Vaclav Potesil, P. Novotny, Jennifer Gustafson, Timor Kadir, Catarina Figueiras, David R Baldwin, Fergus V. Gleeson, Albert Sterba, E. O'Dowd, Carlos Arteta, Matthew Clark, Matthew E.J. Callister, L. Pickup, Alison Clubley, and Jerome Declerck
- Subjects
Adult ,Male ,Pulmonary and Respiratory Medicine ,Lung Neoplasms ,Databases, Factual ,Risk of malignancy ,Malignancy ,Risk Assessment ,Convolutional neural network ,030218 nuclear medicine & medical imaging ,Cohort Studies ,03 medical and health sciences ,0302 clinical medicine ,Artificial Intelligence ,Predictive Value of Tests ,medicine ,Humans ,Neoplasm Invasiveness ,Lung cancer ,non-small cell lung cancer ,Early Detection of Cancer ,Aged ,Neoplasm Staging ,Retrospective Studies ,business.industry ,Incidence ,Lung Cancer ,External validation ,Area under the curve ,Cancer ,Nodule (medicine) ,Middle Aged ,Prognosis ,medicine.disease ,Cell Transformation, Neoplastic ,ROC Curve ,Area Under Curve ,030220 oncology & carcinogenesis ,CT imaging ,Multiple Pulmonary Nodules ,Female ,Neural Networks, Computer ,Artificial intelligence ,medicine.symptom ,business ,Algorithms - Abstract
BackgroundEstimation of the risk of malignancy in pulmonary nodules detected by CT is central in clinical management. The use of artificial intelligence (AI) offers an opportunity to improve risk prediction. Here we compare the performance of an AI algorithm, the lung cancer prediction convolutional neural network (LCP-CNN), with that of the Brock University model, recommended in UK guidelines.MethodsA dataset of incidentally detected pulmonary nodules measuring 5–15 mm was collected retrospectively from three UK hospitals for use in a validation study. Ground truth diagnosis for each nodule was based on histology (required for any cancer), resolution, stability or (for pulmonary lymph nodes only) expert opinion. There were 1397 nodules in 1187 patients, of which 234 nodules in 229 (19.3%) patients were cancer. Model discrimination and performance statistics at predefined score thresholds were compared between the Brock model and the LCP-CNN.ResultsThe area under the curve for LCP-CNN was 89.6% (95% CI 87.6 to 91.5), compared with 86.8% (95% CI 84.3 to 89.1) for the Brock model (p≤0.005). Using the LCP-CNN, we found that 24.5% of nodules scored below the lowest cancer nodule score, compared with 10.9% using the Brock score. Using the predefined thresholds, we found that the LCP-CNN gave one false negative (0.4% of cancers), whereas the Brock model gave six (2.5%), while specificity statistics were similar between the two models.ConclusionThe LCP-CNN score has better discrimination and allows a larger proportion of benign nodules to be identified without missing cancers than the Brock model. This has the potential to substantially reduce the proportion of surveillance CT scans required and thus save significant resources.
- Published
- 2020
- Full Text
- View/download PDF