Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study

Authors :: Zech, John R.
Badgeley, Marcus A.
Liu, Manway
Costa, Anthony B.
Titano, Joseph J.
Oermann, Eric Karl
Source :: PLoS Medicine. November 6, 2018, Vol. 15 Issue 11, e1002683
Publication Year :: 2018
Abstract:

Background: There is interest in using convolutional neural networks (CNNs) to analyze medical imaging to provide computer-aided diagnosis (CAD). Recent work has suggested that image classification CNNs may not generalize to new data as well as previously believed. We assessed how well CNNs generalized across three hospital systems for a simulated pneumonia screening task.

Methods and findings: A cross-sectional design with multiple model training cohorts was used to evaluate model generalizability to external sites using split-sample validation. A total of 158,323 chest radiographs were drawn from three institutions: National Institutes of Health Clinical Center (NIH; 112,120 from 30,805 patients), Mount Sinai Hospital (MSH; 42,396 from 12,904 patients), and Indiana University Network for Patient Care (IU; 3,807 from 3,683 patients). These patient populations had a mean (SD) age of 46.9 years (16.6), 63.2 years (16.5), and 49.6 years (17), with a female percentage of 43.5%, 44.8%, and 57.3%, respectively. We assessed individual models using the area under the receiver operating characteristic curve (AUC) for radiographic findings consistent with pneumonia and compared performance on different test sets with DeLong's test. The prevalence of pneumonia was high enough at MSH (34.2%) relative to NIH and IU (1.2% and 1.0%) that merely sorting by hospital system achieved an AUC of 0.861 (95% CI 0.855-0.866) on the joint MSH-NIH dataset. Models trained on data from either NIH or MSH had equivalent performance on IU (P values 0.580 and 0.273, respectively) and inferior performance on data from each other relative to an internal test set (i.e., new data from within the hospital system used for training data; P values both [...]

Conclusion: Pneumonia-screening CNNs achieved better internal than external performance in 3 out of 5 natural comparisons. When models were trained on pooled data from sites with different pneumonia prevalence, they performed better on new pooled data from these sites but not on external data. CNNs robustly identified hospital system and department within a hospital, which can have large differences in disease burden and may confound predictions.
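The abstract's remark that merely sorting by hospital system achieved an AUC of 0.861 on the joint MSH-NIH dataset can be illustrated with a short calculation. The sketch below is not the authors' code: the function name `site_only_auc` is hypothetical, and the case counts are approximated from the rounded prevalences and full-dataset sizes quoted in the abstract, so it is only expected to land near the reported figure, not to reproduce it. It scores every MSH radiograph as positive and every NIH radiograph as negative, then applies the standard single-threshold identity AUC = (1 + TPR - FPR) / 2.

```python
def site_only_auc(n_high, prev_high, n_low, prev_low):
    """AUC of a 'classifier' that predicts pneumonia purely from the site.

    Every image from the high-prevalence site is scored 1 and every image
    from the low-prevalence site is scored 0. Counts are estimated from
    prevalence, so they are approximate (an assumption of this sketch).
    """
    pos_high = n_high * prev_high      # pneumonia cases at the high-prevalence site
    neg_high = n_high - pos_high       # non-cases at the high-prevalence site
    pos_low = n_low * prev_low         # pneumonia cases at the low-prevalence site
    neg_low = n_low - pos_low          # non-cases at the low-prevalence site

    tpr = pos_high / (pos_high + pos_low)   # share of all cases flagged positive
    fpr = neg_high / (neg_high + neg_low)   # share of all non-cases flagged positive

    # ROC curve of a binary score passes through (0,0), (fpr,tpr), (1,1);
    # the area under that piecewise-linear curve is (1 + tpr - fpr) / 2.
    return 0.5 * (1.0 + tpr - fpr)


# Figures quoted in the abstract: MSH 42,396 images at 34.2% prevalence,
# NIH 112,120 images at 1.2% prevalence.
auc = site_only_auc(42_396, 0.342, 112_120, 0.012)
print(f"site-only AUC = {auc:.3f}")
```

With these rounded inputs the sketch gives about 0.857, close to but not identical to the reported 0.861, which was presumably computed from exact case counts. The point it illustrates is the one the abstract makes: a large prevalence gap between sites lets site identity alone act as a strong predictor.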

Subjects :: Health screening -- Usage
Pneumonia -- Diagnosis -- Risk factors
Artificial neural networks -- Usage
Melanocyte stimulating hormone
Natural language processing
Chest x-rays
Health
Neural network
Biological sciences

Details

Language :: English
ISSN :: 1549-1277
Volume :: 15
Issue :: 11
Database :: Gale General OneFile
Journal :: PLoS Medicine
Publication Type :: Academic Journal
Accession number :: edsgcl.564080810
Full Text :: https://doi.org/10.1371/journal.pmed.1002683

