Back to Search
Start Over
Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems
- Source :
- Frontiers in Digital Health, Vol 3 (2021), Frontiers in Digital Health
- Publication Year :
- 2021
-
Abstract
- Artificial intelligence (AI) has been successful at solving numerous problems in machine perception. In radiology, AI systems are rapidly evolving and show progress in guiding treatment decisions, diagnosing, localizing disease on medical images, and improving radiologists' efficiency. A critical component to deploying AI in radiology is to gain confidence in a developed system's efficacy and safety. The current gold standard approach is to conduct an analytical validation of performance on a generalization dataset from one or more institutions, followed by a clinical validation study of the system's efficacy during deployment. Clinical validation studies are time-consuming, and best practices dictate limited re-use of analytical validation data, so it is ideal to know ahead of time if a system is likely to fail analytical or clinical validation. In this paper, we describe a series of sanity tests to identify when a system performs well on development data for the wrong reasons. We illustrate the sanity tests' value by designing a deep learning system to classify pancreatic cancer seen in computed tomography scans.
- Subjects :
- FOS: Computer and information sciences
medicine.medical_specialty
Computer Science - Machine Learning
bias
Generalization
Computer science
Best practice
Computer Vision and Pattern Recognition (cs.CV)
Biomedical Engineering
Computer Science - Computer Vision and Pattern Recognition
Medicine (miscellaneous)
Health Informatics
Machine Learning (stat.ML)
Machine perception
030218 nuclear medicine & medical imaging
Machine Learning (cs.LG)
03 medical and health sciences
0302 clinical medicine
Statistics - Machine Learning
Component (UML)
medicine
FOS: Electrical engineering, electronic engineering, information engineering
Spurious relationship
Original Research
validation
business.industry
Deep learning
Image and Video Processing (eess.IV)
deep learning
computed tomography
Gold standard (test)
QA75.5-76.95
Electrical Engineering and Systems Science - Image and Video Processing
artificial intelligence
3. Good health
Computer Science Applications
ComputingMethodologies_PATTERNRECOGNITION
Software deployment
030220 oncology & carcinogenesis
Electronic computers. Computer science
Digital Health
Medicine
Artificial intelligence
Radiology
spurious correlations
Public aspects of medicine
RA1-1270
business
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Frontiers in Digital Health, Vol 3 (2021), Frontiers in Digital Health
- Accession number :
- edsair.doi.dedup.....b24b09e4a327662b956ef0c2d43f2355