1. Extracting 2D weak labels from volume labels using multiple instance learning in CT hemorrhage detection
- Author
-
Zihao Wu, Mayur B. Patel, Snehashis Roy, Camilo Bermudez, Samuel Remedios, John A. Butman, Cailey I. Kerley, Dzung L. Pham, and Bennett A. Landman
- Subjects
FOS: Computer and information sciences ,Artificial neural network ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Deep learning ,0206 medical engineering ,Supervised learning ,Computer Science - Computer Vision and Pattern Recognition ,Pattern recognition ,Context (language use) ,02 engineering and technology ,computer.software_genre ,020601 biomedical engineering ,Convolutional neural network ,Article ,Voxel ,0202 electrical engineering, electronic engineering, information engineering ,Medical imaging ,020201 artificial intelligence & image processing ,Segmentation ,Artificial intelligence ,business ,computer - Abstract
Multiple instance learning (MIL) is a supervised learning methodology that aims to allow models to learn instance class labels from bag class labels, where a bag is defined to contain multiple instances. MIL is gaining traction for learning from weak labels but has not been widely applied to 3D medical imaging. MIL is well-suited to clinical CT acquisitions since (1) the highly anisotropic voxels hinder application of traditional 3D networks and (2) patch-based networks have limited ability to learn whole volume labels. In this work, we apply MIL with a deep convolutional neural network to identify whether clinical CT head image volumes possess one or more large hemorrhages (> 20cm$^3$), resulting in a learned 2D model without the need for 2D slice annotations. Individual image volumes are considered separate bags, and the slices in each volume are instances. Such a framework sets the stage for incorporating information obtained in clinical reports to help train a 2D segmentation approach. Within this context, we evaluate the data requirements to enable generalization of MIL by varying the amount of training data. Our results show that a training size of at least 400 patient image volumes was needed to achieve accurate per-slice hemorrhage detection. Over a five-fold cross-validation, the leading model, which made use of the maximum number of training volumes, had an average true positive rate of 98.10%, an average true negative rate of 99.36%, and an average precision of 0.9698. The models have been made available along with source code to enabled continued exploration and adaption of MIL in CT neuroimaging.
- Published
- 2020
- Full Text
- View/download PDF