Back to Search
Start Over
Solution to overcome the sparsity issue of annotated data in medical domain
- Source :
- CAAI Transactions on Intelligence Technology (2018)
- Publication Year :
- 2018
- Publisher :
- Wiley, 2018.
-
Abstract
- Annotations are critical for machine learning and developing computer aided diagnosis (CAD) algorithms. Good performance of CAD is critical to their adoption, which generally rely on training with a wide variety of annotated data. However, a vast amount of medical data is either unlabeled or annotated only at the image-level. This poses a problem for exploring data driven approaches like deep learning for CAD. In this paper, we propose a novel crowdsourcing and synthetic image generation for training deep neural net-based lesion detection. The noisy nature of crowdsourced annotations is overcome by assigning a reliability factor for crowd subjects based on their performance and requiring region of interest markings from the crowd. A generative adversarial network-based solution is proposed to generate synthetic images with lesions to control the overall severity level of the disease. We demonstrate the reliability of the crowdsourced annotations and synthetic images by presenting a solution for training the deep neural network (DNN) with data drawn from a heterogeneous mixture of annotations. Experimental results obtained for hard exudate detection from retinal images show that training with refined crowdsourced data/synthetic images is effective as detection performance in terms of sensitivity improves by 25%/27% over training with just expert-markings.
- Subjects :
- learning (artificial intelligence)
image colour analysis
neural nets
image classification
image segmentation
medical image processing
diseases
annotated data
medical domain
machine learning
developing computer
diagnosis algorithms
CAD
good performance
medical data
image level
data-driven approaches
deep learning
data augmentation
popular solution
synthetic image generation
crowdsourced annotations
interest markings
pixel-level markings
generative adversarial network-based solution
severity level
crowdsourced region
synthetically generated data
colour fundus images
processed/refined crowdsourced data/synthetic images
detection performance
Computational linguistics. Natural language processing
P98-98.5
Computer software
QA76.75-76.765
Subjects
Details
- Language :
- English
- ISSN :
- 24682322
- Database :
- Directory of Open Access Journals
- Journal :
- CAAI Transactions on Intelligence Technology
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.2d8cd25af1044bdb2c43cb8494214c0
- Document Type :
- article
- Full Text :
- https://doi.org/10.1049/trit.2018.1010