GAN-Based Approaches for Generating Structured Data in the Medical Domain

Authors :
Masoud Abedi
Lars Hempel
Sina Sadeghi
Toralf Kirsten
Source :
Applied Sciences, Vol 12, Iss 14, p 7075 (2022)
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

Modern machine learning and deep learning methods require large datasets to achieve reliable and robust results. This requirement is often difficult to meet in the medical field, owing to data-sharing limitations imposed by privacy regulations or to small patient populations (e.g., rare diseases). To address this data scarcity, novel generative models such as Generative Adversarial Networks (GANs) have been widely used to generate synthetic data that mimic real data, representing features that reflect health-related information without reference to real patients. In this paper, we consider several GAN models for generating synthetic data used to train binary (malignant/benign) classifiers, and compare their classification accuracy with that of classifiers trained on real data only. We aim to investigate how synthetic data can improve classification accuracy, especially when only a small amount of data is available. To this end, we have developed and implemented an evaluation framework in which binary classifiers are trained on extended datasets containing both real and synthetic data. The results show improved accuracy for classifiers trained with data generated by more advanced GAN models, even when only limited amounts of original data are available.

Details

Language :
English
ISSN :
2076-3417
Volume :
12
Issue :
14
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.fb40cebab07e41b184fe8d3a03bdca48
Document Type :
article
Full Text :
https://doi.org/10.3390/app12147075