A tutorial on generative adversarial networks with application to classification of imbalanced data.
- Source :
- Statistical Analysis & Data Mining; Oct2022, Vol. 15 Issue 5, p543-552, 10p
- Publication Year :
- 2022
Abstract
- A challenge unique to classification model development is imbalanced data. In a binary classification problem, class imbalance occurs when one class, the minority group, contains significantly fewer samples than the other class, the majority group. In imbalanced data, the minority class is often the class of interest (e.g., patients with disease). However, when training a classifier on imbalanced data, the model will exhibit bias towards the majority class and, in extreme cases, may ignore the minority class completely. A common strategy for addressing class imbalance is data augmentation. However, traditional data augmentation methods are associated with overfitting, where the model is fit to the noise in the data. In this tutorial we introduce an advanced method for data augmentation: generative adversarial networks (GANs). The advantages of GANs over traditional data augmentation methods are illustrated using the Breast Cancer Wisconsin study. To promote the adoption of GANs for data augmentation, we present an end‐to‐end pipeline that encompasses the complete life cycle of a machine learning project along with alternatives and good practices both in the paper and in a separate video. Our code, data, full results and video tutorial are publicly available in the paper's GitHub repository (https://github.com/yuxiaohuang/research/tree/master/gwu/accepted/sam_2021). [ABSTRACT FROM AUTHOR]
- Subjects :
- GENERATIVE adversarial networks
- DATA augmentation
- MACHINE learning
Details
- Language :
- English
- ISSN :
- 19321864
- Volume :
- 15
- Issue :
- 5
- Database :
- Complementary Index
- Journal :
- Statistical Analysis & Data Mining
- Publication Type :
- Academic Journal
- Accession number :
- 158880271
- Full Text :
- https://doi.org/10.1002/sam.11570