Back to Search Start Over

Combination of ADASYN and random forest for classification of imbalanced lung cancer dataset.

Authors :
Pulungan, Annisa Fadhillah
Selvida, Desilia
Silitonga, Agnes Irene
Source :
AIP Conference Proceedings. 2024, Vol. 2987 Issue 1, p1-6. 6p.
Publication Year :
2024

Abstract

One of the main causes of death in the world is caused by cancer. And one of them is Lung Cancer. According to World Health Organization, in 2014 the death rate caused by lung cancer in Indonesia was 21.8% in men and 9.1% in women with 30,865 cases of death caused by lung cancer each year in men and women. Many studies have been carried out on computational lung cancer. One of them is by implementing machine learning in detecting lung cancer. However, there are obstacles, namely the imbalance in the amount of data between patients and non-patients. So it takes an approach to overcome this imbalance. One of these methods is ADASYN which is then combined with the Random Forest classification algorithm. In this study, we will compare the results of the classification model performance in Random Forest before and after the ADASYN sampling method was used in the training process. The results of this study showed an increase in the performance of the Random Forest classification model with an AUC value of 0.859 after the ADASYN sampling method was carried out with an error rate of 4.9%. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0094243X
Volume :
2987
Issue :
1
Database :
Academic Search Index
Journal :
AIP Conference Proceedings
Publication Type :
Conference
Accession number :
176720891
Full Text :
https://doi.org/10.1063/5.0200590