Start Over

A Classification Method for Incomplete Mixed Data Using Imputation and Feature Selection

Authors :: Gengsong Li
Qibin Zheng
Yi Liu
Xiang Li
Wei Qin
Xingchun Diao
Source :: Applied Sciences, Vol 14, Iss 14, p 5993 (2024)
Publication Year :: 2024
Publisher :: MDPI AG, 2024.
Abstract: Data missing is a ubiquitous problem in real-world systems that adversely affects the performance of machine learning algorithms. Although many useful imputation methods are available to address this issue, they often fail to consider the information provided by both features and labels. As a result, the performance of these methods might be constrained. Furthermore, feature selection as a data quality improvement technique has been widely used and has demonstrated its efficiency. To overcome the limitation of imputation methods, we propose a novel algorithm that combines data imputation and feature selection to tackle classification problems for mixed data. Based on the mean and standard deviation of quantitative features and the selecting probabilities of unique values of categorical features, our algorithm constructs different imputation models for quantitative and categorical features. Particle swarm optimization is used to optimize the parameters of the imputation models and select feature subsets simultaneously. Additionally, we introduce a legacy learning mechanism to enhance the optimization capability of our method. To evaluate the performance of the proposed method, seven algorithms and twelve datasets are used for comparison. The results show that our algorithm outperforms other algorithms in terms of accuracy and F1 score and has reasonable time overhead.

Subjects :: data imputation
feature selection
particle swarm optimization
legacy learning mechanism
mixed data
classification
Technology
Engineering (General). Civil engineering (General)
TA1-2040
Biology (General)
QH301-705.5
Physics
QC1-999
Chemistry
QD1-999

Details

Language :: English
ISSN :: 20763417
Volume :: 14
Issue :: 14
Database :: Directory of Open Access Journals
Journal :: Applied Sciences
Publication Type :: Academic Journal
Accession number :: edsdoj.639c71d778a04baeac4f5c10d2dc70c8
Document Type :: article
Full Text :: https://doi.org/10.3390/app14145993

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

A Classification Method for Incomplete Mixed Data Using Imputation and Feature Selection

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

A Classification Method for Incomplete Mixed Data Using Imputation and Feature Selection

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources