Back to Search Start Over

Multi-Objective Feature Selection based on Clustering and Principal Component Analysis by Enhanced Electromagnetic-likes Algorithm

Authors :
Somaye Jalali-Poor
Erna Budhiarti Nababan
Shokooh Pour Mahyabadi
Majid Abdolrazzagh
Source :
2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA).
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Given the rapid growth of data and the reduced implementation quality of data mining and pattern extraction techniques, the use of feature reduction has become an important challenge of data mining and pattern recognition. An important goal of data reduction techniques is to make the minimum effort and achieve the maximum efficiency of data selection for the implementation of data mining process. The two primary objectives of feature selection are to minimize the errors of the patterns identified in the reduced subset and minimize the number of features. The majority of available feature selection algorithms adopts a single-objective approach. This is the first paper focused on clustering used as the identifier of unsupervised hidden patterns. It is also focused on the principal component analysis (PCA) to analyze the values of the features. The goals of the new multi-objective feature selection problem are to minimize the coefficient of PCA, maximize the accuracy of k-medoids clustering, and minimize the number of selected features. Another innovation of this study was to select the best subset of features at the best performance by using the electromagnetism-like mechanism (EM) algorithm. The proposed method was tested on 14 standard UCI datasets. The results indicated the competitive advantage of this algorithm over other algorithms implemented to solve this problem.

Details

Database :
OpenAIRE
Journal :
2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA)
Accession number :
edsair.doi...........93fc754335221713a4a9f4e3117b6b09
Full Text :
https://doi.org/10.1109/databia50434.2020.9190226