1. Feature and Subfeature Selection for Classification Using Correlation Coefficient and Fuzzy Model
- Author
-
Subhendu Kumar Pani, Vinayakumar Ravi, Hemanta Kumar Bhuyan, and Chinmay Chakraborty
- Subjects
Correlation coefficient ,business.industry ,Computer science ,Strategy and Management ,Value (computer science) ,Pattern recognition ,Fuzzy logic ,Class (biology) ,Data extraction ,Feature (machine learning) ,Data analysis ,Artificial intelligence ,Electrical and Electronic Engineering ,Gradient descent ,business - Abstract
This article presents an analysis of data extraction for classification using correlation coefficient and fuzzy model. Several traditional methods of data extraction are used for classification that could not provide sufficient information for further step of data analysis on class. It needs refinement of features data to distinguish a class that differs from a traditional class. Thus, it proposes the feature tiny data (subfeature data) to find distinguish class from a traditional class using two methods such as correlation coefficient and fuzzy model to select features as well as subfeature for distinguishing class. In the first approach, the correlation coefficient methods with gradient descent technique are used to select features from the dataset and in the second approach, the fuzzy model with supreme of minimum value is considered to get subfeature data. As per the proposed model, some features (i.e., three features from the acoustic dataset, two features from the QCM dataset, and eight features from the audit dataset, etc.) and subfeatures (as per threshold value like 20 for acoustic; 10 for QCM, and 20 for audit, etc.) are selected based on correlation coefficient as well as fuzzy methods, respectively. Further, the probability approach is used to find the association and availability of subfeature data from the dimensional reduced database. The experimental results show the proposed framework identifies and selects both feature and subfeature data with the effectiveness of the new class. The comparison results of several classifiers on several datasets are explained in the experimental section.
- Published
- 2023