Back to Search Start Over

Structure identification of missing data: a perspective from granular computing.

Authors :
Shen, Yinghua
Zhao, Dan
Hu, Xingchen
Pedrycz, Witold
Chen, Yuan
Li, Jiliang
Xiao, Zhi
Source :
Soft Computing - A Fusion of Foundations, Methodologies & Applications. Jan2024, p1-19.
Publication Year :
2024

Abstract

Missing data are frequently encountered in reality, which inevitably poses great challenges to data mining techniques devoted to data structure identification. In view of the fact that missing data generally exhibits high uncertainty, this paper first introduces the concept of information granule, and performs granular imputation on missing data in a more abstract and inclusive way. With the tolerant nature of the information granule to uncertainty, the error of data imputation and the adverse effects on the subsequent research caused by the low-quality data can be largely reduced. Second, the initial data structure (including granular cluster centers and numeric partition matrix) of the data set with missing values is identified by performing fuzzy clustering on the mixed data set (including both numeric values and information granules) formed by imputation. Third, the bounds of granular cluster centers are further optimized by using the principle of justifiable granularity, and a more robust and reliable granular partition matrix is formed subsequently. Finally, by constructing a reconstruction criterion for mixed data, clustering performance and the optimization of some critical parameters (e.g., the cluster number) used in the proposed method could be investigated. This paper conducts comprehensive experimental studies on both synthetic and publicly available data sets to show the feasibility and effectiveness of the proposed data structure exploration method. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14327643
Database :
Academic Search Index
Journal :
Soft Computing - A Fusion of Foundations, Methodologies & Applications
Publication Type :
Academic Journal
Accession number :
174586446
Full Text :
https://doi.org/10.1007/s00500-023-09523-9