Back to Search Start Over

A Clustering Algorithm for Multi-Modal Heterogeneous Big Data With Abnormal Data

Authors :
An Yan
Wei Wang
Yi Ren
HongWei Geng
Source :
Frontiers in Neurorobotics, Vol 15 (2021)
Publication Year :
2021
Publisher :
Frontiers Media S.A., 2021.

Abstract

The problems of data abnormalities and missing data are puzzling the traditional multi-modal heterogeneous big data clustering. In order to solve this issue, a multi-view heterogeneous big data clustering algorithm based on improved Kmeans clustering is established in this paper. At first, for the big data which involve heterogeneous data, based on multi view data analyzing, we propose an advanced Kmeans algorithm on the base of multi view heterogeneous system to determine the similarity detection metrics. Then, a BP neural network method is used to predict the missing attribute values, complete the missing data and restore the big data structure in heterogeneous state. Last, we ulteriorly propose a data denoising algorithm to denoise the abnormal data. Based on the above methods, we construct a framework namely BPK-means to resolve the problems of data abnormalities and missing data. Our solution approach is evaluated through rigorous performance evaluation study. Compared with the original algorithm, both theoretical verification and experimental results show that the accuracy of the proposed method is greatly improved.

Details

Language :
English
ISSN :
16625218
Volume :
15
Database :
Directory of Open Access Journals
Journal :
Frontiers in Neurorobotics
Publication Type :
Academic Journal
Accession number :
edsdoj.857fd826cfc2481ca5b1b92c88bfca0f
Document Type :
article
Full Text :
https://doi.org/10.3389/fnbot.2021.680613