Back to Search
Start Over
Data Mining Algorithm for Cloud Network Information Based on Artificial Intelligence Decision Mechanism
- Source :
- IEEE Access, Vol 8, Pp 53394-53407 (2020)
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- Due to the rapid development of information technology and network technology, there is a lot of data, but the phenomenon of lack of knowledge is becoming more and more serious. Data mining technology has developed vigorously in this environment, and it has shown more and more vitality. Based on Spark programming model, this paper designs the parallel extension of fuzzy c-means. In order to enhance the performance of fuzzy c-means parallel expansion, the improvement strategy of k-means during the initialization phase is borrowed, and k-means// is extended to fuzzy c-means to obtain better clustering performance. Combined with Spark’s programming model, this paper can obtain extended parallel fuzzy c-means algorithm. Several experiments on the data set of the algorithm proposed in this paper have shown good scalability and parallelism, effectively expanding fuzzy c-means clustering to distributed applications, greatly increasing the scale of the data processed by the algorithm. This improves the robustness of the algorithm and the adaptability of the algorithm to the shape and structure of the data, so that the parallel and scalable clustering algorithm can more effectively perform cluster analysis on big data. Three algorithms were simulated on MATLAB platform. We use simple data sets and complex two-dimensional data sets, and compare with the traditional fuzzy c-means algorithm and fuzzy c-means algorithm based on fuzzy entropy. Experiments show that the scalable parallel fuzzy c-means algorithm not only greatly improves the anti-noise performance, but also improves the convergence speed, and it can automatically determine the optimal number of clusters.
- Subjects :
- Artificial intelligence
General Computer Science
Distributed database
Computer science
business.industry
Big data
cloud computing
General Engineering
Initialization
scalable parallel fuzzy c-means
data mining
computer.software_genre
Fuzzy logic
Data set
Robustness (computer science)
Scalability
Programming paradigm
General Materials Science
Data mining
lcsh:Electrical engineering. Electronics. Nuclear engineering
Cluster analysis
business
computer
lcsh:TK1-9971
cluster analysis
Subjects
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....f8b422d03e4e3ceb84f61ea16841f8c5