Back to Search
Start Over
A differential privacy protecting K-means clustering algorithm based on contour coefficients
- Source :
- PLoS ONE, PLoS ONE, Vol 13, Iss 11, p e0206832 (2018)
- Publication Year :
- 2018
- Publisher :
- Public Library of Science, 2018.
-
Abstract
- This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time.
- Subjects :
- Computer and Information Sciences
Computer science
Physiology
lcsh:Medicine
02 engineering and technology
computer.software_genre
Research and Analysis Methods
Machine Learning
Machine Learning Algorithms
Clustering Algorithms
Mathematical and Statistical Techniques
Artificial Intelligence
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Medicine and Health Sciences
Differential privacy
Cluster Analysis
Data Mining
Point (geometry)
Cluster analysis
lcsh:Science
Computer Science::Cryptography and Security
Multidisciplinary
Data Processing
Applied Mathematics
Simulation and Modeling
lcsh:R
k-means clustering
Biology and Life Sciences
Probability Theory
Probability Distribution
Body Fluids
Noise
Blood
Privacy
Physical Sciences
Probability distribution
020201 artificial intelligence & image processing
Algorithm design
lcsh:Q
Data mining
K Means Clustering
Anatomy
Information Technology
computer
Mathematics
Algorithms
Research Article
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 13
- Issue :
- 11
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....f618cb3cf4ac8c908da492f21de9cf68