Back to Search
Start Over
A framework for utility enhanced incomplete microdata anonymization
- Source :
- Cluster Computing. 20:1749-1764
- Publication Year :
- 2017
- Publisher :
- Springer Science and Business Media LLC, 2017.
-
Abstract
- Incomplete microdata, i.e., microdata with missing value, is very common in real-world datasets. However, existing anonymization techniques, which were developed for complete datasets, suffer from serious information loss on incomplete microdata, due to the missing value pollution. In this paper, we propose a framework for utility enhanced anonymization of incomplete microdata to address this issue. First, we study the properties of missing value pollution on generalization. Guided by these properties, we develop two top-down anonymization algorithms to preserve data utility on incomplete microdata. Extensive experiments on real-world datasets show that our techniques outperform the state-of-the-art techniques in terms of information loss and missing value pollution.
- Subjects :
- Data anonymization
Computer Networks and Communications
Computer science
02 engineering and technology
k-anonymity
computer.software_genre
020204 information systems
Microdata (HTML)
0202 electrical engineering, electronic engineering, information engineering
ComputingMilieux_COMPUTERSANDSOCIETY
020201 artificial intelligence & image processing
Data mining
computer
Software
Subjects
Details
- ISSN :
- 15737543 and 13867857
- Volume :
- 20
- Database :
- OpenAIRE
- Journal :
- Cluster Computing
- Accession number :
- edsair.doi...........38a77ed088aa9c2684b2098238196dd3
- Full Text :
- https://doi.org/10.1007/s10586-017-0795-6