Back to Search
Start Over
Correlated Differential Privacy: Feature Selection in Machine Learning
- Source :
- IEEE Transactions on Industrial Informatics. 16:2115-2124
- Publication Year :
- 2020
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2020.
-
Abstract
- Privacy preserving in machine learning is a crucial issue in industry informatics since data used for training in industries usually contain sensitive information. Existing differentially private machine learning algorithms have not considered the impact of data correlation, which may lead to more privacy leakage than expected in industrial applications. For example, data collected for traffic monitoring may contain some correlated records due to temporal correlation or user correlation. To fill this gap, we propose a correlation reduction scheme with differentially private feature selection considering the issue of privacy loss when data have correlation in machine learning tasks. %The key to the proposed scheme is to describe the data correlation and select features which leads to less data correlation across the whole dataset. The proposed scheme involves five steps with the goal of managing the extent of data correlation, preserving the privacy, and supporting accuracy in the prediction results. In this way, the impact of data correlation is relieved with the proposed feature selection scheme, and moreover, the privacy issue of data correlation in learning is guaranteed. The proposed method can be widely used in machine learning algorithms which provide services in industrial areas. Experiments show that the proposed scheme can produce better prediction results with machine learning tasks and fewer mean square errors for data queries compared to existing schemes.<br />Comment: This paper has been published in IEEE Transactions on Industrial Informatics
- Subjects :
- FOS: Computer and information sciences
Scheme (programming language)
Electrical & Electronic Engineering
Computer Science - Machine Learning
Computer Science - Cryptography and Security
Computer science
Feature extraction
Feature selection
02 engineering and technology
Machine learning
computer.software_genre
Machine Learning (cs.LG)
Reduction (complexity)
0202 electrical engineering, electronic engineering, information engineering
Differential privacy
Sensitivity (control systems)
Electrical and Electronic Engineering
computer.programming_language
business.industry
020208 electrical & electronic engineering
Computer Science Applications
Information sensitivity
Control and Systems Engineering
Artificial intelligence
business
Cryptography and Security (cs.CR)
computer
Information Systems
Subjects
Details
- ISSN :
- 19410050 and 15513203
- Volume :
- 16
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Industrial Informatics
- Accession number :
- edsair.doi.dedup.....0f00023109380e25acae70ee8921dba2
- Full Text :
- https://doi.org/10.1109/tii.2019.2936825