Back to Search Start Over

An Understanding of the Vulnerability of Datasets to Disparate Membership Inference Attacks.

Authors :
Moore, Hunter D.
Stephens, Andrew
Scherer, William
Source :
Journal of Cybersecurity & Privacy; Dec2022, Vol. 2 Issue 4, p882-906, 25p
Publication Year :
2022

Abstract

Recent efforts have shown that training data is not secured through the generalization and abstraction of algorithms. This vulnerability to the training data has been expressed through membership inference attacks that seek to discover the use of specific records within the training dataset of a model. Additionally, disparate membership inference attacks have been shown to achieve better accuracy compared with their macro attack counterparts. These disparate membership inference attacks use a pragmatic approach to attack individual, more vulnerable sub-sets of the data, such as underrepresented classes. While previous work in this field has explored model vulnerability to these attacks, this effort explores the vulnerability of datasets themselves to disparate membership inference attacks. This is accomplished through the development of a vulnerability-classification model that classifies datasets as vulnerable or secure to these attacks. To develop this model, a vulnerability-classification dataset is developed from over 100 datasets—including frequently cited datasets within the field. These datasets are described using a feature set of over 100 features and assigned labels developed from a combination of various modeling and attack strategies. By averaging the attack accuracy over 13 different modeling and attack strategies, the authors explore the vulnerabilities of the datasets themselves as opposed to a particular modeling or attack effort. The in-class observational distance, width ratio, and the proportion of discrete features are found to dominate the attributes defining dataset vulnerability to disparate membership inference attacks. These features are explored in deeper detail and used to develop exploratory methods for hardening these class-based sub-datasets against attacks showing preliminary mitigation success with combinations of feature reduction and class-balancing strategies. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2624800X
Volume :
2
Issue :
4
Database :
Complementary Index
Journal :
Journal of Cybersecurity & Privacy
Publication Type :
Academic Journal
Accession number :
160976605
Full Text :
https://doi.org/10.3390/jcp2040045