Back to Search Start Over

Dual Query: Practical Private Query Release for High Dimensional Data

Authors :
Emilio Jesús Gallego Arias
Zhiwei Steven Wu
Marco Gaboardi
Justin Hsu
Aaron Roth
University at Buffalo [SUNY] (SUNY Buffalo)
State University of New York (SUNY)
Centre de Recherche en Informatique (CRI)
MINES ParisTech - École nationale supérieure des mines de Paris
Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)
Université Paris sciences et lettres (PSL)
University of Pennsylvania [Philadelphia]
Source :
Journal of Privacy and Confidentiality, Journal of Privacy and Confidentiality, 2016, 7 (Issue 2, Article N°4), pp.53-77, The Journal of Privacy and Confidentiality, Vol 7, Iss 2 (2017)
Publication Year :
2014
Publisher :
arXiv, 2014.

Abstract

International audience; We present a practical, differentially private algorithm for answering a large number of queries on high dimensional datasets. Like all algorithms for this task, ours necessarily has worst-case complexity exponential in the dimension of the data. However, our algorithm packages the computationally hard step into a concisely defined integer program, which can be solved non-privately using standard solvers. We prove accuracy and privacy theorems for our algorithm, and then demonstrate experimentally that our algorithm performs well in practice. For example , our algorithm can efficiently and accurately answer millions of queries on the Netflix dataset, which has over 17,000 attributes; this is an improvement on the state of the art by multiple orders of magnitude.

Details

Database :
OpenAIRE
Journal :
Journal of Privacy and Confidentiality, Journal of Privacy and Confidentiality, 2016, 7 (Issue 2, Article N°4), pp.53-77, The Journal of Privacy and Confidentiality, Vol 7, Iss 2 (2017)
Accession number :
edsair.doi.dedup.....91dae54ff8fb58e1096cb9e9fe3fcfe4
Full Text :
https://doi.org/10.48550/arxiv.1402.1526