Back to Search Start Over

Remote access methods for exploratory data analysis and statistical modelling: Privacy-Preserving Analytics®

Authors :
Sparks, Ross
Carter, Chris
Donnelly, John B.
O’Keefe, Christine M.
Duncan, Jodie
Keighley, Tim
McAullay, Damien
Source :
Computer Methods & Programs in Biomedicine. Sep2008, Vol. 91 Issue 3, p208-222. 15p.
Publication Year :
2008

Abstract

Abstract: This paper is concerned with the challenge of enabling the use of confidential or private data for research and policy analysis, while protecting confidentiality and privacy by reducing the risk of disclosure of sensitive information. Traditional solutions to the problem of reducing disclosure risk include releasing de-identified data and modifying data before release. In this paper we discuss the alternative approach of using a remote analysis server which does not enable any data release, but instead is designed to deliver useful results of user-specified statistical analyses with a low risk of disclosure. The techniques described in this paper enable a user to conduct a wide range of methods in exploratory data analysis, regression and survival analysis, while at the same time reducing the risk that the user can read or infer any individual record attribute value. We illustrate our methods with examples from biostatistics using publicly available data. We have implemented our techniques into a software demonstrator called Privacy-Preserving Analytics®(PPA®), via a web-based interface to the R software. We believe that PPA® may provide an effective balance between the competing goals of providing useful information and reducing disclosure risk in some situations. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
01692607
Volume :
91
Issue :
3
Database :
Academic Search Index
Journal :
Computer Methods & Programs in Biomedicine
Publication Type :
Academic Journal
Accession number :
33135400
Full Text :
https://doi.org/10.1016/j.cmpb.2008.04.001