1. A sandbox study proposal for private and distributed health data analysis
- Author
-
Brännvall, Rickard, Svensson, Hanna, Kaliyaperumal, Kannaki, Burden, Håkan, and Stenberg, Susanne
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Computers and Society ,Computer Science - Distributed, Parallel, and Cluster Computing ,68M14 (Primary) 92C60, 68P25, 68P20 (Secondary) ,K.4.1 ,J.3.2 ,H.2.8 ,D.4.6 - Abstract
This paper presents a sandbox study proposal focused on the distributed processing of personal health data within the Vinnova-funded SARDIN project. The project aims to develop the Health Data Bank (H\"alsodatabanken in Swedish), a secure platform for research and innovation that complies with the European Health Data Space (EHDS) legislation. By minimizing the sharing and storage of personal data, the platform sends analysis tasks directly to the original data locations, avoiding centralization. This approach raises questions about data controller responsibilities in distributed environments and the anonymization status of aggregated statistical results. The study explores federated analysis, secure multi-party aggregation, and differential privacy techniques, informed by real-world examples from clinical research on Parkinson's disease, stroke rehabilitation, and wound analysis. To validate the proposed study, numerical experiments were conducted using four open-source datasets to assess the feasibility and effectiveness of the proposed methods. The results support the methods for the proposed sandbox study by demonstrating that differential privacy in combination with secure aggregation techniques significantly improves the privacy-utility trade-off., Comment: 20 pages, 5 figures, 4 tables
- Published
- 2025