Back to Search
Start Over
Fed-MIWAE: Federated Imputation of Incomplete Data via Deep Generative Models
- Publication Year :
- 2023
- Publisher :
- HAL CCSD, 2023.
-
Abstract
- Federated learning allows for the training of machine learn- ing models on multiple decentralized local datasets without requiring explicit data exchange. However, data pre-processing, including strate- gies for handling missing data, remains a major bottleneck in real-world federated learning deployment, and is typically performed locally. This approach may be biased, since the subpopulations locally observed at each center may not be representative of the overall one. To address this issue, this paper first proposes a more consistent approach to data stan- dardization through a federated model. Additionally, we propose Fed- MIWAE, a federated version of the state-of-the-art imputation method MIWAE, a deep latent variable model for missing data imputation based on variational autoencoders. MIWAE has the great advantage of being easily trainable with classical federated aggregators. Furthermore, it is able to deal with MAR (Missing At Random) data, a more challenging missing-data mechanism than MCAR (Missing Completely At Random), where the missingness of a variable can depend on the observed ones. We evaluate our method on multi-modal medical imaging data and clinical scores from a simulated federated scenario with the ADNI dataset. We compare Fed-MIWAE with respect to classical imputation methods, ei- ther performed locally or in a centralized fashion. Fed-MIWAE allows to achieve imputation accuracy comparable with the best centralized method, even when local data distributions are highly heterogeneous. In addition, thanks to the variational nature of Fed-MIWAE, our method is designed to perform multiple imputation, allowing for the quantification of the imputation uncertainty in the federated scenario.
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Deep Learning
Statistics - Machine Learning
Missing data
Federated pre-processing
Federated learning
Machine Learning (stat.ML)
[STAT.OT]Statistics [stat]/Other Statistics [stat.ML]
Variational autoencoders
Machine Learning (cs.LG)
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....a23b7730ca31364d21d69fdb0648cb89