Back to Search Start Over

Systematically missing data in distributed data networks: multiple imputation when data cannot be pooled.

Authors :
Thiesmeier, Robert
Bottai, Matteo
Orsini, Nicola
Source :
Journal of Statistical Computation & Simulation. Nov2024, Vol. 94 Issue 17, p3807-3825. 19p.
Publication Year :
2024

Abstract

Systematically missing data in distributed data networks presents practical and methodological challenges. Failure to handle it appropriately can bias statistical inference. Multiple imputations can be used to address systematic missingness. However, when data from different study sites cannot be pooled into a unified file, conventional imputation approaches become unavailable due to the absence of a basis for imputation. To address such challenges, we introduce an imputation method based on conditional quantiles – conditional quantile imputation (CQI) – which involves four steps: (i) estimating 99 quantiles for the systematically missing variable in studies with observed data; (ii) deriving a weighted average of regression coefficients across studies and transmitting it to sites with systematically missing data; (iii) imputing the systematically missing values based on observed data and the set of regression coefficients from step ii; and (iv) combining estimates of the substantive outcome model across imputations using Rubin's rules. We evaluate CQI in different simulation scenarios and illustrate it with an applied data example. We conclude that CQI can be a suitable approach for the imputation of systematically missing data when data from multiple studies cannot be pooled. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00949655
Volume :
94
Issue :
17
Database :
Academic Search Index
Journal :
Journal of Statistical Computation & Simulation
Publication Type :
Academic Journal
Accession number :
181054276
Full Text :
https://doi.org/10.1080/00949655.2024.2404220