1. Missing Data Methods in Mendelian Randomization Studies With Multiple Instruments.
- Author
-
Burgess, Stephen, Seaman, Shaun, Lawlor, Debbie A., Casas, Juan P., and Thompson, Simon G.
- Subjects
GENETIC research ,EPIDEMIOLOGY research methodology ,ANALYSIS of variance ,C-reactive protein ,COMPUTER simulation ,CONFIDENCE intervals ,STATISTICAL correlation ,DATABASE evaluation ,EPIDEMIOLOGY ,GENETIC polymorphisms ,HEART diseases ,LONGITUDINAL method ,RESEARCH methodology ,PROBABILITY theory ,RESEARCH funding ,STATISTICAL sampling ,SAMPLE size (Statistics) ,DATA analysis ,SECONDARY analysis - Abstract
Mendelian randomization studies typically have low power. Where there are several valid candidate genetic instruments, precision can be gained by using all the instruments available. However, sporadically missing genetic data can offset this gain. The authors describe 4 Bayesian methods for imputing the missing data based on a missing-at-random assumption: multiple imputations, single nucleotide polymorphism (SNP) imputation, latent variables, and haplotype imputation. These methods are demonstrated in a simulation study and then applied to estimate the causal relation between C-reactive protein and each of fibrinogen and coronary heart disease, based on 3 SNPs in British Women’s Heart and Health Study participants assessed at baseline between May 1999 and June 2000. A complete-case analysis based on all 3 SNPs was found to be more precise than analyses using any 1 SNP alone. Precision is further improved by using any of the 4 proposed missing data methods; the improvement is equivalent to about a 25% increase in sample size. All methods gave similar results, which were apparently not overly sensitive to violation of the missing-at-random assumption. Programming code for the analyses presented is available online. [ABSTRACT FROM PUBLISHER]
- Published
- 2011
- Full Text
- View/download PDF