Analysis in case-control sequencing association studies with different sequencing depths.

Authors :
Chen S
Lin X
Source :
Biostatistics (Oxford, England) [Biostatistics] 2020 Jul 01; Vol. 21 (3), pp. 577-593.
Publication Year :
2020

Abstract

With the advent of next-generation sequencing, investigators have access to higher quality sequencing data. However, to sequence all samples in a study using next-generation sequencing can still be prohibitively expensive. One potential remedy could be to combine next-generation sequencing data from cases with publicly available sequencing data for controls, but there could be a systematic difference in quality of sequenced data, such as sequencing depths, between sequenced study cases and publicly available controls. We propose a regression calibration (RC)-based method and a maximum-likelihood method for conducting an association study with such a combined sample by accounting for differential sequencing errors between cases and controls. The methods allow for adjusting for covariates, such as population stratification, as confounders. Both methods control type I error and have comparable power to analysis conducted using the true genotype with sufficiently high but different sequencing depths. We show that the RC method allows for analysis using naive variance estimate (closely approximates true variance in practice) and standard software under certain circumstances. We evaluate the performance of the proposed methods using simulation studies and apply our methods to a combined data set of exome sequenced acute lung injury cases and healthy controls from the 1000 Genomes project. (© The Author 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
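
The abstract does not spell out the RC estimator, so the following is only a rough illustration of the general regression calibration idea: replace the unobserved true genotype with its conditional expectation given the observed reads, then use that value as the covariate in an ordinary regression. The binomial read model, the Hardy-Weinberg prior, the fixed per-read error rate, and the function names `genotype_posterior` and `expected_genotype` are all assumptions made for this sketch and are not taken from the paper.

```python
from math import comb


def genotype_posterior(alt_reads, depth, error_rate=0.01, maf=0.05):
    """Posterior P(G = 0, 1, 2 | reads) under an assumed binomial read model.

    Assumptions (not from the paper): reads are independent, each read is
    miscalled with probability `error_rate`, and the genotype prior follows
    Hardy-Weinberg proportions with minor allele frequency `maf`.
    """
    # Hardy-Weinberg prior for 0, 1, or 2 copies of the alternate allele.
    prior = [(1 - maf) ** 2, 2 * maf * (1 - maf), maf ** 2]
    # Probability that a single read shows the alternate allele, per genotype.
    alt_prob = [error_rate, 0.5, 1 - error_rate]
    weights = [
        pr * comb(depth, alt_reads) * p ** alt_reads * (1 - p) ** (depth - alt_reads)
        for pr, p in zip(prior, alt_prob)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def expected_genotype(alt_reads, depth, error_rate=0.01, maf=0.05):
    """Calibrated genotype E[G | reads], usable as a regression covariate."""
    post = genotype_posterior(alt_reads, depth, error_rate, maf)
    return post[1] + 2 * post[2]


# At high depth the calibrated value sits close to an integer genotype;
# at low depth it is pulled toward the prior mean 2 * maf.
high_depth_het = expected_genotype(alt_reads=15, depth=30)
low_depth_call = expected_genotype(alt_reads=1, depth=2)
no_reads = expected_genotype(alt_reads=0, depth=0)
```

With no reads at all, the calibrated value falls back to the prior mean `2 * maf`, which shows why samples sequenced at different depths carry different amounts of information about the same genotype. In a case-control setting one would presumably compute such values separately for each group, with depth and error parameters appropriate to that group, before fitting the association model.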

Details

Language :
English
ISSN :
1468-4357
Volume :
21
Issue :
3
Database :
MEDLINE
Journal :
Biostatistics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
30590456
Full Text :
https://doi.org/10.1093/biostatistics/kxy073