Back to Search
Start Over
Statistical method for modeling sequencing data from different technologies in longitudinal studies with application to Huntington disease
- Source :
- Biometrical Journal. Biometrische Zeitschrift, Biometrical Journal, 63(4), 745-760. WILEY, Biometrical Journal
- Publication Year :
- 2020
- Publisher :
- WILEY, 2020.
-
Abstract
- Advancement of gene expression measurements in longitudinal studies enables the identification of genes associated with disease severity over time. However, problems arise when the technology used to measure gene expression differs between time points. Observed differences between the results obtained at different time points can be caused by technical differences. Modeling the two measurements jointly over time might provide insight into the causes of these different results. Our work is motivated by a study of gene expression data of blood samples from Huntington disease patients, which were obtained using two different sequencing technologies. At time point 1, DeepSAGE technology was used to measure the gene expression, with a subsample also measured using RNA‐Seq technology. At time point 2, all samples were measured using RNA‐Seq technology. Significant associations between gene expression measured by DeepSAGE and disease severity using data from the first time point could not be replicated by the RNA‐Seq data from the second time point. We modeled the relationship between the two sequencing technologies using the data from the overlapping samples. We used linear mixed models with either DeepSAGE or RNA‐Seq measurements as the dependent variable and disease severity as the independent variable. In conclusion, (1) for one out of 14 genes, the initial significant result could be replicated with both technologies using data from both time points; (2) statistical efficiency is lost due to disagreement between the two technologies, measurement error when predicting gene expressions, and the need to include additional parameters to account for possible differences.
- Subjects :
- Statistics and Probability
Mixed model
Technology
DeepSAGE
Computer science
media_common.quotation_subject
RNA-Seq
01 natural sciences
Generalized linear mixed model
010104 statistics & probability
03 medical and health sciences
Longitudinal and Time‐to‐event Analysis
Statistics
Humans
RNA‐Seq
Longitudinal Studies
0101 mathematics
Time point
quality control
030304 developmental biology
media_common
0303 health sciences
Measure (data warehouse)
Variables
Observational error
Gene Expression Profiling
General Medicine
Identification (information)
Huntington Disease
Statistics, Probability and Uncertainty
linear mixed model
measurement error
Research Paper
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Biometrical Journal. Biometrische Zeitschrift, Biometrical Journal, 63(4), 745-760. WILEY, Biometrical Journal
- Accession number :
- edsair.doi.dedup.....5e23cd4ca186f78d25ac70000df1060f