Back to Search
Start Over
Statistical modeling of an outcome variable with integrated omics data
- Publication Year :
- 2023
- Publisher :
- Utrecht University, 2023.
-
Abstract
- In human disease studies, it has become common to collect multiple omics datasets measured on various molecular levels. The aim is to study the underlying mechanisms of disease from different perspectives by jointly analyzing these datasets. This thesis develops statistical methodologies to model a disease outcome with two omics datasets. We consider latent variable methods for constructing low-dimensional components representing the two omics, and linear models for associating the components to a disease. The latent variable methods address the statistical challenges of high dimensionality, correlations within and between omics, and systematic differences between datasets. The linear models provide flexibility for various study designs and different distributions of disease outcomes. Both two-stage methods where latent variable model and linear model are fitted separately and one-stage methods where the two are fitted simultaneously are developed. The two-stage methods are computationally fast and offer more flexibility in the linear models, while the one-stage models provide unbiased inference results. The methods are all validated and can be used in a wide range of disease studies.
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.od.....10691..e71991d21ad506cd59c9bb7d4459819d