Back to Search Start Over

Statistical modeling of an outcome variable with integrated omics data

Authors :
Gu, Zhujie
Sturkenboom, M.C.J.M.
Houwing-Duistermaat, J.J.
Uh, H.W.
Bouhaddani, S. el
Publication Year :
2023
Publisher :
Utrecht University, 2023.

Abstract

In human disease studies, it has become common to collect multiple omics datasets measured on various molecular levels. The aim is to study the underlying mechanisms of disease from different perspectives by jointly analyzing these datasets. This thesis develops statistical methodologies to model a disease outcome with two omics datasets. We consider latent variable methods for constructing low-dimensional components representing the two omics, and linear models for associating the components to a disease. The latent variable methods address the statistical challenges of high dimensionality, correlations within and between omics, and systematic differences between datasets. The linear models provide flexibility for various study designs and different distributions of disease outcomes. Both two-stage methods where latent variable model and linear model are fitted separately and one-stage methods where the two are fitted simultaneously are developed. The two-stage methods are computationally fast and offer more flexibility in the linear models, while the one-stage models provide unbiased inference results. The methods are all validated and can be used in a wide range of disease studies.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.od.....10691..e71991d21ad506cd59c9bb7d4459819d