1. Bayesian modeling of interaction between features in sparse multivariate count data with application to microbiome study
- Author
-
Zhang, Shuangjie, Shen, Yuning, Chen, Irene A, and Lee, Juhee
- Subjects
Mathematical Sciences ,Statistics ,Covariance matrix ,differential abundance ,factor model ,joint sparsity ,kernel model ,zero inflation ,multivariate count data ,Econometrics ,Statistics & Probability - Abstract
Many statistical methods have been developed for the analysis of microbial community profiles, but due to the complexity of typical microbiome measurements, inference of interactions between microbial features remains challenging. We develop a Bayesian zero-inflated rounded log-normal kernel method to model interaction between microbial features in a community using multivariate count data in the presence of covariates and excess zeros. The model carefully constructs the interaction structure by imposing joint sparsity on the covariance matrix of the kernel and obtains a reliable estimate of the structure with a small sample size. The model also includes zero inflation to account for excess zeros observed in data and infers differential abundance of microbial features associated with covariates through log-linear regression. We provide simulation studies and real data analysis examples to demonstrate the developed model. Comparison of the model to a simpler model and popular alternatives in simulation studies shows that, in addition to an added and important insight on the feature interaction, it yields superior parameter estimates and model fit in various settings.
- Published
- 2023