Back to Search
Start Over
Synthetic data generation with probabilistic Bayesian Networks
- Source :
- Mathematical Biosciences and Engineering, Vol 18, Iss 6, Pp 8603-8621 (2021)
- Publication Year :
- 2021
- Publisher :
- AIMS Press, 2021.
-
Abstract
- Bayesian Network (BN) modeling is a prominent and increasingly popular computational systems biology method. It aims to construct network graphs from the large heterogeneous biological datasets that reflect the underlying biological relationships. Currently, a variety of strategies exist for evaluating BN methodology performance, ranging from utilizing artificial benchmark datasets and models, to specialized biological benchmark datasets, to simulation studies that generate synthetic data from predefined network models. The last is arguably the most comprehensive approach; however, existing implementations often rely on explicit and implicit assumptions that may be unrealistic in a typical biological data analysis scenario, or are poorly equipped for automated arbitrary model generation. In this study, we develop a purely probabilistic simulation framework that addresses the demands of statistically sound simulations studies in an unbiased fashion. Additionally, we expand on our current understanding of the theoretical notions of causality and dependence / conditional independence in BNs and the Markov Blankets within.
Details
- Language :
- English
- ISSN :
- 15510018
- Volume :
- 18
- Issue :
- 6
- Database :
- Directory of Open Access Journals
- Journal :
- Mathematical Biosciences and Engineering
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.02b13fa56cb2452892c55bbe03f653a3
- Document Type :
- article
- Full Text :
- https://doi.org/10.3934/mbe.2021426?viewType=HTML