Back to Search Start Over

Synthetic data generation with probabilistic Bayesian Networks

Authors :
Grigoriy Gogoshin
Sergio Branciamore
Andrei S. Rodin
Source :
Mathematical Biosciences and Engineering, Vol 18, Iss 6, Pp 8603-8621 (2021)
Publication Year :
2021
Publisher :
AIMS Press, 2021.

Abstract

Bayesian Network (BN) modeling is a prominent and increasingly popular computational systems biology method. It aims to construct network graphs from the large heterogeneous biological datasets that reflect the underlying biological relationships. Currently, a variety of strategies exist for evaluating BN methodology performance, ranging from utilizing artificial benchmark datasets and models, to specialized biological benchmark datasets, to simulation studies that generate synthetic data from predefined network models. The last is arguably the most comprehensive approach; however, existing implementations often rely on explicit and implicit assumptions that may be unrealistic in a typical biological data analysis scenario, or are poorly equipped for automated arbitrary model generation. In this study, we develop a purely probabilistic simulation framework that addresses the demands of statistically sound simulations studies in an unbiased fashion. Additionally, we expand on our current understanding of the theoretical notions of causality and dependence / conditional independence in BNs and the Markov Blankets within.

Details

Language :
English
ISSN :
15510018
Volume :
18
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Mathematical Biosciences and Engineering
Publication Type :
Academic Journal
Accession number :
edsdoj.02b13fa56cb2452892c55bbe03f653a3
Document Type :
article
Full Text :
https://doi.org/10.3934/mbe.2021426?viewType=HTML