Back to Search Start Over

Prediction model construction of mouse stem cell pluripotency using CpG and non-CpG DNA methylation markers

Authors :
Soobok Joe
Hojung Nam
Source :
BMC Bioinformatics, Vol 21, Iss 1, Pp 1-12 (2020), BMC Bioinformatics
Publication Year :
2020
Publisher :
BMC, 2020.

Abstract

Background Genome-wide studies of DNA methylation across the epigenetic landscape provide insights into the heterogeneity of pluripotent embryonic stem cells (ESCs). Differentiating into embryonic somatic and germ cells, ESCs exhibit varying degrees of pluripotency, and epigenetic changes occurring in this process have emerged as important factors explaining stem cell pluripotency. Results Here, using paired scBS-seq and scRNA-seq data of mice, we constructed a machine learning model that predicts degrees of pluripotency for mouse ESCs. Since the biological activities of non-CpG markers have yet to be clarified, we tested the predictive power of CpG and non-CpG markers, as well as a combination thereof, in the model. Through rigorous performance evaluation with both internal and external validation, we discovered that a model using both CpG and non-CpG markers predicted the pluripotency of ESCs with the highest prediction performance (0.956 AUC, external test). The prediction model consisted of 16 CpG and 33 non-CpG markers. The CpG and most of the non-CpG markers targeted depletions of methylation and were indicative of cell pluripotency, whereas only a few non-CpG markers reflected accumulations of methylation. Additionally, we confirmed that there exists the differing pluripotency between individual developmental stages, such as E3.5 and E6.5, as well as between induced mouse pluripotent stem cell (iPSC) and somatic cell. Conclusions In this study, we investigated CpG and non-CpG methylation in relation to mouse stem cell pluripotency and developed a model thereon that successfully predicts the pluripotency of mouse ESCs.

Details

Language :
English
ISSN :
14712105
Volume :
21
Issue :
1
Database :
OpenAIRE
Journal :
BMC Bioinformatics
Accession number :
edsair.doi.dedup.....41d7d2259db2c56b7a77520095db5d0c
Full Text :
https://doi.org/10.1186/s12859-020-3448-3