Back to Search Start Over

Identification of human circadian genes based on time course gene expression profiles by using a deep learning method.

Authors :
Cui, Peng
Zhong, Tingyan
Wang, Zhuo
Wang, Tao
Zhao, Hongyu
Liu, Chenglin
Lu, Hui
Source :
BBA: Molecular Basis of Disease. Jun2018:Part B, Vol. 1864 Issue 6, p2274-2283. 10p.
Publication Year :
2018

Abstract

Circadian genes express periodically in an approximate 24-h period and the identification and study of these genes can provide deep understanding of the circadian control which plays significant roles in human health. Although many circadian gene identification algorithms have been developed, large numbers of false positives and low coverage are still major problems in this field. In this study we constructed a novel computational framework for circadian gene identification using deep neural networks (DNN) – a deep learning algorithm which can represent the raw form of data patterns without imposing assumptions on the expression distribution. Firstly, we transformed time-course gene expression data into categorical-state data to denote the changing trend of gene expression. Two distinct expression patterns emerged after clustering of the state data for circadian genes from our manually created learning dataset. DNN was then applied to discriminate the aperiodic genes and the two subtypes of periodic genes. In order to assess the performance of DNN, four commonly used machine learning methods including k-nearest neighbors, logistic regression, naïve Bayes, and support vector machines were used for comparison. The results show that the DNN model achieves the best balanced precision and recall. Next, we conducted large scale circadian gene detection using the trained DNN model for the remaining transcription profiles. Comparing with JTK_CYCLE and a study performed by Möller-Levet et al. (doi: https://doi.org/10.1073/pnas.1217154110 ), we identified 1132 novel periodic genes. Through the functional analysis of these novel circadian genes, we found that the GTPase superfamily exhibits distinct circadian expression patterns and may provide a molecular switch of circadian control of the functioning of the immune system in human blood. Our study provides novel insights into both the circadian gene identification field and the study of complex circadian-driven biological control. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09254439
Volume :
1864
Issue :
6
Database :
Academic Search Index
Journal :
BBA: Molecular Basis of Disease
Publication Type :
Academic Journal
Accession number :
129333196
Full Text :
https://doi.org/10.1016/j.bbadis.2017.12.004