1. A pattern recognition approach to infer time-lagged genetic interactions
- Author
-
Grace S. Shieh, Cheng-Long Chuang, Chung-Ming Chen, and Chih-Hung Jen
- Subjects
Statistics and Probability ,Time Factors ,Gaussian ,Biology ,Biochemistry ,Pattern Recognition, Automated ,Correlation ,symbols.namesake ,Gene interaction ,Artificial Intelligence ,Molecular Biology ,Gene ,Oligonucleotide Array Sequence Analysis ,Binding Sites ,Microarray analysis techniques ,business.industry ,Gene Expression Profiling ,Small number ,Pattern recognition ,Computer Science Applications ,Gene expression profiling ,Computational Mathematics ,Computational Theory and Mathematics ,Multigene Family ,Pattern recognition (psychology) ,symbols ,Artificial intelligence ,business ,Algorithms ,Software ,Protein Binding ,Signal Transduction ,Transcription Factors - Abstract
Motivation: For any time-course microarray data in which the gene interactions and the associated paired patterns are dependent, the proposed pattern recognition (PARE) approach can infer time-lagged genetic interactions, a challenging task due to the small number of time points and large number of genes. PARE utilizes a non-linear score to identify subclasses of gene pairs with different time lags. In each subclass, PARE extracts non-linear characteristics of paired gene-expression curves and learns weights of the decision score applying an optimization algorithm to microarray gene-expression data (MGED) of some known interactions, from biological experiments or published literature. Namely, PARE integrates both MGED and existing knowledge via machine learning, and subsequently predicts the other genetic interactions in the subclass. Results: PARE, a time-lagged correlation approach and the latest advance in graphical Gaussian models were applied to predict 112 (132) pairs of TC/TD (transcriptional regulatory) interactions. Checked against qRT-PCR results (published literature), their true positive rates are 73% (77%), 46% (51%), and 52% (59%), respectively. The false positive rates of predicting TC and TD (AT and RT) interactions in the yeast genome are bounded by 13 and 10% (10 and 14%), respectively. Several predicted TC/TD interactions are shown to coincide with existing pathways involving Sgs1, Srs2 and Mus81. This reinforces the possibility of applying genetic interactions to predict pathways of protein complexes. Moreover, some experimentally testable gene interactions involving DNA repair are predicted. Availability: Supplementary data and PARE software are available at http://www.stat.sinica.edu.tw/~gshieh/pare.htm. Contact: gshieh@stat.sinica.edu.tw
- Published
- 2008