Back to Search Start Over

A Frequent Pattern Mining Method for Finding Planted Motifs of Unknown Length in DNA Sequences

Authors :
Caiyan Jia
Ruqian Lu
Lusheng Chen
Source :
International Journal of Computational Intelligence Systems, Vol 4, Iss 5 (2011)
Publication Year :
2011
Publisher :
Springer, 2011.

Abstract

Identification and characterization of gene regulatory binding motifs is one of the fundamental tasks toward systematically understanding the molecular mechanisms of transcriptional regulation. Recently, the problem has been abstracted as the challenge planted (l,d)-motif problem. Previous studies have developed numerous methods to solve the problem. But most of them need to specify the length l of a planted motif in advance and use depth first search strategy. In this study, we present an exact and efficient algorithm, called Apriori-Motif, without given the length l of a planted motif a priori. And a breadth first search strategy is used to prune search space quickly by the downward closure property utilized in Apriori, which is a classical algorithm for frequent pattern mining. Empirical study shows that Apriori-Motif is better than some existing methods.

Details

Language :
English
ISSN :
18756883
Volume :
4
Issue :
5
Database :
Directory of Open Access Journals
Journal :
International Journal of Computational Intelligence Systems
Publication Type :
Academic Journal
Accession number :
edsdoj.2b8e9644441f495099006334956f13bd
Document Type :
article
Full Text :
https://doi.org/10.2991/ijcis.2011.4.5.26