A biological compression model and its applications.
- Author
- Cao MD, Dix TI, and Allison L
- Subjects
- Computational Biology; Expert Systems; Genome, Human; Genomics/statistics & numerical data; Humans; Information Theory; Knowledge Bases; Models, Genetic; Models, Statistical; Phylogeny; Repetitive Sequences, Nucleic Acid; Sequence Alignment/statistics & numerical data; Algorithms; Data Compression/statistics & numerical data
- Abstract
- A biological compression model, the expert model, is presented which is superior to existing compression algorithms in both compression performance and speed. The model is able to compress whole eukaryotic genomes. Most importantly, the model provides a framework for knowledge discovery from biological data. It can be used for repeat element discovery, sequence alignment and phylogenetic analysis. We demonstrate that the model can handle statistically biased sequences and distantly related sequences where conventional knowledge discovery tools often fail.
- Published
- 2011