1. Application of 2D graphic representation of protein sequence based on Huffman tree method
- Author
- Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling
- Subjects
- *AMINO acid sequence, *COMPUTER graphics, *TWO-dimensional models, *ESCHERICHIA coli, *GENOMES, *ALGORITHMS
- Abstract
- Based on the Huffman tree method, we propose a new 2D graphic representation of protein sequences. This representation completely avoids loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. The first derives 0–1 codes for the 20 amino acids from a Huffman tree built on amino acid frequencies, where the frequency of an amino acid is defined as the number of times it occurs in the analyzed protein sequences. The second constructs the 2D graphic representation of a protein sequence from these 0–1 codes. Applications of the method to ten ND5 genes and seven Escherichia coli strains are then presented in detail. The results show that the proposed model may provide new insights into the evolution patterns determined from protein sequences and complete genomes. [Copyright © Elsevier]
- Published
- 2012
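
The first part of the method described in the abstract, assigning 0–1 codes to amino acids from a Huffman tree built on their frequencies, can be illustrated with a minimal Python sketch. This is textbook Huffman coding, not necessarily the authors' exact construction: the function names `huffman_codes`, `encode`, and `decode` are hypothetical, and the tie-breaking rule is an assumption. The mapping from 0–1 codes to a 2D curve is not specified in this record, so it is not sketched here.

```python
import heapq
from collections import Counter
from itertools import count


def huffman_codes(sequences):
    """Return {amino_acid: 0-1 code} built from residue counts.

    The frequency of a residue is its number of occurrences across
    all given sequences, as in the abstract's definition.
    """
    freq = Counter("".join(sequences))
    if not freq:
        return {}
    if len(freq) == 1:
        # A single symbol still needs a non-empty code.
        return {next(iter(freq)): "0"}

    # Assumption: ties are broken by insertion order via a counter,
    # which also keeps the heap from comparing the dict payloads.
    tiebreak = count()
    heap = [(n, next(tiebreak), {aa: ""}) for aa, n in sorted(freq.items())]
    heapq.heapify(heap)

    while len(heap) > 1:
        # Merge the two least frequent subtrees; prefix 0 / 1 to their codes.
        n0, _, left = heapq.heappop(heap)
        n1, _, right = heapq.heappop(heap)
        merged = {aa: "0" + code for aa, code in left.items()}
        merged.update({aa: "1" + code for aa, code in right.items()})
        heapq.heappush(heap, (n0 + n1, next(tiebreak), merged))

    return heap[0][2]


def encode(sequence, codes):
    """Concatenate the 0-1 codes of each residue in the sequence."""
    return "".join(codes[aa] for aa in sequence)


def decode(bits, codes):
    """Recover the residue sequence; works because the codes are prefix-free."""
    inverse = {code: aa for aa, code in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return "".join(out)
```

Because Huffman codes are prefix-free, `decode(encode(s, codes), codes)` returns `s` for any sequence whose residues were counted. This is one way to see why a representation built on such codes need not lose information.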