1. AN AUTOMATA APPROACH TO MATCH GAPPED SEQUENCE TAGS AGAINST PROTEIN DATABASE.
- Author
- Han, Yonghua; Ma, Bin; Zhang, Kaizhong
- Subjects
- *BIOCHEMISTRY, *MASS spectrometry, *PEPTIDES, *PROTEINS, *AMINO acid sequence, *ALGORITHMS
- Abstract
In biochemistry, tandem mass spectrometry (MS/MS) is the most common method for peptide and protein identification. One computational method for deriving a peptide sequence from MS/MS data is de novo sequencing, which is becoming increasingly important in this area. However, de novo sequencing can usually determine only partial sequences with confidence, and the undetermined parts are represented by "mass gaps". We call such a partially determined sequence a gapped sequence tag. When a gapped sequence tag is searched in a database for protein identification, the determined parts must match the database sequence exactly, while each mass gap must match a substring of amino acids whose masses add up to the value of the mass gap. In such a case, standard string matching algorithms no longer work. In this paper, we present a new efficient algorithm to find the matches of gapped sequence tags in a protein database. [ABSTRACT FROM AUTHOR]
- Published
- 2005
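The matching rule stated in the abstract (determined parts match exactly, each mass gap matches a run of residues whose masses sum to the gap value) can be illustrated with a brute-force sketch. This is not the automaton-based algorithm of the paper, whose details the record does not give; it only shows what counts as a match. The tag representation (a list mixing strings and floats), the function names `match_from` and `find_matches`, and the 0.01 Da tolerance are all assumptions made for illustration. The residue masses are the standard monoisotopic values.

```python
from typing import List, Tuple, Union

# Standard monoisotopic residue masses in daltons.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}

# Hypothetical tag encoding: str = determined residues, float = mass gap.
TagPart = Union[str, float]


def match_from(protein: str, pos: int, tag: List[TagPart],
               tol: float = 0.01) -> List[int]:
    """Return every end position at which `tag` matches starting at `pos`."""
    if not tag:
        return [pos]
    head, rest = tag[0], tag[1:]
    ends: List[int] = []
    if isinstance(head, str):
        # Determined part: must match the database sequence exactly.
        if protein.startswith(head, pos):
            ends.extend(match_from(protein, pos + len(head), rest, tol))
    else:
        # Mass gap: extend residue by residue until the sum reaches the gap.
        total, j = 0.0, pos
        while j < len(protein) and total < head - tol:
            # Unknown residue letters get infinite mass, so they never match.
            total += RESIDUE_MASS.get(protein[j], float("inf"))
            j += 1
            if abs(total - head) <= tol:
                ends.extend(match_from(protein, j, rest, tol))
    return ends


def find_matches(protein: str, tag: List[TagPart],
                 tol: float = 0.01) -> List[Tuple[int, int]]:
    """Naive scan: try the tag at every start position of the protein."""
    return [(start, end)
            for start in range(len(protein))
            for end in match_from(protein, start, tag, tol)]


# "TA", then a gap equal to the mass of Y + I, then "AK".
print(find_matches("MKTAYIAKQR", ["TA", 276.147, "AK"]))  # [(2, 8)]
```

The scan retries the tag at every database position, which is the cost an automaton-based approach would aim to avoid.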