Back to Search Start Over

A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

Authors :
Jian L
Niu X
Xia Z
Samir P
Sumanasekera C
Mu Z
Jennings JL
Hoek KL
Allos T
Howard LM
Edwards KM
Weil PA
Link AJ
Source :
Journal of proteome research [J Proteome Res] 2013 Mar 01; Vol. 12 (3), pp. 1108-19. Date of Electronic Publication: 2013 Feb 12.
Publication Year :
2013

Abstract

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.

Details

Language :
English
ISSN :
1535-3907
Volume :
12
Issue :
3
Database :
MEDLINE
Journal :
Journal of proteome research
Publication Type :
Academic Journal
Accession number :
23402659
Full Text :
https://doi.org/10.1021/pr300631t