Back to Search Start Over

Extending the coverage of spectral libraries: A neighbor-based approach to predicting intensities of peptide fragmentation spectra

Authors :
Predrag Radivojac
Randy J. Arnold
Haixu Tang
Richard W. Hardy
Kevin J. Sokoloski
Chao Ji
Source :
PROTEOMICS. 13:756-765
Publication Year :
2013
Publisher :
Wiley, 2013.

Abstract

Searching spectral libraries in tandem mass spectrometry (MS/MS) is an important new approach to improving the quality of peptide and protein identification. The idea relies on the observation that ion intensities in an MS/MS spectrum of a given peptide are generally reproducible across experiments, and thus, matching between spectra from an experiment and the spectra of previously identified peptides stored in a spectral library can lead to better peptide identification compared to the traditional database search. However, the use of libraries is greatly limited by their coverage of peptide sequences: even for well-studied organisms a large fraction of peptides have not been previously identified. To address this issue, we propose to expand spectral libraries by predicting the MS/MS spectra of peptides based on the spectra of peptides with similar sequences. We first demonstrate that the intensity patterns of dominant fragment ions between similar peptides tend to be similar. In accordance with this observation, we develop a neighbor-based approach which first selects peptides that are likely to have spectra similar to the target peptide and then combines their spectra using a weighted K-nearest neighbor method to accurately predict fragment ion intensities corresponding to the target peptide. This approach has the potential to predict spectra for every peptide in the proteome. When rigorous quality criteria are applied, we estimate that the method increases the coverage of spectral libraries available from the National Institute of Standards and Technology by 20–60%, although the values vary with peptide length and charge state. We find that the overall best search performance is achieved when spectral libraries are supplemented by the high quality predicted spectra.

Details

ISSN :
16159853
Volume :
13
Database :
OpenAIRE
Journal :
PROTEOMICS
Accession number :
edsair.doi.dedup.....7cc7f92f0dee9993a4d756cb04433441