Back to Search Start Over

Algorithms for Database-Dependent Search of MS/MS Data

Authors :
Rune Matthiesen
Source :
Mass Spectrometry Data Analysis in Proteomics ISBN: 9781627033916
Publication Year :
2013
Publisher :
Humana Press, 2013.

Abstract

The frequent used bottom-up strategy for identification of proteins and their associated modifications generate nowadays typically thousands of MS/MS spectra that normally are matched automatically against a protein sequence database. Search engines that take as input MS/MS spectra and a protein sequence database are referred as database-dependent search engines. Many programs both commercial and freely available exist for database-dependent search of MS/MS spectra and most of the programs have excellent user documentation. The aim here is therefore to outline the algorithm strategy behind different search engines rather than providing software user manuals. The process of database-dependent search can be divided into search strategy, peptide scoring, protein scoring, and finally protein inference. Most efforts in the literature have been put in to comparing results from different software rather than discussing the underlining algorithms. Such practical comparisons can be cluttered by suboptimal implementation and the observed differences are frequently caused by software parameters settings which have not been set proper to allow even comparison. In other words an algorithmic idea can still be worth considering even if the software implementation has been demonstrated to be suboptimal. The aim in this chapter is therefore to split the algorithms for database-dependent searching of MS/MS data into the above steps so that the different algorithmic ideas become more transparent and comparable. Most search engines provide good implementations of the first three data analysis steps mentioned above, whereas the final step of protein inference are much less developed for most search engines and is in many cases performed by an external software. The final part of this chapter illustrates how protein inference is built into the VEMS search engine and discusses a stand-alone program SIR for protein inference that can import a Mascot search result.

Details

ISBN :
978-1-62703-391-6
ISBNs :
9781627033916
Database :
OpenAIRE
Journal :
Mass Spectrometry Data Analysis in Proteomics ISBN: 9781627033916
Accession number :
edsair.doi...........72c858005310e5525ca2dff7a838dfac
Full Text :
https://doi.org/10.1007/978-1-62703-392-3_5