Back to Search
Start Over
A protein identification algorithm for tandem mass spectrometry by incorporating the abundance of mRNA into a binomial probability scoring model
- Source :
- Journal of proteomics. 197
- Publication Year :
- 2018
-
Abstract
- Peptide-spectrum matches (PSM) scoring between the experimental and theoretical spectrum is a key step in the identification of proteins using mass spectrometry (MS)-based proteomics analyses. Efficient protein identification using MS/MS data remains a challenge. The strategy of using RNA-seq data increases the number of proteins identified by re-constructing the custom search database and integrating mRNA abundance into the false discovery rate of post-PSM. However, this process lacks an algorithm that can allow the incorporation of mRNA abundance into the key scoring model of PSM. Therefore, we developed a novel PSM scoring model, which incorporates mRNA abundance for improved peptide and protein identification. In the new algorithm, abundance information of mRNA was transformed to the prior probability of protein identification and integrated to re-score in PSM using the binomial probability distribution model. Compared with other algorithms using five MS/MS datasets, the results showed that the least improvement ratios of peptide and protein groups were 3.39%-9.79% and 0.48%-8.16% in different datasets (human, rat, zebrafish, yeast, and Arabidopsis thaliana). The new strategy offers an effective solution for MS-based identification of peptides and proteins. SIGNIFICANCE: The new algorithm identifies proteins by quantifying mRNA abundance (FPKM) and incorporating it into a scoring model for peptide-spectrum matches. It is important to improve peptide and protein identification from MS/MS datasets in proteomics research.
- Subjects :
- 0301 basic medicine
False discovery rate
Biophysics
Arabidopsis
Peptide
RNA-Seq
Saccharomyces cerevisiae
Biology
Tandem mass spectrometry
Mass spectrometry
Proteomics
Biochemistry
03 medical and health sciences
Tandem Mass Spectrometry
Prior probability
Animals
Humans
RNA, Messenger
Zebrafish
chemistry.chemical_classification
030102 biochemistry & molecular biology
fungi
RNA, Fungal
Rats
Binomial distribution
030104 developmental biology
chemistry
RNA, Plant
Databases, Nucleic Acid
Algorithm
Algorithms
Subjects
Details
- ISSN :
- 18767737
- Volume :
- 197
- Database :
- OpenAIRE
- Journal :
- Journal of proteomics
- Accession number :
- edsair.doi.dedup.....07a999f3ec3b23ad61c609c9c6c55570