Back to Search
Start Over
State of the Human Proteome in 2014/2015 As Viewed through PeptideAtlas: Enhancing Accuracy and Coverage through the AtlasProphet
- Source :
- Journal of Proteome Research. 14:3461-3473
- Publication Year :
- 2015
- Publisher :
- American Chemical Society (ACS), 2015.
-
Abstract
- The Human PeptideAtlas is a compendium of the highest quality peptide identifications from over 1000 shotgun mass spectrometry proteomics experiments collected from many different labs, all reanalyzed through a uniform processing pipeline. The latest 2015-03 build contains substantially more input data than past releases, is mapped to a recent version of our merged reference proteome, and uses improved informatics processing and the development of the AtlasProphet to provide the highest quality results. Within the set of ~20,000 neXtProt primary entries, 14,070 (70%) are confidently detected in the latest build, 5% are ambiguous, 9% are redundant, leaving the total percentage of proteins for which there are no mapping detections at just 16% (3166), all derived from over 133 million peptide-spectrum matches identifying more than 1 million distinct peptides using AtlasProphet to characterize and classify the protein matches. Improved handling for detection and presentation of single amino-acid variants (SAAVs) reveals the detection of 5,326 uniquely mapping SAAVs across 2,794 proteins. With such a large amount of data, the control of false positives is a challenge. We present the methodology and results for maintaining rigorous quality, along with a discussion of the implications of the remaining sources of errors in the build. We check our uncertainty estimates against a set of olfactory receptor proteins not expected to be present in the set. We show how the use of synthetic reference spectra can provide confirmatory evidence for claims of detection of proteins with weak evidence.
- Subjects :
- Proteomics
Molecular Sequence Data
Biology
computer.software_genre
Biochemistry
Article
Set (abstract data type)
03 medical and health sciences
False positive paradox
Human proteome project
Humans
Amino Acid Sequence
Databases, Protein
Shotgun proteomics
030304 developmental biology
0303 health sciences
Sequence Homology, Amino Acid
NeXtProt
030302 biochemistry & molecular biology
Proteins
General Chemistry
Amino Acid Substitution
Proteome
Data mining
PeptideAtlas
computer
Subjects
Details
- ISSN :
- 15353907 and 15353893
- Volume :
- 14
- Database :
- OpenAIRE
- Journal :
- Journal of Proteome Research
- Accession number :
- edsair.doi.dedup.....0f97772289a9298ab8466e4817087d4c
- Full Text :
- https://doi.org/10.1021/acs.jproteome.5b00500