Back to Search Start Over

Carafe enables high quality in silico spectral library generation for data-independent acquisition proteomics.

Authors :
Wen B
Hsu C
Zeng WF
Riffle M
Chang A
Mudge M
Nunn B
Berg MD
Villén J
MacCoss MJ
Noble WS
Source :
BioRxiv : the preprint server for biology [bioRxiv] 2024 Oct 18. Date of Electronic Publication: 2024 Oct 18.
Publication Year :
2024

Abstract

Data-independent acquisition (DIA)-based mass spectrometry is becoming an increasingly popular mass spectrometry acquisition strategy for carrying out quantitative proteomics experiments. Most of the popular DIA search engines make use of in silico generated spectral libraries. However, the generation of high-quality spectral libraries for DIA data analysis remains a challenge, particularly because most such libraries are generated directly from data-dependent acquisition (DDA) data or are from in silico prediction using models trained on DDA data. In this study, we developed Carafe, a tool that generates high-quality experiment-specific in silico spectral libraries by training deep learning models directly on DIA data. We demonstrate the performance of Carafe on a wide range of DIA datasets, where we observe improved fragment ion intensity prediction and peptide detection relative to existing pretrained DDA models.<br />Competing Interests: The MacCoss Lab at the University of Washington receives funding from Agilent, Bruker, Sciex, Shimadzu, Thermo Fisher Scientific, and Waters to support the development of Skyline, a quantitative analysis software tool. MJM is a paid consultant for Thermo Fisher Scientific.

Details

Language :
English
ISSN :
2692-8205
Database :
MEDLINE
Journal :
BioRxiv : the preprint server for biology
Publication Type :
Academic Journal
Accession number :
39463980
Full Text :
https://doi.org/10.1101/2024.10.15.618504