1. Predicting Peptide HLA-II Presentation Using Immunopeptidomics, Transcriptomics and Deep Multimodal Learning
- Author
-
Hesham ElAbd, Mareike Wendorff, Tomas Koudelka, Christian Hentschker, Ann-Kristin Kamps, Christoph Prieß, Lars Wienbrandt, Frauke Degenhardt, Tim A. Steiert, Petra Bacher, Piyush Mathur, David Ellinghaus, Uwe Völker, Andreas Tholey, Tobias L. Lenz, and Andre Franke
- Abstract
The human leukocyte antigen (HLA) class II proteins present peptides to CD4+T cells through an interaction with T cell receptors (TCRs). Thus, HLA proteins are key players in shaping immunogenicity and immunodominance. Nevertheless, factors governing peptide presentation by HLA-II proteins are still poorly understood. To address this problem, we profiled the blood transcriptome and immunopeptidome of 20 healthy individuals and integrated the profiles with publicly available immunopeptidomics datasets. In depth multi-omics analysis identified expression levels and subcellular locations as import sequence-independent features governing presentation. Levering this knowledge, we developed the Peptide Immune Annotator Multimodal (PIA-M) tool, as a novel pan multimodal transformer-based framework that utilises sequence-dependent along with sequence-independent features to model presentation by HLA-II proteins.PIA-Millustrated a consistently superior performance relative to existing tools across two independent test datasets (area under the curve: 0.93 vs. 0.84 and 0.95 vs. 0.86), respectively. Besides achieving a higher predictive accuracy,PIA-Mwith its Rust-based pre-processing engine, had significantly shorter runtimes.PIA-Mis freely available with a permissive licence as a standalone pipeline and as a webserver (https://hybridcomputing.ikmb.uni-kiel.de/pia). In conclusion,PIA-Menables a new state-of-the-art accuracy in predicting peptide presentation by HLA-II proteinsin vivo.
- Published
- 2022
- Full Text
- View/download PDF