Back to Search Start Over

Deep generative models for T cell receptor protein sequences

Authors :
Kristian Davidsen
Branden J Olson
William S DeWitt III
Jean Feng
Elias Harkins
Philip Bradley
Frederick A Matsen IV
Source :
eLife, Vol 8 (2019)
Publication Year :
2019
Publisher :
eLife Sciences Publications Ltd, 2019.

Abstract

Probabilistic models of adaptive immune repertoire sequence distributions can be used to infer the expansion of immune cells in response to stimulus, differentiate genetic from environmental factors that determine repertoire sharing, and evaluate the suitability of various target immune sequences for stimulation via vaccination. Classically, these models are defined in terms of a probabilistic V(D)J recombination model which is sometimes combined with a selection model. In this paper we take a different approach, fitting variational autoencoder (VAE) models parameterized by deep neural networks to T cell receptor (TCR) repertoires. We show that simple VAE models can perform accurate cohort frequency estimation, learn the rules of VDJ recombination, and generalize well to unseen sequences. Further, we demonstrate that VAE-like models can distinguish between real sequences and sequences generated according to a recombination-selection model, and that many characteristics of VAE-generated sequences are similar to those of real sequences.

Details

Language :
English
ISSN :
2050084X
Volume :
8
Database :
Directory of Open Access Journals
Journal :
eLife
Publication Type :
Academic Journal
Accession number :
edsdoj.830ffb8a0de4a7489ad9c4b44a47423
Document Type :
article
Full Text :
https://doi.org/10.7554/eLife.46935