Back to Search
Start Over
Autoencoder based local T cell repertoire density can be used to classify samples and T cell receptors
- Source :
- PLoS Computational Biology, Vol 17, Iss 7, p e1009225 (2021), PLoS Computational Biology
- Publication Year :
- 2021
- Publisher :
- Public Library of Science (PLoS), 2021.
-
Abstract
- Recent advances in T cell repertoire (TCR) sequencing allow for the characterization of repertoire properties, as well as the frequency and sharing of specific TCR. However, there is no efficient measure for the local density of a given TCR. TCRs are often described either through their Complementary Determining region 3 (CDR3) sequences, or theirV/J usage, or their clone size. We here show that the local repertoire density can be estimated using a combined representation of these components through distance conserving autoencoders and Kernel Density Estimates (KDE). We present ELATE–an Encoder-based LocAl Tcr dEnsity and show that the resulting density of a sample can be used as a novel measure to study repertoire properties. The cross-density between two samples can be used as a similarity matrix to fully characterize samples from the same host. Finally, the same projection in combination with machine learning algorithms can be used to predict TCR-peptide binding through the local density of known TCRs binding a specific target.<br />Author summary T cell repertoires contain a vast amount of information on the donors, and can be used to characterize the donor, and apply machine learning algorithms on such repertoires. A limiting factor in the analysis of such repertoire is the lack of a good representation of the T cell receptors. We here propose an autoencoder, named ELATE to present receptors as real vectors. We show that this encoder can be used to characterize both full donors and specific receptors using either supervised or unsupervised methods.
- Subjects :
- 0301 basic medicine
Computer science
Receptors, Antigen, T-Cell, alpha-beta
Entropy
Immunoglobulin Variable Region
Immune Receptors
Biochemistry
Machine Learning
White Blood Cells
Electronics Engineering
0302 clinical medicine
Animal Cells
Databases, Genetic
Medicine and Health Sciences
Signal Decoders
Gene Rearrangement, beta-Chain T-Cell Antigen Receptor
Amino Acids
Biology (General)
Projection (set theory)
Immune System Proteins
Ecology
Organic Compounds
T Cells
Physics
Repertoire
food and beverages
hemic and immune systems
Chemistry
Oncology
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Modeling and Simulation
Physical Sciences
Engineering and Technology
Thermodynamics
Gene Cloning
Cellular Types
Encoder
Algorithms
Research Article
Signal Transduction
QH301-705.5
Immune Cells
Immunology
Kernel density estimation
Receptors, Antigen, T-Cell
chemical and pharmacologic phenomena
Computational biology
Research and Analysis Methods
Measure (mathematics)
03 medical and health sciences
Cellular and Molecular Neuroscience
Genetics
Humans
Sulfur Containing Amino Acids
Entropy (information theory)
Amino Acid Sequence
Cysteine
Molecular Biology Techniques
Representation (mathematics)
Molecular Biology
Ecology, Evolution, Behavior and Systematics
Blood Cells
Organic Chemistry
T-cell receptor
Chemical Compounds
Computational Biology
Biology and Life Sciences
Proteins
Cancers and Neoplasms
Cell Biology
Gene rearrangement
Complementarity Determining Regions
Autoencoder
Hierarchical clustering
T Cell Receptors
030104 developmental biology
Electronics
Gene Rearrangement, alpha-Chain T-Cell Antigen Receptor
Software
Cloning
Subjects
Details
- ISSN :
- 15537358
- Volume :
- 17
- Database :
- OpenAIRE
- Journal :
- PLOS Computational Biology
- Accession number :
- edsair.doi.dedup.....aaf565acf77714b54c490d035dc2e126
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1009225