Back to Search
Start Over
On the viability of unsupervised T-cell receptor sequence clustering for epitope preference
- Source :
- Bioinformatics
- Publication Year :
- 2019
-
Abstract
- Motivation The T-cell receptor (TCR) is responsible for recognizing epitopes presented on cell surfaces. Linking TCR sequences to their ability to target specific epitopes is currently an unsolved problem, yet one of great interest. Indeed, it is currently unknown how dissimilar TCR sequences can be before they no longer bind the same epitope. This question is confounded by the fact that there are many ways to define the similarity between two TCR sequences. Here we investigate both issues in the context of TCR sequence unsupervised clustering. Results We provide an overview of the performance of various distance metrics on two large independent datasets with 412 and 2835 TCR sequences respectively. Our results confirm the presence of structural distinct TCR groups that target identical epitopes. In addition, we put forward several recommendations to perform unsupervised T-cell receptor sequence clustering. Availability and implementation Source code implemented in Python 3 available at https://github.com/pmeysman/TCRclusteringPaper. Supplementary information Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Source code
Computer science
media_common.quotation_subject
Receptors, Antigen, T-Cell
Context (language use)
chemical and pharmacologic phenomena
Computational biology
Biochemistry
Epitope
03 medical and health sciences
Epitopes
Similarity (network science)
Cluster Analysis
Molecular Biology
Biology
030304 developmental biology
Sequence (medicine)
Sequence clustering
media_common
Computer. Automation
0303 health sciences
030302 biochemistry & molecular biology
T-cell receptor
hemic and immune systems
Computer Science Applications
Computational Mathematics
Chemistry
Computational Theory and Mathematics
Human medicine
Engineering sciences. Technology
Software
Mathematics
Subjects
Details
- Language :
- English
- ISSN :
- 13674803
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....bcbeacecaaa97c74c461ece57a2358e3