Back to Search
Start Over
Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types
- Source :
- Nature Communications, Vol 8, Iss 1, Pp 1-10 (2017), Nature Communications, Nature Communications, 8
- Publication Year :
- 2017
- Publisher :
- Springer Science and Business Media LLC, 2017.
-
Abstract
- Mass cytometry allows high-resolution dissection of the cellular composition of the immune system. However, the high-dimensionality, large size, and non-linear structure of the data poses considerable challenges for the data analysis. In particular, dimensionality reduction-based techniques like t-SNE offer single-cell resolution but are limited in the number of cells that can be analyzed. Here we introduce Hierarchical Stochastic Neighbor Embedding (HSNE) for the analysis of mass cytometry data sets. HSNE constructs a hierarchy of non-linear similarities that can be interactively explored with a stepwise increase in detail up to the single-cell level. We apply HSNE to a study on gastrointestinal disorders and three other available mass cytometry data sets. We find that HSNE efficiently replicates previous observations and identifies rare cell populations that were previously missed due to downsampling. Thus, HSNE removes the scalability limit of conventional t-SNE analysis, a feature that makes it highly suitable for the analysis of massive high-dimensional data sets.<br />Single cell profiling yields high dimensional data of very large numbers of cells, posing challenges of visualization and analysis. Here the authors introduce a method for analysis of mass cytometry data that can handle very large datasets and allows their intuitive and hierarchical exploration.
- Subjects :
- CD4-Positive T-Lymphocytes
0301 basic medicine
Databases, Factual
Gastrointestinal Diseases
Computer science
Science
Immunology
Cytological Techniques
General Physics and Astronomy
Bioinformatics
Article
General Biochemistry, Genetics and Molecular Biology
03 medical and health sciences
Antigens, CD
T-Lymphocyte Subsets
Feature (machine learning)
Humans
Mass cytometry
Lymphocytes
Limit (mathematics)
lcsh:Science
Image Cytometry
Stochastic Processes
Multidisciplinary
Hierarchy (mathematics)
business.industry
Dimensionality reduction
Pattern recognition
General Chemistry
Flow Cytometry
Computational biology and bioinformatics
030104 developmental biology
Scalability
Embedding
lcsh:Q
Artificial intelligence
Single-Cell Analysis
business
Algorithms
Biomarkers
Subjects
Details
- ISSN :
- 20411723
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- Nature Communications
- Accession number :
- edsair.doi.dedup.....be295722737badab78d208c3f9be5c46
- Full Text :
- https://doi.org/10.1038/s41467-017-01689-9