Back to Search
Start Over
Imputation of Spatially-resolved Transcriptomes by Graph-regularized Tensor Completion
- Source :
- PLoS Computational Biology, Vol 17, Iss 4, p e1008218 (2021), PLoS Computational Biology
- Publication Year :
- 2020
- Publisher :
- Cold Spring Harbor Laboratory, 2020.
-
Abstract
- High-throughput spatial-transcriptomics RNA sequencing (sptRNA-seq) based on in-situ capturing technologies has recently been developed to spatially resolve transcriptome-wide mRNA expressions mapped to the captured locations in a tissue sample. Due to the low RNA capture efficiency by in-situ capturing and the complication of tissue section preparation, sptRNA-seq data often only provides an incomplete profiling of the gene expressions over the spatial regions of the tissue. In this paper, we introduce a graph-regularized tensor completion model for imputing the missing mRNA expressions in sptRNA-seq data, namely FIST, Fast Imputation of Spatially-resolved transcriptomes by graph-regularized Tensor completion. We first model sptRNA-seq data as a 3-way sparse tensor in genes (p-mode) and the (x, y) spatial coordinates (x-mode and y-mode) of the observed gene expressions, and then consider the imputation of the unobserved entries or fibers as a tensor completion problem in Canonical Polyadic Decomposition (CPD) form. To improve the imputation of highly sparse sptRNA-seq data, we also introduce a protein-protein interaction network to add prior knowledge of gene functions, and a spatial graph to capture the the spatial relations among the capture spots. The tensor completion model is then regularized by a Cartesian product graph of protein-protein interaction network and the spatial graph to capture the high-order relations in the tensor. In the experiments, FIST was tested on ten 10x Genomics Visium spatial transcriptomic datasets of different tissue sections with cross-validation among the known entries in the imputation. FIST significantly outperformed the state-of-the-art methods for single-cell RNAseq data imputation. We also demonstrate that both the spatial graph and PPI network play an important role in improving the imputation. In a case study, we further analyzed the gene clusters obtained from the imputed gene expressions to show that the imputations by FIST indeed capture the spatial characteristics in the gene expressions and reveal functions that are highly relevant to three different kinds of tissues in mouse kidney.<br />Author summary Biological tissues are composed of different types of structurally organized cell units playing distinct functional roles. The exciting new spatial gene expression profiling methods have enabled the analysis of spatially resolved transcriptomes to understand the spatial and functional characteristics of these cells in the context of eco-environment of tissue. Due to the technical limitations, spatial transcriptomics data suffers from only sparsely measured mRNAs by in-situ capture and possibly missing spots in tissue regions that entirely failed fixing and permeabilizing RNAs. Our method, FIST (Fast Imputation of Spatially-resolved transcriptomes by graph-regularized Tensor completion), focuses on the spatial and high-sparsity nature of spatial transcriptomics data by modeling the data as a 3-way gene-by-(x, y)-location tensor and a product graph of a spatial graph and a protein-protein interaction network. Our comprehensive evaluation of FIST on ten 10x Genomics Visium spatial genomics datasets and comparison with the methods for single-cell RNA sequencing data imputation demonstrate that FIST is a better method more suitable for spatial gene expression imputation. Overall, we found FIST a useful new method for analyzing spatially resolved gene expressions based on novel modeling of spatial and functional information.
- Subjects :
- Proteomics
Metabolic Processes
0301 basic medicine
Fist
Computer science
Gene Identification and Analysis
Datasets as Topic
Gene Expression
Genetic Networks
Kidney
Biochemistry
Mice
Sequencing techniques
0302 clinical medicine
Spatial reference system
Tensor (intrinsic definition)
Medicine and Health Sciences
Imputation (statistics)
Biology (General)
Mammalian Genomics
Ecology
RNA sequencing
Genomics
Cartesian product
Quantitative Biology::Genomics
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Modeling and Simulation
symbols
Graph (abstract data type)
Protein Interaction Networks
Anatomy
Transcriptome Analysis
Algorithms
Network Analysis
Research Article
Computer and Information Sciences
QH301-705.5
03 medical and health sciences
Cellular and Molecular Neuroscience
symbols.namesake
Interaction network
Genetics
Animals
Tensor
Molecular Biology
Ecology, Evolution, Behavior and Systematics
Sequence Analysis, RNA
business.industry
Gene Expression Profiling
Biology and Life Sciences
Computational Biology
Kidneys
Pattern recognition
Renal System
Genome Analysis
Research and analysis methods
Gene expression profiling
Spatial relation
Molecular biology techniques
Metabolism
030104 developmental biology
Animal Genomics
Artificial intelligence
Transcriptome
business
Imputation (genetics)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- PLoS Computational Biology, Vol 17, Iss 4, p e1008218 (2021), PLoS Computational Biology
- Accession number :
- edsair.doi.dedup.....2b744f431b44b52be52444f56a4519b3