1. scLENS: data-driven signal detection for unbiased scRNA-seq data analysis.
- Author
Kim, Hyun; Chang, Won; Chae, Seok Joo; Park, Jong-Eun; Seo, Minseok; Kim, Jae Kyoung
- Subjects
SIGNAL detection, SIGNAL filtering, RANDOM matrices, DATA analysis, SIGNALS & signaling
- Abstract
High dimensionality and noise have limited the new biological insights that can be discovered in scRNA-seq data. While dimensionality reduction tools have been developed to extract biological signals from the data, they often require manual determination of the signal dimension, which introduces user bias. Furthermore, a common data preprocessing method, log normalization, can unintentionally distort signals in the data. Here, we develop scLENS, a dimensionality reduction tool that circumvents the long-standing issues of signal distortion and manual input. Specifically, we identify the primary cause of signal distortion during log normalization and effectively address it by uniformizing cell vector lengths with L2 normalization. Furthermore, we utilize random matrix theory-based noise filtering and a signal robustness test to enable data-driven determination of the threshold for signal dimensions. Our method outperforms 11 widely used dimensionality reduction tools and performs particularly well for challenging scRNA-seq datasets with high sparsity and variability. To facilitate the use of scLENS, we provide a user-friendly package that automates accurate signal detection of scRNA-seq data without time-consuming manual tuning. Single-cell RNA sequencing data analysis is limited by noise and high dimensionality. Here, the authors present scLENS, a tool that automates accurate signal detection without manual input, particularly in complex datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
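
The two ideas named in the abstract, L2 normalization of cell vectors after log normalization and a random matrix theory-based noise threshold, can be illustrated with a minimal sketch. This is not the scLENS package or its API: the function names below are hypothetical, the Marchenko-Pastur upper edge is used as a textbook stand-in for the paper's noise filter, and the signal robustness test is not covered.

```python
import numpy as np


def log_normalize(counts, scale=1e4):
    """Library-size scaling followed by log(1 + x). Rows are cells, columns are genes."""
    totals = counts.sum(axis=1, keepdims=True)
    totals[totals == 0] = 1.0  # avoid dividing by zero for empty cells
    return np.log1p(counts / totals * scale)


def l2_normalize(x):
    """Rescale every cell (row) vector to unit Euclidean length."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # leave all-zero rows unchanged
    return x / norms


def count_signal_eigenvalues(x):
    """Count eigenvalues above the Marchenko-Pastur upper edge.

    Assumption: genes are standardized to zero mean and unit variance, so
    pure noise would give eigenvalues of Z Z^T / n_genes below
    (1 + sqrt(n_cells / n_genes))^2 in the large-matrix limit.
    """
    n_cells, n_genes = x.shape
    std = x.std(axis=0)
    std[std == 0] = 1.0  # constant genes carry no variance
    z = (x - x.mean(axis=0)) / std
    eigvals = np.linalg.eigvalsh(z @ z.T / n_genes)
    upper_edge = (1.0 + np.sqrt(n_cells / n_genes)) ** 2
    return int((eigvals > upper_edge).sum())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic counts for illustration only: 50 cells by 200 genes.
    counts = rng.poisson(5.0, size=(50, 200)).astype(float)
    x = l2_normalize(log_normalize(counts))
    print("row norms all 1:", np.allclose(np.linalg.norm(x, axis=1), 1.0))
    print("eigenvalues above noise edge:", count_signal_eigenvalues(x))
```

After `l2_normalize`, every cell vector has the same length, which is the uniformization the abstract describes. The eigenvalue count is only a rough, data-driven candidate for the signal dimension under the stated assumptions.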