Back to Search Start Over

Normalization of gene counts affects principal components-based exploratory analysis of RNA-sequencing data.

Authors :
van Lingen HJ
Suarez-Diez M
Saccenti E
Source :
Biochimica et biophysica acta. Gene regulatory mechanisms [Biochim Biophys Acta Gene Regul Mech] 2024 Dec; Vol. 1867 (4), pp. 195058. Date of Electronic Publication: 2024 Aug 16.
Publication Year :
2024

Abstract

Normalization of gene expression count data is an essential step of in the analysis of RNA-sequencing data. Its statistical analysis has been mostly addressed in the context of differential expression analysis, that is in the univariate setting. However, relationships among genes and samples are better explored and quantified using multivariate exploratory data analysis tools like Principal Component Analysis (PCA). In this study we investigate how normalization impacts the PCA model and its interpretation, considering twelve different widely used normalization methods that were applied on simulated and experimental data. Correlation patterns in the normalized data were explored using both summary statistics and Covariance Simultaneous Component Analysis. The impact of normalization on the PCA solution was assessed by exploring the model complexity, the quality of sample clustering in the low-dimensional PCA space and gene ranking in the model fit to normalized data. PCA models upon normalization were interpreted in the context gene enrichment pathway analysis. We found that although PCA score plots are often similar independently form the normalization used, biological interpretation of the models can depend heavily on the normalization method applied.<br />Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.<br /> (Copyright © 2024 The Authors. Published by Elsevier B.V. All rights reserved.)

Details

Language :
English
ISSN :
1876-4320
Volume :
1867
Issue :
4
Database :
MEDLINE
Journal :
Biochimica et biophysica acta. Gene regulatory mechanisms
Publication Type :
Academic Journal
Accession number :
39154857
Full Text :
https://doi.org/10.1016/j.bbagrm.2024.195058