1. Normalization methods for reducing interbatch effect without quality control samples in liquid chromatography-mass spectrometry-based studies
- Author
-
Vladimir Frankevich, Vitaliy Chagovets, Alisa O. Tokareva, Eugene N. Nikolaev, Natalia L. Starodubtseva, and Alexey S. Kononikhin
- Subjects
Normalization (statistics) ,010401 analytical chemistry ,02 engineering and technology ,021001 nanoscience & nanotechnology ,01 natural sciences ,Biochemistry ,Autoscaling ,0104 chemical sciences ,Analytical Chemistry ,Database normalization ,Liquid chromatography–mass spectrometry ,Principal component analysis ,Statistics ,Range (statistics) ,Pairwise comparison ,0210 nano-technology ,Quantile normalization ,Mathematics - Abstract
Data normalization is an essential part of a large-scale untargeted mass spectrometry metabolomics analysis. Autoscaling, Pareto scaling, range scaling, and level scaling methods for liquid chromatography-mass spectrometry data processing were compared with the most common normalization methods, including quantile normalization, probabilistic quotient normalization, and variance stabilizing normalization. These methods were tested on eight datasets from various clinical studies. The efficiency of the data normalization was assessed by the distance between clusters corresponding to batches and the distance between clusters corresponding to clinical groups in the space of principal components, as well as by the number of features with a pairwise statistically significant difference between the batches and the number of features with a pairwise statistically significant difference between clinical groups. Autoscaling demonstrated the most effective reduction in interbatch variation and can be preferable to probabilistic quotient or quantile normalization in liquid chromatography-mass spectrometry data.
- Published
- 2021