Back to Search Start Over

GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data

Authors :
Li Chen
James Reeve
Lujun Zhang
Shengbing Huang
Xuefeng Wang
Jun Chen
Source :
PeerJ, Vol 6, p e4600 (2018)
Publication Year :
2018
Publisher :
PeerJ Inc., 2018.

Abstract

Normalization is the first critical step in microbiome sequencing data analysis used to account for variable library sizes. Current RNA-Seq based normalization methods that have been adapted for microbiome data fail to consider the unique characteristics of microbiome data, which contain a vast number of zeros due to the physical absence or under-sampling of the microbes. Normalization methods that specifically address the zero-inflation remain largely undeveloped. Here we propose geometric mean of pairwise ratios—a simple but effective normalization method—for zero-inflated sequencing data such as microbiome data. Simulation studies and real datasets analyses demonstrate that the proposed method is more robust than competing methods, leading to more powerful detection of differentially abundant taxa and higher reproducibility of the relative abundances of taxa.

Details

Language :
English
ISSN :
21678359
Volume :
6
Database :
Directory of Open Access Journals
Journal :
PeerJ
Publication Type :
Academic Journal
Accession number :
edsdoj.2ac19b9b93894ad2a7a79d6fa0da638a
Document Type :
article
Full Text :
https://doi.org/10.7717/peerj.4600