Back to Search
Start Over
GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data
- Source :
- PeerJ, PeerJ, Vol 6, p e4600 (2018)
- Publication Year :
- 2018
- Publisher :
- PeerJ, 2018.
-
Abstract
- Normalization is the first critical step in microbiome sequencing data analysis used to account for variable library sizes. Current RNA-Seq based normalization methods that have been adapted for microbiome data fail to consider the unique characteristics of microbiome data, which contain a vast number of zeros due to the physical absence or under-sampling of the microbes. Normalization methods that specifically address the zero-inflation remain largely undeveloped. Here we propose geometric mean of pairwise ratios—a simple but effective normalization method—for zero-inflated sequencing data such as microbiome data. Simulation studies and real datasets analyses demonstrate that the proposed method is more robust than competing methods, leading to more powerful detection of differentially abundant taxa and higher reproducibility of the relative abundances of taxa.
- Subjects :
- 0301 basic medicine
Normalization (statistics)
Bioinformatics
Computer science
Zero inflation
Sequencing data
lcsh:Medicine
RNA-Seq
Computational biology
General Biochemistry, Genetics and Molecular Biology
03 medical and health sciences
Quantitative Biology::Populations and Evolution
Microbiome
General Neuroscience
lcsh:R
Statistics
Genomics
General Medicine
Quantitative Biology::Genomics
Normalization
Quantitative Biology::Quantitative Methods
030104 developmental biology
Metagenomics
Pairwise comparison
RNA-seq
General Agricultural and Biological Sciences
Zero-inflation
Count data
Subjects
Details
- ISSN :
- 21678359
- Volume :
- 6
- Database :
- OpenAIRE
- Journal :
- PeerJ
- Accession number :
- edsair.doi.dedup.....b7b0c36ed1b109f32b1c656d0b187c49
- Full Text :
- https://doi.org/10.7717/peerj.4600