Back to Search
Start Over
Implementation of quality controls is essential to prevent batch effects in breathomics data and allow for cross-study comparisons
- Source :
- Journal of Breath Research, 14(2):026012. IOP Publishing Ltd.
- Publication Year :
- 2020
- Publisher :
- IOP Publishing Ltd., 2020.
-
Abstract
- Exhaled breath analysis has become a promising monitoring tool for various ailments by identifying volatile organic compounds (VOCs) as indicative biomarkers excreted in the human body. Throughout the process of sampling, measuring, and data processing, non-biological variations are introduced in the data leading to batch effects. Algorithmic approaches have been developed to cope with within-study batch effects. Batch differences, however, may occur among different studies too, and up-to-date, ways to correct for cross-study batch effects are lacking; ultimately, cross-study comparisons to verify the uniqueness of found VOC profiles for a specific disease may be challenging. This study applies within-study batch-effect-correction approaches to correct for cross-study batch effects; suggestions are made that may help prevent the introduction of cross-study variations. Three batch-effect-correction algorithms were investigated: zero-centering, combat, and the analysis of covariance framework. The breath samples were collected from inflammatory bowel disease ([Formula: see text]), chronic liver disease ([Formula: see text]), and irritable bowel syndrome ([Formula: see text]) patients at different periods, and they were analysed via gas chromatography-mass spectrometry. Multivariate statistics were used to visualise and verify the results. The visualisation of the data before any batch-effect-correction technique was applied showed a clear distinction due to probable batch effects among the datasets of the three cohorts. The visualisation of the three datasets after implementing all three correction techniques showed that the batch effects were still present in the data. Predictions made using partial least squares discriminant analysis and random forest confirmed this observation. The within-study batch-effect-correction approaches fail to correct for cross-study batch effects present in the data. The present study proposes a framework for systematically standardising future breathomics data by using internal standards or quality control samples at regular analysis intervals. Further knowledge regarding the nature of the unsolicited variations among cross-study batches must be obtained to move the field further.
- Subjects :
- Male
Multivariate statistics
IRRITABLE-BOWEL-SYNDROME
Computer science
PARTIAL LEAST-SQUARES
PREDICTION
data analysis
computer.software_genre
01 natural sciences
DISEASE
Irritable Bowel Syndrome
0302 clinical medicine
volatile organic compounds
Partial least squares regression
GENE-EXPRESSION
Analysis of covariance
Principal Component Analysis
Liver Diseases
Discriminant Analysis
Sampling (statistics)
Reference Standards
Random forest
Breath Tests
exhaled breath
Exhalation
Principal component analysis
Female
Data mining
Algorithms
Quality Control
Pulmonary and Respiratory Medicine
liver cirrhosis
IBD
VOLATILE ORGANIC-COMPOUNDS
DIAGNOSIS
Gas Chromatography-Mass Spectrometry
CLASSIFICATION
03 medical and health sciences
batch effects
IBS
Humans
Least-Squares Analysis
010401 analytical chemistry
Reproducibility of Results
VOCs
PERFORMANCE
Inflammatory Bowel Diseases
Linear discriminant analysis
0104 chemical sciences
030228 respiratory system
Breath gas analysis
Chronic Disease
MICROARRAY DATA
computer
Biomarkers
Subjects
Details
- Language :
- English
- ISSN :
- 17527163 and 17527155
- Volume :
- 14
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Journal of Breath Research
- Accession number :
- edsair.doi.dedup.....f46aaa2e1962edeaca3e6c31c7ec648a