1. An OMICs-based meta-analysis to support infection state stratification
- Author
-
Jonathan David, Phillippa Spencer, David Rushton, Simon Perkins, Philipp Antczak, Andrew R. Jones, and Ashleigh C. Myall
- Subjects
Statistics and Probability ,AcademicSubjects/SCI01060 ,medicine.drug_class ,Antibiotics ,Gene Expression ,Computational biology ,Disease ,Biology ,Biochemistry ,03 medical and health sciences ,0302 clinical medicine ,Antibiotic resistance ,medicine ,Molecular Biology ,Gene ,030304 developmental biology ,Supplementary data ,0303 health sciences ,Omics ,Original Papers ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,030220 oncology & carcinogenesis ,Meta-analysis ,DNA microarray - Abstract
Motivation A fundamental problem for disease treatment is that while antibiotics are a powerful counter to bacteria, they are ineffective against viruses. Often, bacterial and viral infections are confused due to their similar symptoms and lack of rapid diagnostics. With many clinicians relying primarily on symptoms for diagnosis, overuse and misuse of modern antibiotics are rife, contributing to the growing pool of antibiotic resistance. To ensure an individual receives optimal treatment given their disease state and to reduce over-prescription of antibiotics, the host response can in theory be measured quickly to distinguish between the two states. To establish a predictive biomarker panel of disease state (viral/bacterial/no-infection), we conducted a meta-analysis of human blood infection studies using machine learning. Results We focused on publicly available gene expression data from two widely used platforms, Affymetrix and Illumina microarrays as they represented a significant proportion of the available data. We were able to develop multi-class models with high accuracies with our best model predicting 93% of bacterial and 89% viral samples correctly. To compare the selected features in each of the different technologies, we reverse-engineered the underlying molecular regulatory network and explored the neighbourhood of the selected features. The networks highlighted that although on the gene-level the models differed, they contained genes from the same areas of the network. Specifically, this convergence was to pathways including the Type I interferon Signalling Pathway, Chemotaxis, Apoptotic Processes and Inflammatory/Innate Response. Availability Data and code are available on the Gene Expression Omnibus and github. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2021