Back to Search
Start Over
A biochemically-interpretable machine learning classifier for microbial GWAS
- Source :
- Nature Communications, Vol 11, Iss 1, Pp 1-11 (2020), Nature communications, vol 11, iss 1, Nature Communications
- Publication Year :
- 2020
- Publisher :
- Nature Publishing Group, 2020.
-
Abstract
- Current machine learning classifiers have successfully been applied to whole-genome sequencing data to identify genetic determinants of antimicrobial resistance (AMR), but they lack causal interpretation. Here we present a metabolic model-based machine learning classifier, named Metabolic Allele Classifier (MAC), that uses flux balance analysis to estimate the biochemical effects of alleles. We apply the MAC to a dataset of 1595 drug-tested Mycobacterium tuberculosis strains and show that MACs predict AMR phenotypes with accuracy on par with mechanism-agnostic machine learning models (isoniazid AUC = 0.93) while enabling a biochemical interpretation of the genotype-phenotype map. Interpretation of MACs for three antibiotics (pyrazinamide, para-aminosalicylic acid, and isoniazid) recapitulates known AMR mechanisms and suggest a biochemical basis for how the identified alleles cause AMR. Extending flux balance analysis to identify accurate sequence classifiers thus contributes mechanistic insights to GWAS, a field thus far dominated by mechanism-agnostic results.<br />Current machine learning classifiers have been applied to whole-genome sequencing data to identify determinants of antimicrobial resistance, but they lack interpretability. Here the authors present a metabolic machine learning classifier that uses flux balance analysis to estimate the biochemical effects of alleles.
- Subjects :
- 0301 basic medicine
Computer science
Mathematics and computing
Science
030106 microbiology
Sequencing data
Drug Resistance
General Physics and Astronomy
Genome-wide association study
Computational biology
General Biochemistry, Genetics and Molecular Biology
Article
Machine Learning
03 medical and health sciences
Microbial
Drug Resistance, Bacterial
Genetics
Isoniazid
2.1 Biological and endogenous factors
Tuberculosis
Aetiology
Allele
lcsh:Science
Genome
Multidisciplinary
Learning classifier system
Biochemical networks
Human Genome
Bacterial
Reproducibility of Results
General Chemistry
Mycobacterium tuberculosis
Aminosalicylic Acid
Pyrazinamide
Flux balance analysis
Anti-Bacterial Agents
Genome, Microbial
Good Health and Well Being
030104 developmental biology
Metabolic Model
lcsh:Q
Classifier (UML)
Microbial genetics
Genome, Bacterial
Genome-Wide Association Study
Subjects
Details
- Language :
- English
- ISSN :
- 20411723
- Volume :
- 11
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- Nature Communications
- Accession number :
- edsair.doi.dedup.....4595ef1d3d0e5085e604fef93bfc4a1d