Back to Search
Start Over
Representing Genetic Determinants in Bacterial GWAS with Compacted De Bruijn Graphs
- Publication Year :
- 2017
- Publisher :
- Cold Spring Harbor Laboratory, 2017.
-
Abstract
- MotivationAntimicrobial resistance has become a major worldwide public health concern, calling for a better characterization of existing and novel resistance mechanisms. GWAS methods applied to bacterial genomes have shown encouraging results for new genetic marker discovery. Most existing approaches either look at SNPs obtained by sequence alignment or consider sets of kmers, whose presence in the genome is associated with the phenotype of interest. While the former approach can only be performed when genomes are similar enough for an alignment to make sense, the latter can lead to redundant descriptions and to results which are hard to interpret.ResultsWe propose an alignment-free GWAS method detecting haplotypes of variable length associated to resistance, using compacted De Bruijn graphs. Our representation is flexible enough to deal with very plastic genomes subject to gene transfers while drastically reducing the number of features to explore compared to kmers, without loss of information. It accomodates polymorphisms in core genes, accessory genes and noncoding regions. Using our representation in a GWAS leads to the selection of a small number of entities which are easier to visualize and interpret than fixed-length kmers. We illustrate the benefit of our approach by describing known as well as potential novel determinants of antimicrobial resistance in P. aeruginosa, a pathogenic bacteria with a highly plastic genome.Availability and implementationThe code and data used in the experiments will be made available upon acceptance of this manuscript.Contactmagali.dancette@biomerieux.com
- Subjects :
- De Bruijn sequence
Genetics
0303 health sciences
Small number
Sequence alignment
Genome-wide association study
Bacterial genome size
Computational biology
Biology
Genome
03 medical and health sciences
0302 clinical medicine
Gene
030217 neurology & neurosurgery
Selection (genetic algorithm)
030304 developmental biology
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....9128057021b0f9e01a085aedc8be5acf
- Full Text :
- https://doi.org/10.1101/113563