1. Comparative genomic analysis of clinical Candida glabrata isolates identifies multiple polymorphic loci that can improve existing multilocus sequence typing strategy
- Author
-
D. Batra, Toni Gabaldón, Amir Arastehfar, M. Marcet-Houben, E. Shor, D.S. Perlin, Saad J. Taj-Aldeen, F. Daneshnia, Shawn R. Lockhart, and Barcelona Supercomputing Center
- Subjects
Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC] ,Sequence type ,QH301-705.5 ,Antifungal drug ,Single-nucleotide polymorphism ,Candida glabrata ,Computational biology ,Biology ,Genome ,03 medical and health sciences ,Typing ,Biology (General) ,Indel ,030304 developmental biology ,Whole genome sequencing ,0303 health sciences ,Whole-genome sequencing ,030306 microbiology ,Càndida ,Comparative genomics ,biology.organism_classification ,bacterial infections and mycoses ,Agricultural and Biological Sciences (miscellaneous) ,3. Good health ,Genòmica ,Drug resistance ,Multilocus sequence typing ,Research Paper ,MLST - Abstract
Candida glabrata is the second leading cause of candidemia in many countries and is one of the most concerning yeast species of nosocomial importance due to its increasing rate of antifungal drug resistance and emerging multidrug-resistant isolates. Application of multilocus sequence typing (MLST) to clinical C. glabrata isolates revealed an association of certain sequence types (STs) with drug resistance and mortality. The current C. glabrata MLST scheme is based on single nucleotide polymorphisms (SNPs) at six loci and is therefore relatively laborious and costly. Furthermore, only a few high-quality C. glabrata reference genomes are available, limiting rapid analysis of clinical isolates by whole genome sequencing. In this study we provide long-read based assemblies for seven additional clinical strains belonging to three different STs and use this information to simplify the C. glabrata MLST scheme. Specifically, a comparison of these genomes identified highly polymorphic loci (HPL) defined by frequent insertions and deletions (indels), two of which proved to be highly resolutive for ST. When challenged with 53 additional isolates, a combination of TRP1 (a component of the current MLST scheme) with either of the two HPL fully recapitulated ST identification. Therefore, our comparative genomic analysis identified a new typing approach combining SNPs and indels and based on only two loci, thus significantly simplifying ST identification in C. glabrata. Because typing tools are instrumental in addressing numerous clinical and biological questions, our new MLST scheme can be used for high throughput typing of C. glabrata in clinical and research settings. We thank Dibyendu Kumar (Rutgers University) for help with C. glabrata PacBio sequencing. This work was supported by NIH 5R01AI109025 to D.S.P. TG group acknowledges support from the Spanish Ministry of Science and Innovation for grant PGC2018-099921-B-I00, cofounded by European Regional Development Fund (ERDF); from the Catalan Research Agency (AGAUR) SGR423; from the European Union's Horizon 2020 research and innovation programme (ERC-2016-724173); from the Gordon and Betty Moore Foundation (Grant GBMF9742) and from the Instituto de Salud Carlos III (INB Grant PT17/0009/0023 – ISCIII-SGEFI/ERDF).
- Published
- 2021