Back to Search Start Over

Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe

Authors :
Gerry Tonkin-Hill
Rebecca A Gladstone
Anna K Pöntinen
Sergio Arredondo-Alonso
Stephen D Bentley
Jukka Corander
Helsinki Institute for Information Technology
Jukka Corander / Principal Investigator
Department of Mathematics and Statistics
Department of Computer Science
Publication Year :
2022
Publisher :
Cold Spring Harbor Laboratory, 2022.

Abstract

Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on Generalised Linear Regression that is robust to population structure, sampling bias and errors in the predicted presence/absence of genes. We demonstrate using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analysing several diverse bacterial genome datasets representing major human pathogens. Panstripe is freely available as an R package at https://github.com/gtonkinhill/panstripe.

Details

Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....38f8028f37a088656b5b0a9b35de4707
Full Text :
https://doi.org/10.1101/2022.04.23.489244