Back to Search
Start Over
When are pathogen genome sequences informative of transmission events?
- Source :
- PLOS Pathogens, PLoS Pathogens, PLoS Pathogens, Vol 14, Iss 2, p e1006885 (2018)
- Publication Year :
- 2018
- Publisher :
- Public Library of Science (PLoS), 2018.
-
Abstract
- Recent years have seen the development of numerous methodologies for reconstructing transmission trees in infectious disease outbreaks from densely sampled whole genome sequence data. However, a fundamental and as of yet poorly addressed limitation of such approaches is the requirement for genetic diversity to arise on epidemiological timescales. Specifically, the position of infected individuals in a transmission tree can only be resolved by genetic data if mutations have accumulated between the sampled pathogen genomes. To quantify and compare the useful genetic diversity expected from genetic data in different pathogen outbreaks, we introduce here the concept of ‘transmission divergence’, defined as the number of mutations separating whole genome sequences sampled from transmission pairs. Using parameter values obtained by literature review, we simulate outbreak scenarios alongside sequence evolution using two models described in the literature to describe transmission divergence of ten major outbreak-causing pathogens. We find that while mean values vary significantly between the pathogens considered, their transmission divergence is generally very low, with many outbreaks characterised by large numbers of genetically identical transmission pairs. We describe the impact of transmission divergence on our ability to reconstruct outbreaks using two outbreak reconstruction tools, the R packages outbreaker and phybreak, and demonstrate that, in agreement with previous observations, genetic sequence data of rapidly evolving pathogens such as RNA viruses can provide valuable information on individual transmission events. Conversely, sequence data of pathogens with lower mean transmission divergence, including Streptococcus pneumoniae, Shigella sonnei and Clostridium difficile, provide little to no information about individual transmission events. Our results highlight the informational limitations of genetic sequence data in certain outbreak scenarios, and demonstrate the need to expand the toolkit of outbreak reconstruction tools to integrate other types of epidemiological data.<br />Author summary The increasing availability of genetic sequence data has sparked an interest in using pathogen whole genome sequences to reconstruct the history of individual transmission events in an infectious disease outbreak. However, such methodologies rely on pathogen genomes mutating rapidly enough to discriminate between infected individuals, an assumption that remains to be investigated. To determine pathogen outbreaks for which genetic data is expected to be informative of transmission events, we introduce here the concept of ‘transmission divergence’, defined as the number of mutations separating pathogen genome sequences sampled from transmission pairs. We characterise transmission divergence of ten major outbreak causing pathogens using simulations and find significant variation between diseases, with viral outbreaks generally exhibiting higher transmission divergence than bacterial ones. We reconstruct these outbreaks using the R-packages outbreaker and phybreak and find that genetic sequence data, though useful for rapidly evolving pathogens, provides little to no information about outbreaks with low transmission divergence, such as Streptococcus pneumoniae and Shigella sonnei. Our results demonstrate the need to incorporate other sources of outbreak data, such as contact tracing data and spatial location data, into outbreak reconstruction tools.
- Subjects :
- 0301 basic medicine
RNA viruses
EPIDEMIOLOGIC DATA
Epidemiology
Coronaviruses
Entropy
STREPTOCOCCUS-PNEUMONIAE
CLOSTRIDIUM-DIFFICILE INFECTION
Pathology and Laboratory Medicine
Genome
Klebsiella Pneumoniae
Disease Outbreaks
0302 clinical medicine
RESPIRATORY SYNDROME CORONAVIRUS
1108 Medical Microbiology
Klebsiella
Medicine and Health Sciences
030212 general & internal medicine
Biology (General)
Phylogeny
Data Management
MATHEMATICAL-THEORY
Physics
RESISTANT STAPHYLOCOCCUS-AUREUS
Chromosome Mapping
Phylogenetic Analysis
3. Good health
Bacterial Pathogens
Phylogenetics
1107 Immunology
Medical Microbiology
Genetic Epidemiology
Viruses
Physical Sciences
Thermodynamics
MYCOBACTERIUM-TUBERCULOSIS
KLEBSIELLA-PNEUMONIAE
Pathogens
Life Sciences & Biomedicine
0605 Microbiology
Research Article
Computer and Information Sciences
SARS coronavirus
QH301-705.5
Sequence analysis
Immunology
Genomics
Genome, Viral
Biology
Communicable Diseases
Microbiology
03 medical and health sciences
Virology
NOSOCOMIAL OUTBREAK
Genetic variation
Disease Transmission, Infectious
Genetics
Humans
Evolutionary Systematics
Genetic Predisposition to Disease
Microbial Pathogens
Molecular Biology
Taxonomy
Whole genome sequencing
Genetic diversity
Evolutionary Biology
Science & Technology
Bacteria
Base Sequence
Whole Genome Sequencing
Organisms
Viral pathogens
Outbreak
Biology and Life Sciences
Genetic Variation
Sequence Analysis, DNA
RC581-607
030104 developmental biology
Evolutionary biology
Communicable disease transmission
Parasitology
Immunologic diseases. Allergy
Genome, Bacterial
Subjects
Details
- ISSN :
- 15537374 and 15537366
- Volume :
- 14
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- PLOS Pathogens
- Accession number :
- edsair.doi.dedup.....1da1720df1e3dd6fe24e734bf675d53a
- Full Text :
- https://doi.org/10.1371/journal.ppat.1006885