Back to Search
Start Over
Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
- Source :
- Molecular biology and evolution, Molecular Biology and Evolution, Molecular Biology and Evolution, 31 (5)
- Publication Year :
- 2014
- Publisher :
- ETH Zurich, 2014.
-
Abstract
- Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when using such methods to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome, introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads.<br />Molecular Biology and Evolution, 31 (5)<br />ISSN:0737-4038<br />ISSN:1537-1719
- Subjects :
- Fast Tracks
Pseudomonas syringae
Sequence alignment
Genomics
Computational biology
Biology
Polymorphism, Single Nucleotide
Genome
DNA sequencing
Evolution, Molecular
Phylogenetics
Genetics
Escherichia coli
Computer Simulation
Next-generation sequencing
Molecular Biology
Phylogeny
Ecology, Evolution, Behavior and Systematics
Sequence (medicine)
Likelihood Functions
Models, Genetic
Phylogenetic tree
High-Throughput Nucleotide Sequencing
Reproducibility of Results
Sequence Alignment
Algorithms
Genome, Bacterial
Sinorhizobium meliloti
Reference genome
Subjects
Details
- Language :
- English
- ISSN :
- 07374038 and 15371719
- Database :
- OpenAIRE
- Journal :
- Molecular biology and evolution, Molecular Biology and Evolution, Molecular Biology and Evolution, 31 (5)
- Accession number :
- edsair.doi.dedup.....cdc1630791dce4bd455be6ca52c38805
- Full Text :
- https://doi.org/10.3929/ethz-b-000094813