Exon probe sets and bioinformatics pipelines for all levels of fish phylogenomics
- Source :
- Molecular Ecology Resources. 21:816-833
- Publication Year :
- 2020
- Publisher :
- Wiley, 2020.
Abstract
- Exon markers have a long history of use in phylogenetics of ray-finned fishes, the most diverse clade of vertebrates with more than 35,000 species. As the number of published genomes increases, it has become easier to test exons and other genetic markers for signals of ancient duplication events and filter out paralogues that can mislead phylogenetic analysis. We present seven new probe sets for current target-capture phylogenomic protocols that capture 1,104 exons explicitly filtered for paralogues using gene trees. These seven probe sets span the diversity of teleost fishes, including four sets that target five hyperdiverse percomorph clades which together comprise ca. 17,000 species (Carangaria, Ovalentaria, Eupercaria, and Syngnatharia + Pelagiaria combined). We additionally included probes to capture legacy nuclear exons and mitochondrial markers that have been commonly used in fish phylogenetics (despite some exons being flagged for paralogues) to facilitate integration of old and new molecular phylogenetic matrices. We tested these probes experimentally for 56 fish species (eight species per probe set) and merged new exon-capture sequence data into an existing data matrix of 1,104 exons and 300 ray-finned fish species. We provide an optimized bioinformatics pipeline to assemble exon capture data from raw reads to alignments for downstream analysis. We show that legacy loci with known paralogues are at risk of assembling duplicated sequences with target-capture, but we also assembled many useful orthologous sequences that can be integrated with many PCR-generated matrices. These probe sets are a valuable resource for advancing fish phylogenomics because targeted exons can easily be extracted from increasingly available whole genome and transcriptome data sets, and also may be integrated with existing PCR-based exon and mitochondrial data.
- Subjects :
- 0106 biological sciences
0301 basic medicine
Biology
Bioinformatics
010603 evolutionary biology
01 natural sciences
Genome
Evolution, Molecular
03 medical and health sciences
Exon
Phylogenetics
Phylogenomics
Gene duplication
Genetics
Animals
14. Life underwater
Clade
Phylogeny
Ecology, Evolution, Behavior and Systematics
030304 developmental biology
0303 health sciences
Phylogenetic tree
Fishes
Computational Biology
Exons
030104 developmental biology
Genetic marker
Biotechnology
Details
- ISSN :
- 1755-0998 and 1755-098X
- Volume :
- 21
- Database :
- OpenAIRE
- Journal :
- Molecular Ecology Resources
- Accession number :
- edsair.doi.dedup.....08078e6c082d91db8151e2cf2628b7ae