Back to Search Start Over

A comprehensive phylogenomic platform for exploring the angiosperm tree of life [Dataset]

Authors :
Calleva Foundation
Dr Mortimer and Theresa Sackler Foundation
Garfield Weston Foundation
Baker, William J.
Bailey, Paul
Barber, Vanessa
Barker, Abigail
Bellot, Sidonie
Bishop, David
Botigué, Laura R.
Brewer, Grace
Carruthers, Tom
Clarkson, James
Cook, Jeffrey
Cowan, Robyn
Dodsworth, Steven
Epitawalage, Niroshini
Françoso, Elaine
Gallego, Berta
Johnson, Matthew
Kim, Jan
Leempoel, Kevin
Maurin, Olivier
McGinnie, Catherine
Pokorny, Lisa
Roy, Shyamali
Stone, Malcolm
Toledo, Eduardo
Wickett, Norman
Zuntini, Alexandre
Eiserhardt, Wolf
Kersey, Paul
Leitch, Ilia J.
Forest, Félix
Calleva Foundation
Dr Mortimer and Theresa Sackler Foundation
Garfield Weston Foundation
Baker, William J.
Bailey, Paul
Barber, Vanessa
Barker, Abigail
Bellot, Sidonie
Bishop, David
Botigué, Laura R.
Brewer, Grace
Carruthers, Tom
Clarkson, James
Cook, Jeffrey
Cowan, Robyn
Dodsworth, Steven
Epitawalage, Niroshini
Françoso, Elaine
Gallego, Berta
Johnson, Matthew
Kim, Jan
Leempoel, Kevin
Maurin, Olivier
McGinnie, Catherine
Pokorny, Lisa
Roy, Shyamali
Stone, Malcolm
Toledo, Eduardo
Wickett, Norman
Zuntini, Alexandre
Eiserhardt, Wolf
Kersey, Paul
Leitch, Ilia J.
Forest, Félix
Publication Year :
2022

Abstract

The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this paper are to (i) document our methods, (ii) describe our first data release and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic dataset for angiosperms to date, comprising 3,099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96%) and 2,333 genera (17%). A “first pass” angiosperm tree of life was inferred from the data, which totalled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated dataset, species tree and all intermediates are openly accessible via the Kew Tree of Life Explo

Details

Database :
OAIster
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1395212709
Document Type :
Electronic Resource