Back to Search Start Over

Predicting amphibian intraspecific diversity with machine learning: Challenges and prospects for integrating traits, geography, and genetic data.

Authors :
Barrow LN
Masiero da Fonseca E
Thompson CEP
Carstens BC
Source :
Molecular ecology resources [Mol Ecol Resour] 2021 Nov; Vol. 21 (8), pp. 2818-2831. Date of Electronic Publication: 2020 Dec 14.
Publication Year :
2021

Abstract

The growing availability of genetic data sets, in combination with machine learning frameworks, offers great potential to answer long-standing questions in ecology and evolution. One such question has intrigued population geneticists, biogeographers, and conservation biologists: What factors determine intraspecific genetic diversity? This question is challenging to answer because many factors may influence genetic variation, including life history traits, historical influences, and geography, and the relative importance of these factors varies across taxonomic and geographic scales. Furthermore, interpreting the influence of numerous, potentially correlated variables is difficult with traditional statistical approaches. To address these challenges, we analysed repurposed data using machine learning and investigated predictors of genetic diversity, focusing on Nearctic amphibians as a case study. We aggregated species traits, range characteristics, and >42,000 genetic sequences for 299 species using open-access scripts and various databases. After identifying important predictors of nucleotide diversity with random forest regression, we conducted follow-up analyses to examine the roles of phylogenetic history, geography, and demographic processes on intraspecific diversity. Although life history traits were not important predictors for this data set, we found significant phylogenetic signal in genetic diversity within amphibians. We also found that salamander species at northern latitudes contained low genetic diversity. Data repurposing and machine learning provide valuable tools for detecting patterns with relevance for conservation, but concerted efforts are needed to compile meaningful data sets with greater utility for understanding global biodiversity.<br /> (© 2020 John Wiley & Sons Ltd.)

Details

Language :
English
ISSN :
1755-0998
Volume :
21
Issue :
8
Database :
MEDLINE
Journal :
Molecular ecology resources
Publication Type :
Academic Journal
Accession number :
33249725
Full Text :
https://doi.org/10.1111/1755-0998.13303