Back to Search Start Over

Tracing the genealogy origin of geographic populations based on genomic variation and deep learning.

Authors :
Yang B
Zhou X
Liu S
Source :
Molecular phylogenetics and evolution [Mol Phylogenet Evol] 2024 Sep; Vol. 198, pp. 108142. Date of Electronic Publication: 2024 Jul 02.
Publication Year :
2024

Abstract

Assigning a query individual animal or plant to its derived population is a prime task in diverse applications related to organismal genealogy. Such endeavors have conventionally relied on short DNA sequences under a phylogenetic framework. These methods naturally show constraints when the inferred population sources are ambiguously phylogenetically structured, a scenario demanding substantially more informative genetic signals. Recent advances in cost-effective production of whole-genome sequences and artificial intelligence have created an unprecedented opportunity to trace the population origin for essentially any given individual, as long as the genome reference data are comprehensive and standardized. Here, we developed a convolutional neural network method to identify population origins using genomic SNPs. Three empirical datasets (an Asian honeybee, a red fire ant, and a chicken datasets) and two simulated populations are used for the proof of concepts. The performance tests indicate that our method can accurately identify the genealogy origin of query individuals, with success rates ranging from  93 % to 100 %. We further showed that the accuracy of the model can be significantly increased by refining the informative sites through F <subscript>ST</subscript> filtering. Our method is robust to configurations related to batch sizes and epochs, whereas model learning benefits from the setting of a proper preset learning rate. Moreover, we explained the importance score of key sites for algorithm interpretability and credibility, which has been largely ignored. We anticipate that by coupling genomics and deep learning, our method will see broad potential in conservation and management applications that involve natural resources, invasive pests and weeds, and illegal trades of wildlife products.<br />Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships in this paper.<br /> (Copyright © 2024 Elsevier Inc. All rights reserved.)

Details

Language :
English
ISSN :
1095-9513
Volume :
198
Database :
MEDLINE
Journal :
Molecular phylogenetics and evolution
Publication Type :
Academic Journal
Accession number :
38964594
Full Text :
https://doi.org/10.1016/j.ympev.2024.108142