Mind the Gap: A Neural Network Framework for Imputing Genotypes in Non-Model Species.

Authors :
Bougiouri K
Source :
Molecular ecology resources [Mol Ecol Resour] 2025 Jan 03, pp. e14066. Date of Electronic Publication: 2025 Jan 03.
Publication Year :
2025
Publisher :
Ahead of Print

Abstract

Reduced representation sequencing (RRS) has proven to be a cost-effective solution for sequencing subsets of the genome in non-model species for large-scale studies. However, the targeted nature of RRS approaches commonly introduces large amounts of missing data, leading to reduced statistical power and biased estimates in downstream analyses. Genotype imputation, the statistical inference of missing sites across the genome, is a powerful way to overcome the caveats associated with missing sites. Typically, genotype imputation requires a reference panel of haplotypes; however, such a panel is not always available for non-model species. In this issue of Molecular Ecology Resources, Mora-Márquez et al. (2024) develop gtImputation, an unsupervised machine learning imputation tool with an interactive GUI, which leverages information from the underlying data structure itself, without the need for a reference panel. They show that their method performs as well as, and in some cases better than, existing haplotype-clustering and unsupervised machine learning algorithms, particularly for sites with low minor allele frequency (MAF) and for data sets with strong underlying population structure. This innovative framework adds to the ongoing efforts to expand the applicability of imputation to non-model species, making it possible to run a variety of analyses that require dense sets of markers while keeping sequencing costs low.
(© 2025 John Wiley & Sons Ltd.)

Details

Language :
English
ISSN :
1755-0998
Database :
MEDLINE
Journal :
Molecular ecology resources
Publication Type :
Academic Journal
Accession number :
39749403
Full Text :
https://doi.org/10.1111/1755-0998.14066