Back to Search Start Over

Different reference genomes determine different results: Comparing SNP calling in RAD-seq of Engelhardia roxburghiana using different reference genomes.

Authors :
Huang, Pei-Han
Wang, Tian-Rui
Li, Min
Fang, Ou-Yan
Su, Ren-Ping
Meng, Hong-Hu
Song, Yi-Gang
Li, Jie
Source :
Plant Science. Jul2024, Vol. 344, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Advances in next-generation sequencing (NGS) have significantly reduced the cost and improved the efficiency of obtaining single nucleotide polymorphism (SNP) markers, particularly through restriction site-associated DNA sequencing (RAD-seq). Meanwhile, the progression in whole genome sequencing has led to the utilization of an increasing number of reference genomes in SNP calling processes. This study utilized RAD-seq data from 242 individuals of Engelhardia roxburghiana , a tropical tree of the walnut family (Juglandaceae), with SNP calling conducted using the STACKS pipeline. We aimed to compare both reference-based approaches, namely, employing a closely related species as the reference genome versus the species itself as the reference genome, to evaluate their respective merits and limitations. Our findings indicate a substantial discrepancy in the number of obtained SNPs between using a closely related species as opposed to the species itself as reference genomes, the former yielded approximately an order of magnitude fewer SNPs compared to the latter. While the missing rate of individuals and sites of the final SNPs obtained in the two scenarios showed no significant difference. The results showed that using the reference genome of the species itself tends to be prioritized in RAD-seq studies. However, if this is unavailable, considering closely related genomes is feasible due to their wide applicability and low missing rate as alternatives. This study contributes to enrich the understanding of the impact of SNP acquisition when utilizing different reference genomes. • This study contributes to enrich the knowledge concerning the influence of SNP acquisition when employing different reference genomes. • The SNP counts vary based on the choice of reference genomes, with using the species itself yielding optimal results for SNP calling. • Whole-genome sequencing is called for to obtain more accurate results. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01689452
Volume :
344
Database :
Academic Search Index
Journal :
Plant Science
Publication Type :
Academic Journal
Accession number :
177288166
Full Text :
https://doi.org/10.1016/j.plantsci.2024.112109