Back to Search Start Over

A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids

Authors :
Mohamed Salem
Yniv Palti
Paul A. Wheeler
Jianbo Yao
Roger L. Vallejo
Caird E. Rexroad
Gary H. Thorgaard
Michael R. Miller
Edwige Quillet
Guangtu Gao
National Center for Cool and Cold Water Aquaculture, ARS-USDA
USDA-ARS : Agricultural Research Service
Institute of Molecular Biology
University of Oregon [Eugene]
Department of Animal Science
University of California
School of Biological Sciences and Center for Reproductive Biology
Washington State University (WSU)
Génétique Animale et Biologie Intégrative (GABI)
Institut National de la Recherche Agronomique (INRA)-AgroParisTech
Division of Animal and Nutritional Sciences
West Virginia University
Department of Biology
Middle Tennessee State University [Murfreesboro] (MTSU)
This project was supported by Agriculture and Food Research Initiative Competitive Grant no. 2011-67015-30091 from the USDA National Institute of Food and Agriculture.
National Center for Cool and Cold Water Aquaculture
United States Department of Agriculture - Agricultural Research Service
AgroParisTech-Institut National de la Recherche Agronomique (INRA)
Source :
Molecular Ecology Resources, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online (3), Non paginé. ⟨10.1111/1755-0998.12204⟩, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online, Non paginé. ⟨10.1111/1755-0998.12204⟩
Publication Year :
2014
Publisher :
HAL CCSD, 2014.

Abstract

Chantier qualité GA; Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of a genome duplication event that occurred between 25 and 100 Ma. This situation complicates single-nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and not simple allelic variants. To differentiate PSVs from simple allelic variants, we used 19 homozygous doubled haploid (DH) lines that represent a wide geographical range of rainbow trout populations. In the first phase of the study, we analysed SbfI restriction-site associated DNA (RAD) sequence data from all the 19 lines and selected 11 lines for an extended SNP discovery. In the second phase, we conducted the extended SNP discovery using PstI RAD sequence data from the selected 11 lines. The complete data set is composed of 145 168 high-quality putative SNPs that were genotyped in at least nine of the 11 lines, of which 71 446 (49%) had minor allele frequencies (MAF) of at least 18% (i.e. at least two of the 11 lines). Approximately 14% of the RAD SNPs in this data set are from expressed or coding rainbow trout sequences. Our comparison of the current data set with previous SNP discovery data sets revealed that 99% of our SNPs are novel. In the support files for this resource, we provide annotation to the positions of the SNPs in the working draft of the rainbow trout reference genome, provide the genotypes of each sample in the discovery panel and identify SNPs that are likely to be in coding sequences.

Details

Language :
English
ISSN :
1755098X and 17550998
Database :
OpenAIRE
Journal :
Molecular Ecology Resources, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online (3), Non paginé. ⟨10.1111/1755-0998.12204⟩, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online, Non paginé. ⟨10.1111/1755-0998.12204⟩
Accession number :
edsair.doi.dedup.....6c038c616b68e4022f0d0390c9277601