Back to Search
Start Over
A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids
- Source :
- Molecular Ecology Resources, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online (3), Non paginé. ⟨10.1111/1755-0998.12204⟩, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online, Non paginé. ⟨10.1111/1755-0998.12204⟩
- Publication Year :
- 2014
- Publisher :
- HAL CCSD, 2014.
-
Abstract
- Chantier qualité GA; Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of a genome duplication event that occurred between 25 and 100 Ma. This situation complicates single-nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and not simple allelic variants. To differentiate PSVs from simple allelic variants, we used 19 homozygous doubled haploid (DH) lines that represent a wide geographical range of rainbow trout populations. In the first phase of the study, we analysed SbfI restriction-site associated DNA (RAD) sequence data from all the 19 lines and selected 11 lines for an extended SNP discovery. In the second phase, we conducted the extended SNP discovery using PstI RAD sequence data from the selected 11 lines. The complete data set is composed of 145 168 high-quality putative SNPs that were genotyped in at least nine of the 11 lines, of which 71 446 (49%) had minor allele frequencies (MAF) of at least 18% (i.e. at least two of the 11 lines). Approximately 14% of the RAD SNPs in this data set are from expressed or coding rainbow trout sequences. Our comparison of the current data set with previous SNP discovery data sets revealed that 99% of our SNPs are novel. In the support files for this resource, we provide annotation to the positions of the SNPs in the working draft of the rainbow trout reference genome, provide the genotypes of each sample in the discovery panel and identify SNPs that are likely to be in coding sequences.
- Subjects :
- 0106 biological sciences
[SDV.SA]Life Sciences [q-bio]/Agricultural sciences
Genotype
Molecular Sequence Data
Single-nucleotide polymorphism
Biology
Haploidy
010603 evolutionary biology
01 natural sciences
Genome
Polymorphism, Single Nucleotide
DNA sequencing
03 medical and health sciences
Genetics
SNP
Animals
Deoxyribonucleases, Type II Site-Specific
Ecology, Evolution, Behavior and Systematics
Alleles
Phylogeny
restriction-site-associated dna sequencing
030304 developmental biology
0303 health sciences
double haploid
Base Sequence
Sequence Analysis, DNA
rainbow trout
single-nucleotide polymorphism discovery
Minor allele frequency
Restriction site
Oncorhynchus mykiss
Rainbow trout
paralogous sequence variant
Biotechnology
Reference genome
Subjects
Details
- Language :
- English
- ISSN :
- 1755098X and 17550998
- Database :
- OpenAIRE
- Journal :
- Molecular Ecology Resources, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online (3), Non paginé. ⟨10.1111/1755-0998.12204⟩, Molecular Ecology Resources, Wiley/Blackwell, 2014, Early View, online, Non paginé. ⟨10.1111/1755-0998.12204⟩
- Accession number :
- edsair.doi.dedup.....6c038c616b68e4022f0d0390c9277601