Back to Search Start Over

Genomic exploration and molecular marker development in a large and complex conifer genome using RADseq and mRNAseq

Authors :
M. Bou Dagher-Kharrat
Giovanni G. Vendramin
Marie-Joe Karam
Sara Pinosio
François Lefèvre
Unité de Recherches Forestières Méditerranéennes (URFM)
Institut National de la Recherche Agronomique (INRA)
laboratoire de Caractérisation Génomique des Plantes
Université Saint-Joseph de Beyrouth (USJ)
Applied Genomics Institute (IGA)
Institute of Biosciences and Bioresources (IBBR)
Consiglio Nazionale delle Ricerche [Roma] (CNR)
This project was supported by the Agropolis Fondation (Montpellier, France) under the reference ID 'BIOFIS' 1001-001
Istituto di Bioscienze e BioRisorse [Palermo] (IBBR)
Consiglio Nazionale delle Ricerche (CNR)
Source :
Molecular Ecology Resources, Molecular Ecology Resources, Wiley/Blackwell, 2015, 15 (3), pp.601-612. ⟨10.1111/1755-0998.12329⟩, Molecular ecology resources (Online) 15 (2015): 601–612. doi:10.1111/1755-0998.12329, info:cnr-pdr/source/autori:M.-J. Karam1, F. Lefèvre1,*, M. Bou Dagher-Kharrat2, S. Pinosio3,4 G.G. Vendramin4/titolo:Genomic exploration and molecular marker development in a large and complex conifer genome using RADseq and mRNAseq/doi:10.1111%2F1755-0998.12329/rivista:Molecular ecology resources (Online)/anno:2015/pagina_da:601/pagina_a:612/intervallo_pagine:601–612/volume:15
Publication Year :
2014

Abstract

We combined restriction site associated DNA sequencing (RADseq) using a hypomethylation-sensitive enzyme and messenger RNA sequencing (mRNAseq) to develop molecular markers for the 16 gigabase genome of[i] Cedrus atlantica[/i], a conifer tree species. With each method, Illumina(®) reads from one individual were used to generate de novo assemblies. SNPs from the RADseq data set were detected in a panel of one single individual and three pools of three individuals each. We developed a flexible script to estimate the ascertainment bias in SNP detection considering the pooling and sampling effects on the probability of not detecting an existing polymorphism. Gene Ontology (GO) and transposable element (TE) search analyses were applied to both data sets. The RADseq and the mRNAseq assemblies represented 0.1% and 0.6% of the genome, respectively. Genome complexity reduction resulted in 17% of the RADseq contigs potentially coding for proteins. This rate was doubled in the mRNAseq data set, suggesting that RADseq also explores noncoding low-repeat regions. The two methods gave very similar GO-slim profiles. As expected, the two assemblies were poor in TE-like sequences (

Details

ISSN :
17550998 and 1755098X
Volume :
15
Issue :
3
Database :
OpenAIRE
Journal :
Molecular ecology resources
Accession number :
edsair.doi.dedup.....dc52665070e9525186a61e37ddff1e77
Full Text :
https://doi.org/10.1111/1755-0998.12329⟩