De novo transcriptome assembly and polymorphism detection in the flowering plant Silene vulgaris (Caryophyllaceae)
- Source :
- Molecular ecology resources. 12(2)
- Publication Year :
- 2011
Abstract
- Members of the angiosperm genus Silene are widely used in studies of ecology and evolution, but available genomic and population genetic resources within Silene remain limited. Deep transcriptome (i.e. expressed sequence tag or EST) sequencing has proven to be a rapid and cost-effective means to characterize gene content and identify polymorphic markers in non-model organisms. In this study, we report the results of 454 GS-FLX Titanium sequencing of a polyA-selected and normalized cDNA library from Silene vulgaris. The library was generated from a single pool of transcripts, combining RNA from leaf, root and floral tissue from three genetically divergent European subpopulations of S. vulgaris. A single full-plate 454 run produced 959,520 reads totalling 363.6 Mb of sequence data with an average read length of 379.0 bp after quality trimming and removal of custom library adaptors. We assembled 832,251 (86.7%) of these reads into 40,964 contigs, which have a total length of 25.4 Mb and can be organized into 18,178 graph-based clusters or 'isogroups'. Assembled sequences were annotated based on homology to genes in multiple public databases. Analysis of sequence variants identified 13,432 putative single-nucleotide polymorphisms (SNPs) and 1320 simple sequence repeats (SSRs) that are candidates for microsatellite analysis. Estimates of nucleotide diversity from 1577 contigs were used to generate genome-wide distributions that revealed several outliers with high diversity. All of these resources are publicly available through NCBI and/or our website (http://silenegenomics.biology.virginia.edu) and should provide valuable genomic and population genetic tools for the Silene research community.
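The assembly figures quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is only an illustration built from the numbers reported above; the variable names are hypothetical, and the derived values (mean contig length, contigs per isogroup) are not stated in the abstract, so they should be read as rough estimates rather than reported results.

```python
# Figures as quoted in the abstract (variable names are illustrative only).
total_reads = 959_520        # reads after quality trimming
total_bases_mb = 363.6       # total sequence, in megabases
assembled_reads = 832_251    # reads placed into contigs
contigs = 40_964
contig_total_mb = 25.4       # summed contig length, in megabases
isogroups = 18_178

# Share of reads that were assembled (abstract reports 86.7%).
pct_assembled = 100 * assembled_reads / total_reads

# Mean read length implied by the rounded totals (abstract reports 379.0 bp;
# a small difference is expected because 363.6 Mb is itself rounded).
mean_read_bp = total_bases_mb * 1e6 / total_reads

# Derived estimates that the abstract does not state directly.
mean_contig_bp = contig_total_mb * 1e6 / contigs
contigs_per_isogroup = contigs / isogroups

print(f"assembled reads:      {pct_assembled:.1f}%")
print(f"mean read length:     {mean_read_bp:.1f} bp")
print(f"mean contig length:   {mean_contig_bp:.0f} bp")
print(f"contigs per isogroup: {contigs_per_isogroup:.2f}")
```

The implied mean read length agrees with the reported 379.0 bp to within rounding, and the assembled fraction reproduces the stated 86.7%.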
- Subjects :
- Sequence analysis
De novo transcriptome assembly
Population
Molecular Sequence Data
Polymorphism, Single Nucleotide
Nucleotide diversity
Genetics
education
Silene
Ecology, Evolution, Behavior and Systematics
Phylogeny
Silene vulgaris
Gene Library
Expressed Sequence Tags
Expressed sequence tag
biology
Gene Expression Profiling
food and beverages
Genetic Variation
Molecular Sequence Annotation
Sequence Analysis, DNA
Genetic marker
Biotechnology
Microsatellite Repeats
Details
- ISSN :
- 17550998
- Volume :
- 12
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Molecular ecology resources
- Accession number :
- edsair.doi.dedup.....9877afb8c50e5c61541f4dd4baf5ae1f