Back to Search
Start Over
De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis)
- Source :
- G3: Genes, Genomes, Genetics, Vol 7, Iss 2, Pp 755-773 (2017), G3: Genes|Genomes|Genetics
- Publication Year :
- 2017
- Publisher :
- Oxford University Press, 2017.
-
Abstract
- The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon–gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology.
- Subjects :
- 0301 basic medicine
Beaver
genome annotation
Sequence assembly
Investigations
QH426-470
Genome
Transcriptome
03 medical and health sciences
0302 clinical medicine
biology.animal
Genetics
Molecular Biology
Genetics (clinical)
Whole genome sequencing
Castor canadensis
biology
rodent
Genome project
biology.organism_classification
030104 developmental biology
whole-genome sequencing
genome assembly
Mammalian genome
030217 neurology & neurosurgery
Canadian beaver
Subjects
Details
- Language :
- English
- ISSN :
- 21601836
- Volume :
- 7
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- G3: Genes, Genomes, Genetics
- Accession number :
- edsair.doi.dedup.....e1eb7bfd099d71c9e6786e184cd227a2
- Full Text :
- https://doi.org/10.1534/g3.116.038208