
Chromosome-length genome assembly and structural variations of the primal Basenji dog (Canis lupus familiaris) genome

Authors :
Edwards, Richard J ; https://orcid.org/0000-0002-3645-5539
Field, Matthew
Ferguson, James ; https://orcid.org/0000-0002-6192-6937
Dudchenko, Olga
Keilwagen, Jens
Rosen, Benjamin
Johnston, Gary
Rice, Edward
Hillier, LaDeanna
Hammond, Jillian
Towarnicki, Samuel
Omer, Arina
Skvortsova, Ksenia
Bogdanovic, Ozren ; https://orcid.org/0000-0001-5680-0056
Zammit, Robert
Aiden, Erez
Warren, Wesley
Ballard, Bill ; https://orcid.org/0000-0002-2358-6003
Source :
urn:ISSN:2692-8205; bioRxiv, 2020.11.11.379073
Publication Year :
2020

Abstract

Background: Basenjis are considered an ancient dog breed of central African origin that still live and hunt with tribesmen in the African Congo. Nicknamed the barkless dog, the Basenji possesses a unique phylogeny, geographical origins and traits that make understanding its genome structure relative to more modern dog breeds of great interest. Here, we report the de novo assemblies of two Basenjis: a female, China, and a male, Wags. We conduct pairwise comparisons and report structural variations between assembled genomes of three dog breeds: Basenji (CanFam_Bas), Boxer (CanFam3.1) and German Shepherd Dog (GSD) (CanFam_GSD). We then align representative whole genome sequences from 58 dog breeds and show the importance of genome reference when assessing variation among dog breeds. Results: Here we present two high quality Basenji genome assemblies, CanFam_Bas (China) and Wags. CanFam_Bas is superior to CanFam3.1 in terms of genome contiguity and comparable overall to the high quality CanFam_GSD assembly. The increasing number of available canid reference genomes allows us to examine the impact that the choice of reference genome has with regard to reference genome quality and breed relatedness. By aligning short read data from 58 representative dog breeds to three reference genomes, we demonstrate how the choice of reference genome significantly impacts both read mapping and variant detection. Further, we generate a conservative list of structural variant calls using a consensus of both Pacific Biosciences and Oxford Nanopore long reads to identify large structural breed differences. Collectively, this work highlights the importance of the choice of reference genome in canid variation studies. Conclusions: The growing number of high-quality canid reference genomes means the choice of reference genome is an increasingly critical decision in subsequent canid variant analyses. The basal position of the Basenji makes it suitable for variant analysis for targeted applications of sp

Details

Database :
OAIster
Journal :
urn:ISSN:2692-8205; bioRxiv, 2020.11.11.379073
Notes :
application/pdf
Publication Type :
Electronic Resource
Accession number :
edsoai.on1230136456
Document Type :
Electronic Resource