Back to Search
Start Over
Genomes of the Mouse Collaborative Cross
- Source :
- Genetics
- Publication Year :
- 2017
-
Abstract
- The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions. Characterization of their genomes can be a community effort, reducing the burden on individual users. Here we present the genomes of the CC strains using two complementary approaches as a resource to improve power and interpretation of genetic experiments. Our study also provides a cautionary tale regarding the limitations imposed by such basic biological processes as mutation and selection. A distinct advantage of inbred panels is that genotyping only needs to be performed on the panel, not on each individual mouse. The initial CC genome data were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes, and there was uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains generated by paired-end short reads at 30× coverage of a single male per strain. Sequencing leads to a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples show a significant reduction in the genome-wide haplotype frequencies from two wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by mutation and drift in the CC genomes. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 ± 0.4 per gigabase per generation. The fixation of new mutations by genetic drift has introduced thousands of new variants into the CC strains. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function. Approximately one-third of the CC inbred strains have acquired large deletions (>10 kb) many of which overlap known coding genes and functional elements. The sequence of these mice is a critical resource to CC users, increases threefold the number of mouse inbred strain genomes available publicly, and provides insight into the effect of mutation and drift on common resources.
- Subjects :
- 0301 basic medicine
Male
MPP
Genotype
Quantitative Trait Loci
selection
Mice, Inbred Strains
Biology
Investigations
Genome
Polymorphism, Single Nucleotide
03 medical and health sciences
Mice
Inbred strain
Genetic drift
Multiparental Populations
Genetics
Animals
Gene
Genotyping
Crosses, Genetic
Whole genome sequencing
drift
genetic variants
Haplotype
Genetic Drift
whole genome sequence
Chromosome Mapping
030104 developmental biology
Haplotypes
Mutation
Imputation (genetics)
Subjects
Details
- ISSN :
- 19432631
- Volume :
- 206
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Genetics
- Accession number :
- edsair.doi.dedup.....c7621c98f57e17759f1445a99e057503