1. Characterization of single-nucleotide variation in Indian-origin rhesus macaques (Macaca mulatta)
- Author
-
Ned H. Kalin, Yanru Ren, Jeffrey Rogers, Donna M. Muzny, Muthuswamy Raveendran, Aleksandar Milosavljevic, Gloria L. Fawcett, Ronald A. Harris, David K. Chen, David Rio Deiros, David A. Wheeler, Fuli Yu, Steven E. Shelton, Jeffrey G. Reid, Richard A. Gibbs, and Kimberly C Worley
- Subjects
lcsh:QH426-470 ,lcsh:Biotechnology ,India ,Single-nucleotide polymorphism ,Computational biology ,Genome ,Macaque ,Polymorphism, Single Nucleotide ,03 medical and health sciences ,0302 clinical medicine ,Species Specificity ,Genetic linkage ,single nucleotide polymorphism ,lcsh:TP248.13-248.65 ,biology.animal ,Genetic variation ,common variants ,SOLiD™ ,Genetics ,SNP ,Animals ,030304 developmental biology ,Whole genome sequencing ,0303 health sciences ,biology ,Sequence Analysis, DNA ,biology.organism_classification ,Macaca mulatta ,lcsh:Genetics ,Rhesus macaque ,genetic variation ,030217 neurology & neurosurgery ,Research Article ,rhesus macaque ,Biotechnology - Abstract
Background Rhesus macaques are the most widely utilized nonhuman primate model in biomedical research. Previous efforts have validated fewer than 900 single nucleotide polymorphisms (SNPs) in this species, which limits opportunities for genetic studies related to health and disease. Extensive information about SNPs and other genetic variation in rhesus macaques would facilitate valuable genetic analyses, as well as provide markers for genome-wide linkage analysis and the genetic management of captive breeding colonies. Results We used the available rhesus macaque draft genome sequence, new sequence data from unrelated individuals and existing published sequence data to create a genome-wide SNP resource for Indian-origin rhesus monkeys. The original reference animal and two additional Indian-origin individuals were resequenced to low coverage using SOLiD™ sequencing. We then used three strategies to validate SNPs: comparison of potential SNPs found in the same individual using two different sequencing chemistries, and comparison of potential SNPs in different individuals identified with either the same or different sequencing chemistries. Our approach validated approximately 3 million SNPs distributed across the genome. Preliminary analysis of SNP annotations suggests that a substantial number of these macaque SNPs may have functional effects. More than 700 non-synonymous SNPs were scored by Polyphen-2 as either possibly or probably damaging to protein function and these variants now constitute potential models for studying functional genetic variation relevant to human physiology and disease. Conclusions Resequencing of a small number of animals identified greater than 3 million SNPs. This provides a significant new information resource for rhesus macaques, an important research animal. The data also suggests that overall genetic variation is high in this species. We identified many potentially damaging non-synonymous coding SNPs, providing new opportunities to identify rhesus models for human disease.
- Full Text
- View/download PDF