Back to Search Start Over

Structurally divergent and recurrently mutated regions of primate genomes.

Authors :
Mao Y
Harvey WT
Porubsky D
Munson KM
Hoekzema K
Lewis AP
Audano PA
Rozanski A
Yang X
Zhang S
Gordon DS
Wei X
Logsdon GA
Haukness M
Dishuck PC
Jeong H
Del Rosario R
Bauer VL
Fattor WT
Wilkerson GK
Lu Q
Paten B
Feng G
Sawyer SL
Warren WC
Carbone L
Eichler EE
Source :
BioRxiv : the preprint server for biology [bioRxiv] 2023 Mar 07. Date of Electronic Publication: 2023 Mar 07.
Publication Year :
2023

Abstract

To better understand the pattern of primate genome structural variation, we sequenced and assembled using multiple long-read sequencing technologies the genomes of eight nonhuman primate species, including New World monkeys (owl monkey and marmoset), Old World monkey (macaque), Asian apes (orangutan and gibbon), and African ape lineages (gorilla, bonobo, and chimpanzee). Compared to the human genome, we identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. Across 50 million years of primate evolution, we estimate that 819.47 Mbp or ~27% of the genome has been affected by SVs based on analysis of these primate lineages. We identify 1,607 structurally divergent regions (SDRs) wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost ( CARDs , ABCD7 , OLAH ) and new lineage-specific genes are generated (e.g., CKAP2 , NEK5 ) and have become targets of rapid chromosomal diversification and positive selection (e.g., RGPD s). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species for the first time.<br />Competing Interests: Competing interests E.E.E. is a scientific advisory board (SAB) member of Variant Bio, Inc. The other authors declare no competing interests.

Details

Language :
English
Database :
MEDLINE
Journal :
BioRxiv : the preprint server for biology
Accession number :
36945442
Full Text :
https://doi.org/10.1101/2023.03.07.531415