35 results on '"Bernard Y. Kim"'
Search Results
2. Single-fly genome assemblies fill major phylogenomic gaps across the Drosophilidae Tree of Life.
- Author
-
Bernard Y Kim, Hannah R Gellert, Samuel H Church, Anton Suvorov, Sean S Anderson, Olga Barmina, Sofia G Beskid, Aaron A Comeault, K Nicole Crown, Sarah E Diamond, Steve Dorus, Takako Fujichika, James A Hemker, Jan Hrcek, Maaria Kankare, Toru Katoh, Karl N Magnacca, Ryan A Martin, Teruyuki Matsunaga, Matthew J Medeiros, Danny E Miller, Scott Pitnick, Michele Schiffer, Sara Simoni, Tessa E Steenwinkel, Zeeshan A Syed, Aya Takahashi, Kevin H-C Wei, Tsuya Yokoyama, Michael B Eisen, Artyom Kopp, Daniel Matute, Darren J Obbard, Patrick M O'Grady, Donald K Price, Masanori J Toda, Thomas Werner, and Dmitri A Petrov
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1 Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.
- Published
- 2024
- Full Text
- View/download PDF
3. Rapid evolutionary diversification of the flamenco locus across simulans clade Drosophila species.
- Author
-
Sarah Signor, Jeffrey Vedanayagam, Bernard Y Kim, Filip Wierzbicki, Robert Kofler, and Eric C Lai
- Subjects
Genetics ,QH426-470 - Abstract
Suppression of transposable elements (TEs) is paramount to maintain genomic integrity and organismal fitness. In D. melanogaster, the flamenco locus is a master suppressor of TEs, preventing the mobilization of certain endogenous retrovirus-like TEs from somatic ovarian support cells to the germline. It is transcribed by Pol II as a long (100s of kb), single-stranded, primary transcript, and metabolized into ~24-32 nt Piwi-interacting RNAs (piRNAs) that target active TEs via antisense complementarity. flamenco is thought to operate as a trap, owing to its high content of recent horizontally transferred TEs that are enriched in antisense orientation. Using newly-generated long read genome data, which is critical for accurate assembly of repetitive sequences, we find that flamenco has undergone radical transformations in sequence content and even copy number across simulans clade Drosophilid species. Drosophila simulans flamenco has duplicated and diverged, and neither copy exhibits synteny with D. melanogaster beyond the core promoter. Moreover, flamenco organization is highly variable across D. simulans individuals. Next, we find that D. simulans and D. mauritiana flamenco display signatures of a dual-stranded cluster, with ping-pong signals in the testis and/or embryo. This is accompanied by increased copy numbers of germline TEs, consistent with these regions operating as functional dual-stranded clusters. Overall, the physical and functional diversity of flamenco orthologs is testament to the extremely dynamic consequences of TE arms races on genome organization, not only amongst highly related species, but even amongst individuals.
- Published
- 2023
- Full Text
- View/download PDF
4. Correction: Highly contiguous assemblies of 101 drosophilid genomes
- Author
-
Bernard Y Kim, Jeremy R Wang, Danny E Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A Comeault, David Peede, Emmanuel RR D'Agostino, Julianne Pelaez, Jessica M Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Pavle Erić, Jian-Jun Gao, Takehiro K Katoh, Masanori J Toda, Hideaki Watabe, Masayoshi Watada, Jeremy S Davis, Leonie C Moyle, Giulia Manoli, Enrico Bertolini, Vladimír Košťál, R Scott Hawley, Aya Takahashi, Corbin D Jones, Donald K Price, Noah Whiteman, Artyom Kopp, Daniel R Matute, and Dmitri A Petrov
- Subjects
Medicine ,Science ,Biology (General) ,QH301-705.5 - Published
- 2022
- Full Text
- View/download PDF
5. Highly contiguous assemblies of 101 drosophilid genomes
- Author
-
Bernard Y Kim, Jeremy R Wang, Danny E Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A Comeault, David Peede, Emmanuel RR D'Agostino, Julianne Pelaez, Jessica M Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie E Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Pavle Erić, Jian-Jun Gao, Takehiro K Katoh, Masanori J Toda, Hideaki Watabe, Masayoshi Watada, Jeremy S Davis, Leonie C Moyle, Giulia Manoli, Enrico Bertolini, Vladimír Košťál, R Scott Hawley, Aya Takahashi, Corbin D Jones, Donald K Price, Noah Whiteman, Artyom Kopp, Daniel R Matute, and Dmitri A Petrov
- Subjects
Drosophila ,Drosophilidae ,comparative genomics ,genome assembly ,nanopore ,long reads ,Medicine ,Science ,Biology (General) ,QH301-705.5 - Abstract
Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.
- Published
- 2021
- Full Text
- View/download PDF
6. A community-maintained standard library of population genetic models
- Author
-
Jeffrey R Adrion, Christopher B Cole, Noah Dukler, Jared G Galloway, Ariella L Gladstein, Graham Gower, Christopher C Kyriazis, Aaron P Ragsdale, Georgia Tsambos, Franz Baumdicker, Jedidiah Carlson, Reed A Cartwright, Arun Durvasula, Ilan Gronau, Bernard Y Kim, Patrick McKenzie, Philipp W Messer, Ekaterina Noskova, Diego Ortega-Del Vecchyo, Fernando Racimo, Travis J Struck, Simon Gravel, Ryan N Gutenkunst, Kirk E Lohmueller, Peter L Ralph, Daniel R Schrider, Adam Siepel, Jerome Kelleher, and Andrew D Kern
- Subjects
simulation ,reproducibility ,open source ,Medicine ,Science ,Biology (General) ,QH301-705.5 - Abstract
The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.
- Published
- 2020
- Full Text
- View/download PDF
7. Secondary reversion to sexual monomorphism associated with tissue‐specific loss of doublesex expression
- Author
-
Jian‐jun Gao, Olga Barmina, Ammon Thompson, Bernard Y. Kim, Anton Suvorov, Kohtaro Tanaka, Hideaki Watabe, Masanori J. Toda, Ji‐Min Chen, Takehiro K. Katoh, and Artyom Kopp
- Subjects
DNA-Binding Proteins ,Male ,Sex Characteristics ,Drosophila melanogaster ,Sex Differentiation ,Genetics ,Animals ,Drosophila Proteins ,Gene Expression Regulation, Developmental ,Female ,General Agricultural and Biological Sciences ,Ecology, Evolution, Behavior and Systematics - Abstract
Animal evolution is characterized by frequent turnover of sexually dimorphic traits – new sex- specific characters are gained, and some ancestral sex-specific characters are lost, in many lineages. In insects, sexual differentiation is predominantly cell-autonomous and depends on the expression of the doublesex (dsx) transcription factor. In most cases, cells that transcribe dsx have the potential to undergo sex-specific differentiation, while those that lack dsx expression do not. Consistent with this mode of development, comparative research has shown that the origin of new sex-specific traits can be associated with the origin of new spatial domains of dsx expression. In this report, we examine the opposite situation – a secondary loss of the sex comb, a male-specific grasping structure that develops on the front legs of some drosophilid species. We show that, while the origin of the sex comb is linked to an evolutionary gain of dsx expression in the leg, sex comb loss in a newly identified species of Lordiphosa (Drosophilidae) is associated with a secondary loss of dsx expression. We discuss how the developmental control of sexual dimorphism affects the mechanisms by which sex-specific traits can evolve.
- Published
- 2022
8. GENOME REPORT: Chromosome-scale genome assembly of the African spiny mouse (Acomys cahirinus)
- Author
-
Elizabeth Dong Nguyen, Vahid Nikoonejad Fard, Bernard Y. Kim, Sarah Collins, Miranda Galey, Branden R. Nelson, Paul Wakenight, Simone M. Gable, Aaron McKenna, Theo K. Bammler, Jim MacDonald, Daryl M. Okamura, Jay Shendure, David R. Beier, Jan Marino Ramirez, Mark W. Majesky, Kathleen J. Millen, Marc Tollis, and Danny E. Miller
- Abstract
There is increasing interest in the African spiny mouse (Acomys cahirinus) as a model organism because of its ability for regeneration of tissue after injury in skin, muscle, and internal organs such as the kidneys. A high-quality reference genome is needed to better understand these regenerative properties at the molecular level. Here, we present an improved reference genome forA. cahirinusgenerated from long Nanopore sequencing reads. We confirm the quality of our annotations using RNA sequencing data from four different tissues. Our genome is of higher contiguity and quality than previously reported genomes from this species and will facilitate ongoing efforts to better understand the regenerative properties of this organism.
- Published
- 2023
9. Antigenic diversity in malaria parasites is maintained on extrachromosomal DNA
- Author
-
Emily R. Ebel, Bernard Y. Kim, Marina McDew-White, Elizabeth S. Egan, Timothy J.C. Anderson, and Dmitri A. Petrov
- Abstract
Sequence variation among antigenicvargenes enablesPlasmodium falciparummalaria parasites to evade host immunity. Using long sequence reads from haploid clones from a mutation accumulation experiment, we detectvardiversity inconsistent with simple chromosomal inheritance. We discover putatively circular DNA that is strongly enriched forvargenes, which exist in multiple alleles per locus separated by recombination and indel events. Extrachromosomal DNA likely contributes to rapid antigenic diversification inP. falciparum.
- Published
- 2023
10. Deleterious variation shapes the genomic landscape of introgression.
- Author
-
Bernard Y Kim, Christian D Huber, and Kirk E Lohmueller
- Subjects
Genetics ,QH426-470 - Abstract
While it is appreciated that population size changes can impact patterns of deleterious variation in natural populations, less attention has been paid to how gene flow affects and is affected by the dynamics of deleterious variation. Here we use population genetic simulations to examine how gene flow impacts deleterious variation under a variety of demographic scenarios, mating systems, dominance coefficients, and recombination rates. Our results show that admixture between populations can temporarily reduce the genetic load of smaller populations and cause increases in the frequency of introgressed ancestry, especially if deleterious mutations are recessive. Additionally, when fitness effects of new mutations are recessive, between-population differences in the sites at which deleterious variants exist creates heterosis in hybrid individuals. Together, these factors lead to an increase in introgressed ancestry, particularly when recombination rates are low. Under certain scenarios, introgressed ancestry can increase from an initial frequency of 5% to 30-75% and fix at many loci, even in the absence of beneficial mutations. Further, deleterious variation and admixture can generate correlations between the frequency of introgressed ancestry and recombination rate or exon density, even in the absence of other types of selection. The direction of these correlations is determined by the specific demography and whether mutations are additive or recessive. Therefore, it is essential that null models of admixture include both demography and deleterious variation before invoking other mechanisms to explain unusual patterns of genetic variation.
- Published
- 2018
- Full Text
- View/download PDF
11. Two new hybrid populations expand the swordtail hybridization model system
- Author
-
Bernard Y. Kim, Paola Fascinetto-Zago, Quinn K. Langdon, Shreya M. Banerjee, Daniel L. Powell, Stepfanie M. Aguillon, Molly Schumer, and Benjamin M. Moran
- Subjects
Gene Flow ,Genome ,Human evolutionary genetics ,Assortative mating ,Xiphophorus cortezi ,Model system ,Biology ,biology.organism_classification ,Article ,Gene flow ,Cyprinodontiformes ,Genetics, Population ,Evolutionary biology ,Xiphophorus birchmanni ,Genetics ,Animals ,Humans ,Hybridization, Genetic ,General Agricultural and Biological Sciences ,Ecology, Evolution, Behavior and Systematics ,Hybrid - Abstract
Natural hybridization events provide unique windows into the barriers that keep species apart as well as the consequences of their breakdown. Here, we characterize hybrid populations formed between the northern swordtail fish Xiphophorus cortezi and Xiphophorus birchmanni from collection sites on two rivers. We use simulations and new genetic reference panels to develop sensitive and accurate local ancestry calling in this novel system. Strikingly, we find that hybrid populations on both rivers consist of two genetically distinct subpopulations: a cluster of pure X. birchmanni individuals and one of phenotypically intermediate hybrids that derive ∼85-90% of their genome from X. cortezi. Simulations suggest that initial hybridization occurred ∼150 generations ago at both sites, with little evidence for contemporary gene flow between subpopulations. This population structure is consistent with strong assortative mating between individuals of similar ancestry. The patterns of population structure uncovered here mirror those seen in hybridization between X. birchmanni and its sister species, Xiphophorus malinche, indicating an important role for assortative mating in the evolution of hybrid populations. Future comparisons will provide a window into the shared mechanisms driving the outcomes of hybridization not only among independent hybridization events between the same species but also across distinct species pairs.
- Published
- 2021
12. Rapid evolutionary diversification of theflamencolocus across simulans cladeDrosophilaspecies
- Author
-
Sarah Signor, Jeffrey Vedanayagam, Bernard Y. Kim, Filip Wierzbicki, Robert Kofler, and Eric C. Lai
- Abstract
Effective suppression of transposable elements (TEs) is paramount to maintain genomic integrity and organismal fitness. InD. melanogaster,flamencois a master suppressor of TEs, preventing their movement from somatic ovarian support cells to the germline. It is transcribed by Pol II as a long (100s of kb), single-stranded, primary transcript, that is metabolized into Piwi-interacting RNAs (piRNAs) that target active TEs via antisense complementarity.flamencois thought to operate as a trap, owing to its high content of recent horizontally transferred TEs that are enriched in antisense orientation. Using newly-generated long read genome data, which is critical for accurate assembly of repetitive sequences, we find thatflamencohas undergone radical transformations in sequence content and even copy number acrosssimulansclade Drosophilid species.D. simulans flamencohas duplicated and diverged, and neither copy exhibits synteny withD. melanogasterbeyond the core promoter. Moreover,flamencoorganization is highly variable acrossD. simulansindividuals. Next, we find thatD. simulansandD. mauritiana flamencodisplay signatures of a dual-stranded cluster, with ping-pong signals in the testis and/or embryo. This is accompanied by increased copy numbers of germline TEs, consistent with these regions operating as functional dual stranded clusters. Overall, the physical and functional diversity offlamencoorthologs is testament to the extremely dynamic consequences of TE arms races on genome organization, not only amongst highly related species, but even amongst individuals.
- Published
- 2022
13. DNA extraction and Nanopore library prep from single wild-caught drosophilids v2
- Author
-
Bernard Y Kim
- Abstract
We have been assembling the genomes of many Drosophila species. This protocol fork allows for the sequencing of single wild-caught flies using Nanopore MinION sequencers, in contrast to our previous protocol that originally used gDNA extracted from tens or hundreds of flies from an inbred line. While read lengths are not nearly close to what we were seeing before, libraries range from 2-10kb read N50 and typically produce genome assemblies with contig N50 >1 Mbp. The protocol for ultra-long read sequencing from large pools of flies can be found here: dx.doi.org/10.17504/protocols.io.bdfqi3mw For now, please cite the original genomes paper if you use this protocol: https://doi.org/10.7554/eLife.66405 An update that includes many single wild-caught fly genomes is in progress.
- Published
- 2022
14. Identification and Characterization of Breakpoints and Mutations on Drosophila melanogaster Balancer Chromosomes
- Author
-
Kasun Buddika, Brian R. Calvi, Michael J. Dixon, Danny E. Miller, Bernard Y. Kim, R. Scott Hawley, Lily Kahsai, Kevin R. Cook, and Nicholas S. Sokol
- Subjects
balancer chromosomes ,p53 ,Biology ,QH426-470 ,Investigations ,medicine.disease_cause ,Fucosyltransferase A ,Chromosomes ,03 medical and health sciences ,0302 clinical medicine ,chromosomal inversions ,Homologous chromosome ,medicine ,Genetics ,Animals ,Ectopic recombination ,Molecular Biology ,Genetics (clinical) ,030304 developmental biology ,0303 health sciences ,Mutation ,Ankyrin 2 ,Breakpoint ,Chromosome ,Balancer chromosome ,Subtelomere ,biology.organism_classification ,Drosophila melanogaster ,Phenotype ,Chromosome Inversion ,030217 neurology & neurosurgery - Abstract
Balancers are rearranged chromosomes used in Drosophila melanogaster to maintain deleterious mutations in stable populations, preserve sets of linked genetic elements and construct complex experimental stocks. Here, we assess the phenotypes associated with breakpoint-induced mutations on commonly used third chromosome balancers and show remarkably few deleterious effects. We demonstrate that a breakpoint in p53 causes loss of radiation-induced apoptosis and a breakpoint in Fucosyltransferase A causes loss of fucosylation in nervous and intestinal tissue—the latter study providing new markers for intestinal cell identity and challenging previous conclusions about the regulation of fucosylation. We also describe thousands of potentially harmful mutations shared among X or third chromosome balancers, or unique to specific balancers, including an Ankyrin 2 mutation present on most TM3 balancers, and reiterate the risks of using balancers as experimental controls. We used long-read sequencing to confirm or refine the positions of two inversions with breakpoints lying in repetitive sequences and provide evidence that one of the inversions, In(2L)Cy, arose by ectopic recombination between foldback transposon insertions and the other, In(3R)C, cleanly separates subtelomeric and telomeric sequences and moves the subtelomeric sequences to an internal chromosome position. In addition, our characterization of In(3R)C shows that balancers may be polymorphic for terminal deletions. Finally, we present evidence that extremely distal mutations on balancers can add to the stability of stocks whose purpose is to maintain homologous chromosomes carrying mutations in distal genes. Overall, these studies add to our understanding of the structure, diversity and effectiveness of balancer chromosomes.
- Published
- 2020
15. Predictability and parallelism in the contemporary evolution of hybrid genomes
- Author
-
Daniel L. Powell, Benjamin M. Moran, Bernard Y. Kim, Fascinetto-Zago P, Cheyenne Payne, Molly Schumer, Quinn K. Langdon, Dodge To, and Suchandrima Banerjee
- Subjects
Evolutionary biology ,Xiphophorus birchmanni ,Tree of life ,%22">Fish ,Predictability ,Biology ,Parallelism (philosophy) ,biology.organism_classification ,Genome ,Selection (genetic algorithm) - Abstract
Hybridization between species is widespread across the tree of life. As a result, many species, including our own, harbor regions of their genome derived from hybridization. Despite the recognition that this process is widespread, we understand little about how the genome stabilizes following hybridization, and whether the mechanisms driving this stabilization tend to be shared across species. Here, we dissect the drivers of variation in local ancestry across the genome in replicated hybridization events between two species pairs of swordtail fish: Xiphophorus birchmanni × X. cortezi and X. birchmanni × X. malinche. We find unexpectedly high levels of repeatability in local ancestry across the two types of hybrid populations. This repeatability is attributable in part to the fact that the recombination landscape and locations of functionally important elements play a major role in driving variation in local ancestry in both types of hybrid populations. Beyond these broad scale patterns, we identify dozens of regions of the genome where minor parent ancestry is unusually low or high across species pairs. Analysis of these regions points to shared sites under selection across species pairs, and in some cases, shared mechanisms of selection. We show that one such region is a previously unknown hybrid incompatibility that is shared across X. birchmanni × X. cortezi and X. birchmanni × X. malinche hybrid populations.
- Published
- 2021
16. Author response: Highly contiguous assemblies of 101 drosophilid genomes
- Author
-
Yoshitaka Ogawa, Takehiro K. Katoh, David Peede, Ellie E. Armstrong, Donald K. Price, Molly Zych, Giulia Manoli, Aaron A. Comeault, Corbin D. Jones, Hideaki Watabe, R. Scott Hawley, Artyom Kopp, Marija Savic Veselinovic, Noah K. Whiteman, Olga Barmina, Emmanuel R. R. D’Agostino, Teruyuki Matsunaga, Masayoshi Watada, Dmitri A. Petrov, Bernard Y. Kim, Jian-Jun Gao, Mihailo Jelić, Aya Takahashi, Marina Stamenkovic-Radak, Ammon Thompson, Masanori J. Toda, Enrico Bertolini, Julianne N. Pelaez, Pavle Erić, Jeremy S Davis, Emily K. Delaney, Danny E. Miller, Marija Tanasković, Daniel R. Matute, Jessica Aguilar, Leonie C. Moyle, Diler Haji, Vladimír Košťál, and Jeremy Wang
- Subjects
Evolutionary biology ,Genome - Published
- 2021
17. Highly contiguous assemblies of 101 drosophilid genomes
- Author
-
Marija Tanasković, Hideaki Watabe, Teruyuki Matsunaga, Yoshitaka Ogawa, Takehiro K. Katoh, Giulia Manoli, Pavle Erić, Marina Stamenkovic-Radak, Ammon Thompson, Jeremy S Davis, Jessica Aguilar, Aya Takahashi, David Peede, Ellie E. Armstrong, Jian-Jun Gao, Masanori J. Toda, Leonie C. Moyle, Danny E. Miller, Molly Zych, Jeremy Wang, Donald K. Price, Emmanuel R. R. D’Agostino, R. Scott Hawley, Daniel R. Matute, Marija Savic Veselinovic, Aaron A. Comeault, Bernard Y. Kim, Dmitri A. Petrov, Emily K. Delaney, Vladimír Košťál, Masayoshi Watada, Corbin D. Jones, Diler Haji, Noah K. Whiteman, Olga Barmina, Artyom Kopp, Julianne N. Pelaez, Mihailo Jelić, and Enrico Bertolini
- Subjects
Sequence assembly ,comparative genomics ,Genome ,Nanopores ,0302 clinical medicine ,Genome Size ,Genus ,long reads ,Drosophilidae ,Biology (General) ,0303 health sciences ,biology ,Contig ,D. melanogaster ,General Neuroscience ,High-Throughput Nucleotide Sequencing ,Genomics ,General Medicine ,Tools and Resources ,Drosophila melanogaster ,Medicine ,Female ,Drosophila ,Biotechnology ,QH301-705.5 ,Ecology (disciplines) ,Science ,General Biochemistry, Genetics and Molecular Biology ,Chromosomes ,Cell Line ,03 medical and health sciences ,genomics ,Genetics ,Animals ,nanopore ,030304 developmental biology ,Comparative genomics ,Evolutionary Biology ,General Immunology and Microbiology ,evolutionary biology ,Human Genome ,Computational Biology ,Genetics and Genomics ,biology.organism_classification ,Evolutionary biology ,genome assembly ,Other ,Nanopore sequencing ,Generic health relevance ,Biochemistry and Cell Biology ,030217 neurology & neurosurgery - Abstract
Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.
- Published
- 2021
18. Broad geographic sampling reveals the shared basis and environmental correlates of seasonal adaptation in Drosophila
- Author
-
Emily L. Behrman, Iryna Kozeretska, Talia L. Karasov, Paul S. Schmidt, Thomas Flatt, Brian P. Lazzaro, Svitlana Serga, Stephen W. Schaeffer, Paula R. Roy, Dmitri A. Petrov, Daniel K. Fabian, Susanne Tilk, Subhash Rajpurohit, Bernard Y. Kim, John E. Pool, Thomas J.S. Merritt, Josefa González, Katherine R. O'Brien, Ryan W. Taylor, Kelly A. Dyer, Alan O. Bergland, Heather E. Machado, National Institutes of Health (US), European Commission, Natural Sciences and Engineering Research Council of Canada, Canada Research Chairs, Machado, Heather E [0000-0002-1523-3937], Bergland, Alan O [0000-0001-7145-7575], Taylor, Ryan [0000-0002-9003-6378], Tilk, Susanne [0000-0002-9156-9360], Behrman, Emily [0000-0002-2472-9635], Flatt, Thomas [0000-0002-5990-1503], Kim, Bernard [0000-0002-5025-1292], Merritt, Thomas Js [0000-0002-4795-7534], O'Brien, Katherine [0000-0003-4660-0338], Petrov, Dmitri A [0000-0002-3664-9130], Apollo - University of Cambridge Repository, Merritt, Thomas JS [0000-0002-4795-7534], and Apollo-University Of Cambridge Repository
- Subjects
Male ,0106 biological sciences ,0301 basic medicine ,Adaptation, Biological ,Population genetics ,01 natural sciences ,Gene Frequency ,Biology (General) ,Ontario ,Ecology ,D. melanogaster ,biology ,General Neuroscience ,General Medicine ,seasonal adaptation ,Drosophila melanogaster ,Austria ,Medicine ,Seasons ,Ukraine ,Research Article ,QH301-705.5 ,Science ,selection ,Growing season ,010603 evolutionary biology ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,evolution ,Genetic variation ,genomics ,Animals ,Selection, Genetic ,Allele frequency ,Drosophila ,Selection (genetic algorithm) ,Evolutionary Biology ,Polymorphism, Genetic ,General Immunology and Microbiology ,fungi ,population genetics ,biology.organism_classification ,United States ,030104 developmental biology ,Spain ,Evolutionary biology ,FOS: Biological sciences ,Chromosome Inversion ,fluctuating selection ,Adaptation - Abstract
To advance our understanding of adaptation to temporally varying selection pressures, we identified signatures of seasonal adaptation occurring in parallel among Drosophila melanogaster populations. Specifically, we estimated allele frequencies genome-wide from flies sampled early and late in the growing season from 20 widely dispersed populations. We identified parallel seasonal allele frequency shifts across North America and Europe, demonstrating that seasonal adaptation is a general phenomenon of temperate fly populations. Seasonally fluctuating polymorphisms are enriched in large chromosomal inversions, and we find a broad concordance between seasonal and spatial allele frequency change. The direction of allele frequency change at seasonally variable polymorphisms can be predicted by weather conditions in the weeks prior to sampling, linking the environment and the genomic response to selection. Our results suggest that fluctuating selection is an important evolutionary force affecting patterns of genetic variation in Drosophila.
- Published
- 2021
19. Widespread introgression across a phylogeny of 155 Drosophila genomes
- Author
-
David Peede, Ellie E. Armstrong, Bernard Y. Kim, Emmanuel R. R. D’Agostino, Michael Lang, Peter Wadell, Jean R. David, Donald K. Price, Aaron A. Comeault, Dmitri A. Petrov, Daniel R. Matute, Virginie Courtier-Orgogozo, Anton Suvorov, Jeremy Wang, Daniel R. Schrider, University of North Carolina [Chapel Hill] (UNC), University of North Carolina System (UNC), Department of Biology, Stanford University, Stanford, CA, USA, DepartmeDepartment of Biology, University of North Carolina, Chapel Hill, NC 27599, USAnt of Biology, School of Life Sciences, University of Nevada, Las Vegas, NV 89119, USA, School of Fundamental Sciences, Massey University, Palmerston North. New Zealand, CNRS, Institut Jacques Monod, Université de Paris. France, Laboratoire Evolution, Génomes, Comportement, Ecologie (EGCE) CNRS, IRD, Univ. Paris-sud, Université Paris-Saclay, Gif sur Yvette, France, School of Natural Sciences, Bangor University, Bangor, Gwynedd, LL57 2DGA, UK, School of Fundamental Sciences, Massey University, Palmerston North 4442, New Zealand, CNRS, Institut Jacques Monod, Université de Paris, Paris 75013, France, Laboratoire Evolution, Génomes, Comportement, Ecologie (EGCE) CNRS, IRD, Univ. Paris-sud, Université Paris-Saclay, Gif sur Yvette 91190, France, Institut de Systématique, Evolution, Biodiversité (ISYEB ), Muséum national d'Histoire naturelle (MNHN)-École pratique des hautes études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Université des Antilles (UA), Molecular Ecology & Evolution Group, School of Natural Sciences, Bangor University, Bangor, Gwynedd LL57 2DGA, UK, Institut Jacques Monod (IJM (UMR_7592)), and Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS)
- Subjects
0106 biological sciences ,Tree of life ,Introgression ,Biology ,[SDV.BID.SPT]Life Sciences [q-bio]/Biodiversity/Systematics, Phylogenetics and taxonomy ,Genome ,010603 evolutionary biology ,01 natural sciences ,Article ,General Biochemistry, Genetics and Molecular Biology ,Gene flow ,03 medical and health sciences ,Monophyly ,Phylogenetics ,Phylogenomics ,Animals ,Drosophila (subgenus) ,Clade ,Phylogeny ,ComputingMilieux_MISCELLANEOUS ,030304 developmental biology ,0303 health sciences ,Phylogenetic tree ,[SDV.BID.EVO]Life Sciences [q-bio]/Biodiversity/Populations and Evolution [q-bio.PE] ,biology.organism_classification ,Biological Evolution ,Reticulate evolution ,[SDV.BA.ZI]Life Sciences [q-bio]/Animal biology/Invertebrate Zoology ,[SDV.GEN.GA]Life Sciences [q-bio]/Genetics/Animal genetics ,Evolutionary biology ,Hybridization, Genetic ,Drosophila ,General Agricultural and Biological Sciences - Abstract
Genome-scale sequence data have invigorated the study of hybridization and introgression, particularly in animals. However, outside of a few notable cases, we lack systematic tests for introgression at a larger phylogenetic scale across entire clades. Here we leverage 155 genome assemblies, from 149 species, to generate a fossil-calibrated phylogeny and conduct multilocus tests for introgression across nine monophyletic radiations within the genus Drosophila. Using complementary phylogenomic approaches, we identify widespread introgression across the evolutionary history of Drosophila. Mapping gene-tree discordance onto the phylogeny revealed that both ancient and recent introgression has occurred across most of the nine clades that we examined. Our results provide the first evidence of introgression occurring across the evolutionary history of Drosophila and highlight the need to continue to study the evolutionary consequences of hybridization and introgression in this genus and across the Tree of Life.
- Published
- 2020
20. Two new hybrid zones expand the swordtail hybridization model system
- Author
-
Bernard Y. Kim, Daniel L. Powell, Shreya M. Banerjee, Benjamin M. Moran, Fascinetto-Zago P, Molly Schumer, Quinn K. Langdon, and Stepfanie M. Aguillon
- Subjects
Whole genome sequencing ,Empirical data ,Evolutionary biology ,Population structure ,Assortative mating ,Model system ,Biology ,Genome ,Hybrid ,Gene flow - Abstract
Natural hybridization events provide unique windows into the barriers that keep species apart as well as the consequences of their breakdown. Here we characterize hybrid populations formed between the northern swordtail fishXiphophorus corteziandX. birchmannifrom collection sites on two rivers. We develop sensitive and accurate local ancestry calling for this system based on low coverage whole genome sequencing. Strikingly, we find that hybrid populations on both rivers consist of two genetically distinct subpopulations: a cluster of nearly pureX. birchmanniindividuals and one of phenotypically intermediate hybrids that derive ~85-90% of their genome fromX. cortezi. Simulations and empirical data suggest that at both sites initial hybridization occurred ~150 generations ago, with little evidence for contemporary gene flow between subpopulations, likely due to strong assortative mating. The patterns of population structure uncovered here mirror those seen in hybridization betweenX. birchmanniand its sister species,X. malinche. Future comparisons will provide a window into the repeatability of the outcomes of hybridization not only across independent hybridization events between the same species but also across distinct species pairs.
- Published
- 2020
21. Author response: A community-maintained standard library of population genetic models
- Author
-
Bernard Y. Kim, Jared Galloway, Ekaterina Noskova, Jerome Kelleher, Noah Dukler, Andrew D. Kern, Franz Baumdicker, Fernando Racimo, Simon Gravel, Daniel R. Schrider, Ariella L. Gladstein, Diego Ortega-Del Vecchyo, Georgia Tsambos, Reed A. Cartwright, Aaron P. Ragsdale, Jeffrey R. Adrion, Travis J. Struck, Jedidiah Carlson, Christopher C. Kyriazis, Arun Durvasula, Ryan N. Gutenkunst, Philipp W. Messer, Graham Gower, Peter L. Ralph, Ilan Gronau, Patrick F. McKenzie, Christopher B. Cole, Adam Siepel, and Kirk E. Lohmueller
- Subjects
education.field_of_study ,Genetic model ,Population ,Biology ,education ,Demography - Published
- 2020
22. Population genetic models of GERP scores suggest pervasive turnover of constrained sites across mammalian evolution
- Author
-
Bernard Y. Kim, Kirk E. Lohmueller, and Christian D. Huber
- Subjects
Cancer Research ,QH426-470 ,Negative selection ,Database and Informatics Methods ,0302 clinical medicine ,Natural Selection ,Genome Evolution ,Genetics (clinical) ,Conserved Sequence ,Data Management ,Mammals ,0303 health sciences ,education.field_of_study ,Natural selection ,Phylogenetic Analysis ,Genomics ,Phylogenetics ,Deletion Mutation ,Sequence Analysis ,Research Article ,Computer and Information Sciences ,Substitution Mutation ,Evolutionary Processes ,Bioinformatics ,Population ,Biology ,Research and Analysis Methods ,Human Genomics ,Molecular Evolution ,Evolution, Molecular ,03 medical and health sciences ,Genetic model ,Genetics ,Animals ,Humans ,Evolutionary Systematics ,Selection, Genetic ,education ,Molecular Biology ,Selection (genetic algorithm) ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Taxonomy ,Comparative genomics ,Evolutionary Biology ,Models, Genetic ,Genome, Human ,Point mutation ,Biology and Life Sciences ,Computational Biology ,Evolution of mammals ,Comparative Genomics ,Genetics, Population ,Turnover ,Evolutionary biology ,Mutation ,Human genome ,Sequence Alignment ,030217 neurology & neurosurgery - Abstract
Comparative genomic approaches have been used to identify sites where mutations are under purifying selection and of functional consequence by searching for sequences that are conserved across distantly related species. However, the performance of these approaches has not been rigorously evaluated under population genetic models. Further, short-lived functional elements may not leave a footprint of sequence conservation across many species. We use simulations to study how one measure of conservation, the Genomic Evolutionary Rate Profiling (GERP) score, relates to the strength of selection (Nes). We show that the GERP score is related to the strength of purifying selection. However, changes in selection coefficients or functional elements over time (i.e. functional turnover) can strongly affect the GERP distribution, leading to unexpected relationships between GERP and Nes. Further, we show that for functional elements that have a high turnover rate, adding more species to the analysis does not necessarily increase statistical power. Finally, we use the distribution of GERP scores across the human genome to compare models with and without turnover of sites where mutations are under purifying selection. We show that mutations in 4.51% of the noncoding human genome are under purifying selection and that most of this sequence has likely experienced changes in selection coefficients throughout mammalian evolution. Our work reveals limitations to using comparative genomic approaches to identify deleterious mutations. Commonly used GERP score thresholds miss over half of the noncoding sites in the human genome where mutations are under purifying selection., Author summary One of the most significant and challenging tasks in modern genomics is to assess the functional consequences of a particular nucleotide change in a genome. A common approach to address this challenge prioritizes sequences that share similar nucleotides across distantly related species, with the rationale that mutations at such positions were deleterious and removed from the population by purifying natural selection. Our manuscript shows that one popular measure of sequence conservation, the GERP score, performs well at identifying selected mutations if mutations at a site were under selection across all of mammalian evolution. Changes in selection at a given site dramatically reduces the power of GERP to detect selected mutations in humans. We also combine population genetic models with the distribution of GERP scores at noncoding sites across the human genome to show that the degree of selection at individual sites has changed throughout mammalian evolution. Importantly, we demonstrate that at least 80 Mb of noncoding sequence under purifying selection in humans will not have extreme GERP scores and will likely be missed by modern comparative genomic approaches. Our work argues that new approaches, potentially based on genetic variation within species, will be required to identify deleterious mutations.
- Published
- 2020
23. DNA extraction and Nanopore library prep from 15-30 whole flies v1
- Author
-
Bernard Y Kim, Danny E. Miller, and Jeremy Wang
- Subjects
Nanopore ,Chromatography ,Chemistry ,DNA extraction - Abstract
We have been assembling the genomes of many Drosophila species. With that in mind, we developed this protocol to keep the cost of sequencing down to 100kbp. Sequencing is halted at about 40-50X depth of coverage (8-10 Gbp for most species). This of course depends on the quality of the sample, quality and quantity of the prepared library, and the frequency at which the flow cell is flushed and reloaded. We typically run 3-4 species per 2 flow cells, usually for ~14-18 Gbp of data per flow cell. This protocol borrows several elements from John Tyson's "Rocky Mountain" protocol and we thank him for several insightful discussions. https://www.protocols.io/view/rocky-mountain-adventures-in-genomic-dna-sample-pr-7euhjew
- Published
- 2020
24. Recessive deleterious variation has a limited impact on signals of adaptive introgression in human populations
- Author
-
Bernard Y. Kim, Emilia Huerta-Sanchez, Xinjun Zhang, and Kirk E. Lohmueller
- Subjects
Genetics ,0303 health sciences ,Candidate gene ,Null model ,Introgression ,Population genetics ,Biology ,03 medical and health sciences ,0302 clinical medicine ,False positive paradox ,Human genome ,Adaptation ,Gene ,030217 neurology & neurosurgery ,030304 developmental biology - Abstract
Admixture with archaic hominins has altered the landscape of genomic variation in modern human populations. Several gene regions have been previously identified as candidates of adaptive introgression (AI) that facilitated human adaptation to specific environments. However, simulation-based studies have suggested that population genetics processes other than adaptive mutations, such as heterosis from recessive deleterious variants private to populations before admixture, can also lead to patterns in genomic data that resemble adaptive introgression. The extent to which the presence of deleterious variants affect the false-positive rate and the power of current methods to detect AI has not been fully assessed. Here, we used extensive simulations to show that recessive deleterious mutations can increase the false positive rates of tests for AI compared to models without deleterious variants. We further examined candidates of AI in modern humans identified from previous studies and show that, although deleterious variants may hinder the performance of AI detection in modern humans, most signals remained robust when deleterious variants are included in the null model. While deleterious variants may have a limited impact on detecting signals of adaptive introgression in humans, we found that at least two AI candidate genes,HYAL2andHLA, are particularly susceptible to high false positive rates due to the recessive deleterious mutations. By quantifying parameters that affect heterosis, we show that the high false positives are largely attributed to the high exon densities together with low recombination rates in the genomic regions, which can further be exaggerated by the population growth in recent human evolution. Although the combination of such parameters is rare in the human genome, caution is still warranted in other species with different genomic composition and demographic histories.
- Published
- 2020
- Full Text
- View/download PDF
25. A community-maintained standard library of population genetic models
- Author
-
Christopher C. Kyriazis, Bernard Y. Kim, Adam Siepel, Graham Gower, Franz Baumdicker, Patrick F. McKenzie, Christopher B. Cole, Jeffrey R. Adrion, Noah Dukler, Jedidiah Carlson, Arun Durvasula, Peter L. Ralph, Georgia Tsambos, Travis J. Struck, Fernando Racimo, Kirk E. Lohmueller, Diego Ortega-Del Vecchyo, Jerome Kelleher, Reed A. Cartwright, Philipp W. Messer, Simon Gravel, Daniel R. Schrider, Ryan N. Gutenkunst, Ilan Gronau, Ariella L. Gladstein, Jared Galloway, Andrew D. Kern, Ekaterina Noskova, and Aaron P. Ragsdale
- Subjects
0301 basic medicine ,Computer science ,ROBUST ,Arabidopsis ,Population genetics ,Inference ,Field (computer science) ,Task (project management) ,Resource (project management) ,0302 clinical medicine ,open source ,HISTORY ,Biology (General) ,computer.programming_language ,0303 health sciences ,education.field_of_study ,Genomic Library ,Genome ,General Neuroscience ,General Medicine ,simulation ,Tools and Resources ,GENOME ,Drosophila melanogaster ,Medicine ,Computational and Systems Biology ,Human ,Pongo abelii ,QH301-705.5 ,Systems biology ,Science ,Population ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Dogs ,Genetic model ,Escherichia coli ,Animals ,Humans ,education ,reproducibility ,030304 developmental biology ,Gene Library ,Evolutionary Biology ,General Immunology and Microbiology ,Models, Genetic ,MUTATIONS ,Genome, Human ,Simulation modeling ,Computational genomics ,Python (programming language) ,Data science ,EVOLUTION ,030104 developmental biology ,Open source ,SIZE ,Genetics, Population ,INFERENCE ,computer ,030217 neurology & neurosurgery ,Software - Abstract
The explosion in population genomic data demands ever more complex modes of analysis, and increasingly these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here we describe a new resource,stdpopsim, that attempts to rectify this situation.Stdpopsimis a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating howstdpopsimcan be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.
- Published
- 2019
- Full Text
- View/download PDF
26. Evolution and development of male-specific leg brushes in Drosophilidae
- Author
-
Kohtaro Tanaka, Olga Barmina, Ammon Thompson, Jonathan H. Massey, Bernard Y. Kim, Anton Suvorov, and Artyom Kopp
- Subjects
Male ,0106 biological sciences ,Morphology (biology) ,Biology ,010603 evolutionary biology ,01 natural sciences ,03 medical and health sciences ,Drosophilidae ,Convergent evolution ,Genetics ,Animals ,Phylogeny ,Selection (genetic algorithm) ,030304 developmental biology ,Sex Characteristics ,0303 health sciences ,Ornaments ,biology.organism_classification ,Biological Evolution ,Sex specific ,Sexual dimorphism ,Drosophila melanogaster ,Phenotype ,Evolutionary biology ,Sexual selection ,Female ,Developmental Biology - Abstract
The origin, diversification, and secondary loss of sexually dimorphic characters are common in animal evolution. In some cases, structurally and functionally similar traits have evolved independently in multiple lineages. Prominent examples of such traits include the male-specific grasping structures that develop on the front legs of many dipteran insects. In this report, we describe the evolution and development of one of these structures, the male-specific “sex brush”. The sex brush is composed of densely packed, irregularly arranged modified bristles and is found in several distantly related lineages in the family Drosophilidae. Phylogenetic analysis using 250 genes from over 200 species provides modest support for a single origin of the sex brush followed by many secondary losses; however, independent origins of the sex brush cannot be ruled out completely. We show that sex brushes develop in very similar ways in all brush-bearing lineages. The dense packing of brush hairs is explained by the specification of bristle precursor cells at a near-maximum density permitted by the lateral inhibition mechanism, as well as by the reduced size of the surrounding epithelial cells. In contrast to the female and the ancestral male condition, where bristles are arranged in stereotypical, precisely spaced rows, cell migration does not contribute appreciably to the formation of the sex brush. The complex phylogenetic history of the sex brush can make it a valuable model for investigating coevolution of sex-specific morphology and mating behavior.
- Published
- 2019
27. Genomic Flatlining in the Endangered Island Fox
- Author
-
Bernard Y. Kim, Diego Ortega-Del Vecchyo, Jacqueline Robinson, Zhenxin Fan, Robert K. Wayne, Kirk E. Lohmueller, Bridgett M. vonHoldt, and Clare D. Marsden
- Subjects
0301 basic medicine ,Genotype ,Population ,Endangered species ,Foxes ,Genomics ,California ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Genetic variation ,Animals ,education ,Demography ,Islands ,education.field_of_study ,Habitat fragmentation ,Natural selection ,biology ,Ecology ,Endangered Species ,Genetic Variation ,Small population size ,biology.organism_classification ,030104 developmental biology ,Evolutionary biology ,Urocyon ,General Agricultural and Biological Sciences - Abstract
Summary Genetic studies of rare and endangered species often focus on defining and preserving genetically distinct populations, especially those having unique adaptations [1, 2]. Much less attention is directed at understanding the landscape of deleterious variation, an insidious consequence of geographic isolation and the inefficiency of natural selection to eliminate harmful variants in small populations [3–5]. With population sizes of many vertebrates decreasing and isolation increasing through habitat fragmentation and loss, understanding the extent and nature of deleterious variation in small populations is essential for predicting and enhancing population persistence. The Channel Island fox ( Urocyon littoralis ) is a dwarfed species that inhabits six of California's Channel Islands and is derived from the mainland gray fox ( U. cinereoargenteus ). These isolated island populations have persisted for thousands of years at extremely small population sizes [6, 7] and, consequently, are a model for testing ideas about the accumulation of deleterious variation in small populations under natural conditions. Analysis of complete genome sequence data from island foxes shows a dramatic decrease in genome-wide variation and a sharp increase in the homozygosity of deleterious variants. The San Nicolas Island population has a near absence of variation, demonstrating a unique genetic flatlining that is punctuated by heterozygosity hotspots, enriched for olfactory receptor genes and other genes with high levels of ancestral variation. These findings question the generality of the small-population paradigm that maintains substantial genetic variation is necessary for short- and long-term persistence.
- Published
- 2016
28. Purging of strongly deleterious mutations explains long-term persistence and absence of inbreeding depression in island foxes
- Author
-
Caitlin Brown, Jacqueline Robinson, Bernard Y. Kim, Kirk E. Lohmueller, and Robert K. Wayne
- Subjects
0301 basic medicine ,Population ,Endangered species ,Zoology ,Foxes ,General Biochemistry, Genetics and Molecular Biology ,Article ,California ,Gene flow ,03 medical and health sciences ,parasitic diseases ,Inbreeding depression ,Animals ,education ,Islands ,Population Density ,Genetic diversity ,education.field_of_study ,Inbreeding Depression ,biology ,Endangered Species ,Genetic Variation ,Small population size ,biology.organism_classification ,030104 developmental biology ,Threatened species ,Urocyon ,General Agricultural and Biological Sciences - Abstract
Summary The recovery and persistence of rare and endangered species are often threatened by genetic factors, such as the accumulation of deleterious mutations, loss of adaptive potential, and inbreeding depression [ 1 ]. Island foxes (Urocyon littoralis), the dwarfed descendants of mainland gray foxes (Urocyon cinereoargenteus), have inhabited California’s Channel Islands for >9,000 years [ 2 , 3 , 4 ]. Previous genomic analyses revealed that island foxes have exceptionally low levels of diversity and elevated levels of putatively deleterious variation [ 5 ]. Nonetheless, all six populations have persisted for thousands of generations, and several populations rebounded rapidly after recent severe bottlenecks [ 6 , 7 ]. Here, we combine morphological and genomic data with population-genetic simulations to determine the mechanism underlying the enigmatic persistence of these foxes. First, through analysis of genomes from 1929 to 2009, we show that island foxes have remained at small population sizes with low diversity for many generations. Second, we present morphological data indicating an absence of inbreeding depression in island foxes, confirming that they are not afflicted with congenital defects common to other small and inbred populations. Lastly, our population-genetic simulations suggest that long-term small population size results in a reduced burden of strongly deleterious recessive alleles, providing a mechanism for the absence of inbreeding depression in island foxes. Importantly, the island fox illustrates a scenario in which genetic restoration through human-assisted gene flow could be a counterproductive or even harmful conservation strategy. Our study sheds light on the puzzle of island fox persistence, a unique success story that provides a model for the preservation of small populations.
- Published
- 2018
29. Deleterious variation mimics signatures of genomic incompatibility and adaptive introgression
- Author
-
Bernard Y. Kim, Christian D. Huber, and Kirk E. Lohmueller
- Subjects
0106 biological sciences ,0303 health sciences ,education.field_of_study ,Heterosis ,Population size ,Population ,Introgression ,Biology ,010603 evolutionary biology ,01 natural sciences ,Genetic load ,Gene flow ,03 medical and health sciences ,Evolutionary biology ,Genetic variation ,education ,030304 developmental biology ,Dominance (genetics) - Abstract
While it is appreciated that population size changes can impact patterns of deleterious variation in natural populations, less attention has been paid to how population admixture affects the dynamics of deleterious variation. Here we use population genetic simulations to examine how admixture impacts deleterious variation under a variety of demographic scenarios, dominance coefficients, and recombination rates. Our results show that gene flow between populations can temporarily reduce the genetic load of smaller populations, especially if deleterious mutations are recessive. Additionally, when fitness effects of new mutations are recessive, between-population differences in the sites at which deleterious variants exist creates heterosis in hybrid individuals. This can lead to an increase in introgressed ancestry, particularly when recombination rates are low. Under certain scenarios, introgressed ancestry can increase from an initial frequency of 5% to 30-75% and fix at many loci, even in the absence of beneficial mutations. Further, deleterious variation and admixture can generate correlations between the frequency of introgressed ancestry and recombination rate or exon density, even in the absence of other types of selection. The direction of these correlations is determined by the specific demography and whether mutations are additive or recessive. Therefore, it is essential that null models include both demography and deleterious variation before invoking reproductive incompatibilities or adaptive introgression to explain unusual patterns of genetic variation.
- Published
- 2017
- Full Text
- View/download PDF
30. RADseq data reveal ancient, but not pervasive, introgression between Californian tree and scrub oak species (Quercus sect. Quercus: Fagaceae)
- Author
-
Victoria L. Sork, Xinzeng Wei, Joaquín Ortego, Bernard Y. Kim, Paul F. Gugger, Kirk E. Lohmueller, and Sorel Fitz-Gibbon
- Subjects
0301 basic medicine ,Systematics ,Gene Flow ,Introgression ,Biology ,California ,Gene flow ,Trees ,Evolution, Molecular ,03 medical and health sciences ,Monophyly ,Quercus ,Genetics ,Clade ,Ecology, Evolution, Behavior and Systematics ,Alleles ,Phylogeny ,Models, Genetic ,Reproductive isolation ,biology.organism_classification ,Fagaceae ,Sympatry ,030104 developmental biology ,Genetics, Population ,Evolutionary biology ,Sympatric speciation ,Hybridization, Genetic - Abstract
A long-term debate in evolutionary biology is the extent to which reproductive isolation is a necessary element of speciation. Hybridizing plants in general are cited as evidence against this notion, and oaks specifically have been used as the classic example of species maintenance without reproductive isolation. Here, we use thousands of SNPs generated by RAD sequencing to describe the phylogeny of a set of sympatric white oak species in California and then test whether these species exhibit pervasive interspecific gene exchange. Using RAD sequencing, we first constructed a phylogeny of ten oak species found in California. Our phylogeny revealed that seven scrub oak taxa occur within one clade that diverged from a common ancestor with Q. lobata, that they comprise two subclades, and they are not monophyletic but include the widespread tree oak Q. douglasii. Next, we searched for genomic patterns of allele sharing consistent with gene flow between long-divergent tree oaks with scrub oaks. Specifically, we utilized the D-statistic as well as model-based inference to compare the signature of shared alleles between two focal tree species (Q. lobata and Q. engelmannii) with multiple scrub species within the two subclades. We found that introgression is not equally pervasive between sympatric tree and scrub oak species. Instead, gene flow commonly occurs from scrub oaks to recently sympatric Q. engelmannii, but less so from scrub oaks to long-sympatric Q. lobata. This case study illustrates the influence of ancient introgression and impact of reproductive isolating mechanisms in preventing indiscriminate interspecific gene exchange.
- Published
- 2017
31. Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples
- Author
-
Bernard Y. Kim, Kirk E. Lohmueller, and Christian D. Huber
- Subjects
0301 basic medicine ,Nonsynonymous substitution ,site frequency spectrum ,Evolution ,Population ,European Continental Ancestry Group ,Population genetics ,deleterious mutations ,Investigations ,Biology ,White People ,Evolution, Molecular ,03 medical and health sciences ,0302 clinical medicine ,Genetic ,Models ,Statistics ,Gamma distribution ,Genetics ,diffusion theory ,Mixture distribution ,Humans ,Exome ,Selection, Genetic ,1000 Genomes Project ,Population and Evolutionary Genetics ,Selection ,Selection (genetic algorithm) ,030304 developmental biology ,0303 health sciences ,Models, Genetic ,Small number ,Selection coefficient ,Molecular ,High-Throughput Nucleotide Sequencing ,population genetics ,Sequence Analysis, DNA ,DNA ,030104 developmental biology ,Genetics, Population ,Evolutionary biology ,Sample Size ,Mutation ,Genetic Fitness ,Sequence Analysis ,030217 neurology & neurosurgery ,Developmental Biology - Abstract
The distribution of fitness effects (DFE) has considerable importance in population genetics. To date, estimates of the DFE come from studies using a small number of individuals. Thus, estimates of the proportion of moderately to strongly deleterious new mutations may be unreliable because such variants are unlikely to be segregating in the data. Additionally, the true functional form of the DFE is unknown, and estimates of the DFE differ significantly between studies. Here we present a flexible and computationally tractable method, called Fit∂a∂i, to estimate the DFE of new mutations using the site frequency spectrum from a large number of individuals. We apply our approach to the frequency spectrum of 1300 Europeans from the Exome Sequencing Project ESP6400 data set, 1298 Danes from the LuCamp data set, and 432 Europeans from the 1000 Genomes Project to estimate the DFE of deleterious nonsynonymous mutations. We infer significantly fewer (0.38–0.84 fold) strongly deleterious mutations with selection coefficient |s| > 0.01 and more (1.24–1.43 fold) weakly deleterious mutations with selection coefficient |s| < 0.001 compared to previous estimates. Furthermore, a DFE that is a mixture distribution of a point mass at neutrality plus a gamma distribution fits better than a gamma distribution in two of the three data sets. Our results suggest that nearly neutral forces play a larger role in human evolution than previously thought.
- Published
- 2017
32. Determining the factors driving selective effects of new nonsynonymous mutations
- Author
-
Bernard Y. Kim, Christian D. Huber, Clare D. Marsden, and Kirk E. Lohmueller
- Subjects
0106 biological sciences ,0301 basic medicine ,Nonsynonymous substitution ,01 natural sciences ,Mice ,Fisher's geometrical model ,Models ,Yeasts ,Poisson random field ,Stabilizing selection ,Genetics ,education.field_of_study ,Multidisciplinary ,Population size ,Selection coefficient ,Biological Sciences ,Adaptation, Physiological ,Phenotype ,Drosophila melanogaster ,protein stability ,Biotechnology ,Evolution ,Physiological ,Population ,Genomics ,Biology ,010603 evolutionary biology ,Fisher’s geometrical model ,Evolution, Molecular ,03 medical and health sciences ,Species Specificity ,Genetic ,Animals ,Humans ,Selection, Genetic ,Adaptation ,education ,Selection ,Models, Genetic ,Human evolutionary genetics ,Human Genome ,Robustness (evolution) ,Molecular ,mutational robustness ,biology.organism_classification ,030104 developmental biology ,Evolutionary biology ,Mutation ,Genetic Fitness ,Generic health relevance ,distribution of fitness effects - Abstract
The distribution of fitness effects (DFE) of new mutations plays a fundamental role in evolutionary genetics. However, the extent to which the DFE differs across species has yet to be systematically investigated. Furthermore, the biological mechanisms determining the DFE in natural populations remain unclear. Here, we show that theoretical models emphasizing different biological factors at determining the DFE, such as protein stability, back-mutations, species complexity, and mutational robustness make distinct predictions about how the DFE will differ between species. Analyzing amino acid-changing variants from natural populations in a comparative population genomic framework, we find that humans have a higher proportion of strongly deleterious mutations than Drosophila melanogaster. Furthermore, when comparing the DFE across yeast, Drosophila, mice, and humans, the average selection coefficient becomes more deleterious with increasing species complexity. Last, pleiotropic genes have a DFE that is less variable than that of nonpleiotropic genes. Comparing four categories of theoretical models, only Fisher's geometrical model (FGM) is consistent with our findings. FGM assumes that multiple phenotypes are under stabilizing selection, with the number of phenotypes defining the complexity of the organism. Our results suggest that long-term population size and cost of complexity drive the evolution of the DFE, with many implications for evolutionary and medical genomics.
- Published
- 2016
- Full Text
- View/download PDF
33. Deleterious variation shapes the genomic landscape of introgression
- Author
-
Christian D. Huber, Bernard Y. Kim, and Kirk E. Lohmueller
- Subjects
0301 basic medicine ,Cancer Research ,Heredity ,Introgression ,Plant Science ,Plant Genetics ,Gene flow ,Gene Frequency ,Natural Selection ,Plant Genomics ,Genetics (clinical) ,education.field_of_study ,Natural selection ,Simulation and Modeling ,Eukaryota ,Genomics ,Plants ,Genetic load ,Deletion Mutation ,Experimental Organism Systems ,Engineering and Technology ,Heterosis ,Genetic Load ,Research Article ,Biotechnology ,Gene Flow ,Evolutionary Processes ,lcsh:QH426-470 ,Arabidopsis Thaliana ,Population ,Bioengineering ,Brassica ,Biology ,Research and Analysis Methods ,Evolution, Molecular ,03 medical and health sciences ,Model Organisms ,Plant and Algal Models ,Genetic variation ,Hybrid Vigor ,Genetics ,Humans ,Computer Simulation ,Selection, Genetic ,Allele ,education ,Molecular Biology ,Allele frequency ,Alleles ,Ecology, Evolution, Behavior and Systematics ,Demography ,Population Density ,Evolutionary Biology ,Population Biology ,Models, Genetic ,Organisms ,Biology and Life Sciences ,Genetic Variation ,Mating system ,lcsh:Genetics ,Genetics, Population ,030104 developmental biology ,Evolutionary biology ,Mutation ,Animal Studies ,Hybridization, Genetic ,Plant Biotechnology ,Population Genetics - Abstract
While it is appreciated that population size changes can impact patterns of deleterious variation in natural populations, less attention has been paid to how gene flow affects and is affected by the dynamics of deleterious variation. Here we use population genetic simulations to examine how gene flow impacts deleterious variation under a variety of demographic scenarios, mating systems, dominance coefficients, and recombination rates. Our results show that admixture between populations can temporarily reduce the genetic load of smaller populations and cause increases in the frequency of introgressed ancestry, especially if deleterious mutations are recessive. Additionally, when fitness effects of new mutations are recessive, between-population differences in the sites at which deleterious variants exist creates heterosis in hybrid individuals. Together, these factors lead to an increase in introgressed ancestry, particularly when recombination rates are low. Under certain scenarios, introgressed ancestry can increase from an initial frequency of 5% to 30–75% and fix at many loci, even in the absence of beneficial mutations. Further, deleterious variation and admixture can generate correlations between the frequency of introgressed ancestry and recombination rate or exon density, even in the absence of other types of selection. The direction of these correlations is determined by the specific demography and whether mutations are additive or recessive. Therefore, it is essential that null models of admixture include both demography and deleterious variation before invoking other mechanisms to explain unusual patterns of genetic variation., Author summary Individuals from distinct populations sometimes will produce fertile offspring and will exchange genetic material in a process called hybridization. Genomes of hybrid individuals often show non-random patterns of hybrid ancestry across the genome, where some regions have a high frequency of ancestry from the second population and other regions have less. Typically, this pattern has been attributed to adaptive introgression, where beneficial genetic variants are passed from one population to the other, or to genomic incompatibilities between these distinct species. However, other mechanisms could lead to these heterogeneous patterns of ancestry in hybrids. Here we use simulations to investigate whether deleterious mutations affect the patterns of introgressed ancestry across genomes. We show that when ancestry from a larger population is added to a smaller population, the ancestry from the larger population dramatically increases in frequency because it carries fewer deleterious mutations. This occurs even in the absence of beneficial mutations in either population. Additionally, we show that differences in sex chromosome evolution relative to autosomes, or differences in mating system, can affect patterns of introgression in similar ways. Our study argues that deleterious mutations should be included in population genetic models used to identify unusual regions of the genome that appear to be under selection in hybrids.
- Published
- 2018
34. Selection and Reduced Population Size Cannot Explain Higher Amounts of Neandertal Ancestry in East Asian than in European Human Populations
- Author
-
Bernard Y. Kim and Kirk E. Lohmueller
- Subjects
Range (biology) ,Population ,Biology ,Medical and Health Sciences ,White People ,Negative selection ,Asian People ,Report ,Genetics ,Animals ,Humans ,East Asia ,Genetics(clinical) ,education ,Genetics (clinical) ,Selection (genetic algorithm) ,Alleles ,Neanderthals ,Genetics & Heredity ,Population Density ,education.field_of_study ,Genome ,Whites ,Population size ,Biological Sciences ,Asians ,Genetics, Population ,Evolutionary biology - Abstract
It has been hypothesized that the greater proportion of Neandertal ancestry in East Asians than in Europeans is due to the fact that purifying selection is less effective at removing weakly deleterious Neandertal alleles from East Asian populations. Using simulations of a broad range of models of selection and demography, we have shown that this hypothesis cannot account for the higher proportion of Neandertal ancestry in East Asians than in Europeans. Instead, more complex demographic scenarios, most likely involving multiple pulses of Neandertal admixture, are required to explain the data.
- Full Text
- View/download PDF
35. Population genetic models of GERP scores suggest pervasive turnover of constrained sites across mammalian evolution.
- Author
-
Christian D Huber, Bernard Y Kim, and Kirk E Lohmueller
- Subjects
Genetics ,QH426-470 - Abstract
Comparative genomic approaches have been used to identify sites where mutations are under purifying selection and of functional consequence by searching for sequences that are conserved across distantly related species. However, the performance of these approaches has not been rigorously evaluated under population genetic models. Further, short-lived functional elements may not leave a footprint of sequence conservation across many species. We use simulations to study how one measure of conservation, the Genomic Evolutionary Rate Profiling (GERP) score, relates to the strength of selection (Nes). We show that the GERP score is related to the strength of purifying selection. However, changes in selection coefficients or functional elements over time (i.e. functional turnover) can strongly affect the GERP distribution, leading to unexpected relationships between GERP and Nes. Further, we show that for functional elements that have a high turnover rate, adding more species to the analysis does not necessarily increase statistical power. Finally, we use the distribution of GERP scores across the human genome to compare models with and without turnover of sites where mutations are under purifying selection. We show that mutations in 4.51% of the noncoding human genome are under purifying selection and that most of this sequence has likely experienced changes in selection coefficients throughout mammalian evolution. Our work reveals limitations to using comparative genomic approaches to identify deleterious mutations. Commonly used GERP score thresholds miss over half of the noncoding sites in the human genome where mutations are under purifying selection.
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.