46 results on '"Sarah B. Kingan"'
Search Results
2. Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle
- Author
-
Wai Yee Low, Rick Tearle, Ruijie Liu, Sergey Koren, Arang Rhie, Derek M. Bickhart, Benjamin D. Rosen, Zev N. Kronenberg, Sarah B. Kingan, Elizabeth Tseng, Françoise Thibaud-Nissen, Fergal J. Martin, Konstantinos Billis, Jay Ghurye, Alex R. Hastie, Joyce Lee, Andy W. C. Pang, Michael P. Heaton, Adam M. Phillippy, Stefan Hiendleder, Timothy P. L. Smith, and John L. Williams
- Subjects
Science - Abstract
Taurine and indicine cattle have different desirable traits making them better adapted to different climates across the world. Here, Low et al. describe a pipeline to produce haplotype-resolved, chromosome-level genomes of Angus and Brahman cattle breeds from a crossbred individual and report on comparisons of the two genomes.
- Published
- 2020
- Full Text
- View/download PDF
3. Genomic Resources to Guide Improvement of the Shea Tree
- Author
-
Iago Hale, Xiao Ma, Arthur T. O. Melo, Francis Kwame Padi, Prasad S. Hendre, Sarah B. Kingan, Shawn T. Sullivan, Shiyu Chen, Jean-Marc Boffa, Alice Muchugi, Agyemang Danquah, Michael Teye Barnor, Ramni Jamnadass, Yves Van de Peer, and Allen Van Deynze
- Subjects
shea tree ,Vitellaria paradoxa ,reference genome ,fatty acids ,SNPs ,whole genome duplication ,Plant culture ,SB1-1110 - Abstract
A defining component of agroforestry parklands across Sahelo-Sudanian Africa (SSA), the shea tree (Vitellaria paradoxa) is central to sustaining local livelihoods and the farming environments of rural communities. Despite its economic and cultural value, however, not to mention the ecological roles it plays as a dominant parkland species, shea remains semi-domesticated with virtually no history of systematic genetic improvement. In truth, shea’s extended juvenile period makes traditional breeding approaches untenable; but the opportunity for genome-assisted breeding is immense, provided the foundational resources are available. Here we report the development and public release of such resources. Using the FALCON-Phase workflow, 162.6 Gb of long-read PacBio sequence data were assembled into a 658.7 Mbp, chromosome-scale reference genome annotated with 38,505 coding genes. Whole genome duplication (WGD) analysis based on this gene space revealed clear signatures of two ancient WGD events in shea’s evolutionary past, one prior to the Astrid-Rosid divergence (116–126 Mya) and the other at the root of the order Ericales (65–90 Mya). In a first genome-wide look at the suite of fatty acid (FA) biosynthesis genes that likely govern stearin content, the primary determinant of shea butter quality, relatively high copy numbers of six key enzymes were found (KASI, KASIII, FATB, FAD2, FAD3, and FAX2), some likely originating in shea’s more recent WGD event. To help translate these findings into practical tools for characterization, selection, and genome-wide association studies (GWAS), resequencing data from a shea diversity panel was used to develop a database of more than 3.5 million functionally annotated, physically anchored SNPs. Two smaller, more curated sets of suggested SNPs, one for GWAS (104,211 SNPs) and the other targeting FA biosynthesis genes (90 SNPs), are also presented. With these resources, the hope is to support national programs across the shea belt in the strategic, genome-enabled conservation and long-term improvement of the shea tree for SSA.
- Published
- 2021
- Full Text
- View/download PDF
4. A draft phased assembly of the diploid Cascade hop (Humulus lupulus) genome
- Author
-
Lillian K. Padgitt‐Cobb, Sarah B. Kingan, Jackson Wells, Justin Elser, Brent Kronmiller, Daniel Moore, Gregory Concepcion, Paul Peluso, David Rank, Pankaj Jaiswal, John Henning, and David A. Hendrix
- Subjects
Plant culture ,SB1-1110 ,Genetics ,QH426-470 - Abstract
Abstract Hop (Humulus lupulus L. var Lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing and contain compounds of therapeutic interest including xanthohumol. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large (2.8 Gb), repetitive, and heterozygous genome of hop. We present a draft haplotype‐phased assembly of the Cascade cultivar genome. Our draft assembly and annotation of the Cascade genome is the most extensive representation of the hop genome to date. PacBio long‐read sequences from hop were assembled with FALCON and partially phased with FALCON‐Unzip. Comparative analysis of haplotype sequences provides insight into selective pressures that have driven evolution in hop. We discovered genes with greater sequence divergence enriched for stress‐response, growth, and flowering functions in the draft phased assembly. With improved resolution of long terminal retrotransposons (LTRs) due to long‐read sequencing, we found that hop is over 70% repetitive. We identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze the draft phased assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.
- Published
- 2021
- Full Text
- View/download PDF
5. Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity
- Author
-
Wai Yee Low, Rick Tearle, Derek M. Bickhart, Benjamin D. Rosen, Sarah B. Kingan, Thomas Swale, Françoise Thibaud-Nissen, Terence D. Murphy, Rachel Young, Lucas Lefevre, David A. Hume, Andrew Collins, Paolo Ajmone-Marsan, Timothy P. L. Smith, and John L. Williams
- Subjects
Science - Abstract
Despite technological advances, chromosome-level assemblies of mammalian genomes are still rare. Here, the authors use PacBio, Chicago and Hi-C approaches to generate a highly contiguous and partially-phased genome assembly for the water buffalo, Bubalus bubalis
- Published
- 2019
- Full Text
- View/download PDF
6. A high-quality Ixodes scapularis genome advances tick science
- Author
-
Sandip De, Sarah B. Kingan, Chrysoula Kitsou, Daniel M. Portik, Shelby D. Foor, Julia C. Frederick, Vipin S. Rana, Nicole S. Paulat, David A. Ray, Yan Wang, Travis C. Glenn, and Utpal Pal
- Subjects
Genetics - Published
- 2023
- Full Text
- View/download PDF
7. Design and performance of a long-read sequencing panel for pharmacogenomics
- Author
-
Maaike van der Lee, Loes Busscher, Roberta Menafra, Qinglian Zhai, Redmar R. van den Berg, Sarah B Kingan, Nina Gonzaludo, Ting Hon, Ting Han, Leonardo Arbiza, Ibrahim Numanagić, Susan L. Kloet, and Jesse J. Swen
- Abstract
Pharmacogenomics (PGx)-guided drug treatment is one of the cornerstones of personalized medicine. However, the genes involved in drug response are highly complex and known to carry many (rare) variants. Current technologies (short-read sequencing and SNP panels) are limited in their ability to resolve these genes and characterize all variants. Moreover, these technologies cannot always phase variants to their allele of origin. Recent advance in long-read sequencing technologies have shown promise in resolving these problems. Here we present a long-read sequencing panel-based approach for PGx using PacBio HiFi sequencing.A capture based approach was developed using a custom panel of clinically-relevant pharmacogenes including up- and downstream regions. A total of 27 samples were sequenced and panel accuracy was determined using benchmarking variant calls for 3 Genome in a Bottle samples and GeT-RM star(*)-allele calls for 21 samples..The coverage was uniform for all samples with an average of 94% of bases covered at >30×. When compared to benchmarking results, accuracy was high with an average F1 score of 0.89 for INDELs and 0.98 for SNPs. Phasing was good with an average of 68% the target region phased (compared to ~20% for short-reads) and an average phased haploblock size of 6.6kbp. Using Aldy 4, we compared our variant calls to GeT-RM data for 8 genes (CYP2B6, CYP2C19, CYP2C9, CYP2D6, CYP3A4, CYP3A5, SLCO1B1, TPMT), and observed highly accurate star(*)-allele calling with 98.2% concordance (165/168 calls), with only one discordance inCYP2C9leading to a different predicted phenotype.We have shown that our long-read panel-based approach results in high accuracy and target phasing for SNVs as well as for clinical star(*)-alleles.
- Published
- 2022
- Full Text
- View/download PDF
8. Towards complete and error-free genome assemblies of all vertebrate species
- Author
-
Richard Hall, Tandy Warnow, Tanya M. Lama, Oliver A. Ryder, David Haussler, Matthew T. Biegler, Klaus-Peter Koepfli, Ivo Gut, Paul Flicek, Mark Chaisson, James Torrance, Guojie Zhang, Andrew J. Crawford, Federica Di Palma, Michael Hiller, Jennifer A. Marshall Graves, Sadye Paez, Sarah E. London, Mark Wilkinson, Kateryna D. Makova, Byung June Ko, Jimin George, Farooq O. Al-Ajli, Emma C. Teeling, George F. Turner, Robert H. S. Kraus, Sonja C. Vernes, Zev N. Kronenberg, Michelle Smith, Jonas Korlach, Daryl Eason, Jonathan Wood, Simona Secomandi, Claudio V. Mello, Arkarachai Fungtammasan, Arang Rhie, Tomas Marques-Bonet, Benedict Paten, Ekaterina Osipova, Richard Durbin, M. Thomas P. Gilbert, Beth Shapiro, Ivan Sović, Bruce C. Robertson, Richard E. Green, Eugene W. Myers, Leanne Haggerty, Sergey Koren, Martin Pippel, Bettina Haase, Patrick Masterson, Jay Ghurye, Maria Simbirsky, Samantha R. Friedrich, Chul Hee Lee, Luis R Nassar, Lindsey J. Cantin, Kerstin Howe, Erich D. Jarvis, Marlys L. Houck, Jason T. Howard, Jacquelyn Mountcastle, Mark Mooney, Paolo Franchini, Giulio Formenti, Siddarth Selvaraj, Robel E. Dagnew, Brett T. Hannigan, Brian P. Walenz, Alan Tracey, Heebal Kim, Constantina Theofanopoulou, Nicholas H. Putnam, Karen Clark, Iliana Bista, H. William Detrich, Dengfeng Guan, David Iorns, Andrew Digby, Trevor Pesout, Zemin Ning, Gregory Gedman, Woori Kwak, Maximilian Wagner, Joanna Collins, Harris A. Lewin, Hannes Svardal, Milan Malinsky, Byrappa Venkatesh, Françoise Thibaud-Nissen, Joana Damas, Andreas F. Kautt, Olivier Fedrigo, Christopher Dunn, William Chow, Warren E. Johnson, Yang Zhou, Adam M. Phillippy, Taylor Edwards, Paul Medvedev, Peter V. Lovell, Joyce V. Lee, Sylke Winkler, Stephen J. O'Brien, Wesley C. Warren, Alex Hastie, Marcela Uliano-Silva, Kevin L. Howe, Sarah B. Kingan, Fergal J. Martin, Christopher N. Balakrishnan, David F. Clayton, Ying Sims, Robert W. Murphy, Axel Meyer, Dave W Burt, Shane A. McCarthy, Sarah Pelan, Erik Garrison, Mark Diekhans, Frank Grützner, Gavin J. P. Naylor, Robert S. Harris, Hiram Clawson, Jinna Hoffman, Ann C Misuraca, J. H. Kim, University of St Andrews. School of Biology, University of St Andrews. St Andrews Bioinformatics Unit, Rhie, Arang [0000-0002-9809-8127], Fedrigo, Olivier [0000-0002-6450-7551], Formenti, Giulio [0000-0002-7554-5991], Koren, Sergey [0000-0002-1472-8962], Uliano-Silva, Marcela [0000-0001-6723-4715], Thibaud-Nissen, Francoise [0000-0003-4957-7807], Mountcastle, Jacquelyn [0000-0003-1078-4905], Winkler, Sylke [0000-0002-0915-3316], Vernes, Sonja C. [0000-0003-0305-4584], Grutzner, Frank [0000-0002-3088-7314], Balakrishnan, Christopher N. [0000-0002-0788-0659], Burt, Dave [0000-0002-9991-1028], George, Julia M. [0000-0001-6194-6914], Digby, Andrew [0000-0002-1870-8811], Robertson, Bruce [0000-0002-5348-2731], Edwards, Taylor [0000-0002-7235-6175], Meyer, Axel [0000-0002-0888-8193], Kautt, Andreas F. [0000-0001-7792-0735], Franchini, Paolo [0000-0002-8184-1463], Detrich, H. William, III [0000-0002-0783-4505], Pippel, Martin [0000-0002-8134-5929], Malinsky, Milan [0000-0002-1462-6317], Kingan, Sarah B. [0000-0002-4900-0189], Hall, Richard [0000-0001-6490-8227], Dunn, Christopher [0000-0002-0601-3254], Lee, Joyce [0000-0002-3492-1102], Putnam, Nicholas H. [0000-0002-1315-782X], Gut, Ivo [0000-0001-7219-632X], Tracey, Alan [0000-0002-4805-9058], Guan, Dengfeng [0000-0002-6376-3940], London, Sarah E. [0000-0002-7839-2644], Clayton, David F. [0000-0002-6395-3488], Mello, Claudio V. [0000-0002-9826-8421], Friedrich, Samantha R. [0000-0003-0570-6080], Osipova, Ekaterina [0000-0002-6769-7223], Al-Ajli, Farooq O. [0000-0002-4692-7106], Secomandi, Simona [0000-0001-8597-6034], Kim, Heebal [0000-0003-3064-1303], Theofanopoulou, Constantina [0000-0003-2014-7563], Zhou, Yang [0000-0003-1247-5049], Martin, Fergal [0000-0002-1672-050X], Flicek, Paul [0000-0002-3897-7955], Walenz, Brian P. [0000-0001-8431-1428], Diekhans, Mark [0000-0002-0430-0989], Paten, Benedict [0000-0001-8863-3539], Crawford, Andrew J. [0000-0003-3153-6898], Gilbert, M. Thomas P. [0000-0002-5805-7195], Zhang, Guojie [0000-0001-6860-1521], Venkatesh, Byrappa [0000-0003-3620-0277], Shapiro, Beth [0000-0002-2733-7776], Johnson, Warren E. [0000-0002-5954-186X], Marques-Bonet, Tomas [0000-0002-5597-3075], Teeling, Emma C. [0000-0002-3309-1346], Ryder, Oliver A. [0000-0003-2427-763X], Haussler, David [0000-0003-1533-4575], Korlach, Jonas [0000-0003-3047-4250], Lewin, Harris A. [0000-0002-1043-7287], Howe, Kerstin [0000-0003-2237-513X], Myers, Eugene W. [0000-0002-6580-7839], Durbin, Richard [0000-0002-9130-1006], Phillippy, Adam M. [0000-0003-2983-8934], Jarvis, Erich D. [0000-0001-8931-5049], Apollo - University of Cambridge Repository, National Institutes of Health (US), National Human Genome Research Institute (US), Ministry of Health and Welfare (South Korea), Wellcome Trust, European Molecular Biology Laboratory, Howard Hughes Medical Institute, Rockefeller University, Robert and Rosabel Osborne Endowment, European Commission, National Library of Medicine (US), Korea Institute of Marine Science & Technology, Ministry of Oceans and Fisheries (South Korea), Alfred P. Sloan Foundation, Max Planck Society, Maine Department of Inland Fisheries & Wildlife, National Science Foundation (US), University of Queensland, Science Exchange, Northeastern University (US), Federal Ministry of Education and Research (Germany), EMBO, National Key Research and Development Program (China), Qatar Society of Al-Gannas (Algannas), Katara Cultural Village, Government of Qatar, Monash University Malaysia, Hessen State Ministry of Higher Education, Research and the Arts, Ministry of Science, Research and Art Baden-Württemberg, Agency for Science, Technology and Research A*STAR (Singapore), European Research Council, Ministerio de Ciencia, Innovación y Universidades (España), Fundación 'la Caixa', Generalitat de Catalunya, Irish Research Council, Danish National Research Foundation, Australian Research Council, Vernes, Sonja C [0000-0003-0305-4584], Balakrishnan, Christopher N [0000-0002-0788-0659], George, Julia M [0000-0001-6194-6914], Kautt, Andreas F [0000-0001-7792-0735], Detrich, H William [0000-0002-0783-4505], Kingan, Sarah B [0000-0002-4900-0189], Putnam, Nicholas H [0000-0002-1315-782X], London, Sarah E [0000-0002-7839-2644], Clayton, David F [0000-0002-6395-3488], Mello, Claudio V [0000-0002-9826-8421], Friedrich, Samantha R [0000-0003-0570-6080], Al-Ajli, Farooq O [0000-0002-4692-7106], Walenz, Brian P [0000-0001-8431-1428], Crawford, Andrew J [0000-0003-3153-6898], Gilbert, M Thomas P [0000-0002-5805-7195], Johnson, Warren E [0000-0002-5954-186X], Teeling, Emma C [0000-0002-3309-1346], Ryder, Oliver A [0000-0003-2427-763X], Lewin, Harris A [0000-0002-1043-7287], Myers, Eugene W [0000-0002-6580-7839], Phillippy, Adam M [0000-0003-2983-8934], and Jarvis, Erich D [0000-0001-8931-5049]
- Subjects
QH301 Biology ,Genome ,0302 clinical medicine ,Genome Size ,Vertebrats ,Uncategorized ,64 ,0303 health sciences ,Sex Chromosomes ,Multidisciplinary ,High-Throughput Nucleotide Sequencing ,Genomics ,Mitochondrial ,Vertebrates ,Identification (biology) ,Engineering sciences. Technology ,Sequence Analysis ,Neuroinformatics ,45/23 ,QH426 Genetics ,Biology ,Article ,Evolutionary genetics ,38 ,Birds ,QH301 ,03 medical and health sciences ,Molecular evolution ,ddc:570 ,Genome assembly algorithms ,Animals ,631/181/735 ,14. Life underwater ,Genomes ,QH426 ,Gene ,Gene Library ,Genome, Mitochondrial ,Haplotypes ,Molecular Sequence Annotation ,Sequence Alignment ,Sequence Analysis, DNA ,030304 developmental biology ,45/91 ,631/61/212/2302 ,45 ,Human evolutionary genetics ,Haplotype ,DAS ,DNA ,Research data ,706/648/697 ,631/181/2474 ,Evolutionary biology ,Genètica ,030217 neurology & neurosurgery ,Reference genome - Abstract
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1,2,3,4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences., We thank them for their permission to publish. A.R., S.K., B.P.W. and A.M.P. were supported by the Intramural Research Program of the NHGRI, NIH (1ZIAHG200398). A.R. was also supported by the Korea Health Technology R&D Project through KHIDI, funded by the Ministry of Health & Welfare, Republic of Korea (HI17C2098). S.A.M., I.B. and R.D. were supported by Wellcome Trust grant WT207492; W.C., M. Smith, Z.N., Y.S., J.C., S. Pelan, J.T., A.T., J.W. and Kerstin Howe by WT206194; L.H., F.M., Kevin Howe and P. Flicek by WT108749/Z/15/Z, WT218328/B/19/Z and the European Molecular Biology Laboratory. O.F. and E.D.J. were supported by Howard Hughes Medical Institute and Rockefeller University start-up funds for this project. J.D. and H.A.L. were supported by the Robert and Rosabel Osborne Endowment. M.U.-S. received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement (750747). F.T.-N., J. Hoffman, P. Masterson and K.C. were supported by the Intramural Research Program of the NLM, NIH. C.L., B.J.K., J. Kim and H.K. were supported by the Marine Biotechnology Program of KIMST, funded by the Ministry of Ocean and Fisheries, Republic of Korea (20180430). M.C. was supported by Sloan Research Fellowship (FG-2020-12932). S.C.V. was funded by a Max Planck Research Group award from the Max Planck Society, and a Human Frontiers Science Program (HFSP) Research grant (RGP0058/2016). T.M.L., W.E.J. and the Canada lynx genome were funded by the Maine Department of Inland Fisheries & Wildlife (F11AF01099), including when W.E.J. held a National Research Council Research Associateship Award at the Walter Reed Army Institute of Research (WRAIR). C.B. was supported by the NSF (1457541 and 1456612). D.B. was funded by The University of Queensland (HFSP - RGP0030/2015). D.I. was supported by Science Exchange Inc. (Palo Alto, CA). H.W.D. was supported by NSF grants (OPP-0132032 ICEFISH 2004 Cruise, PLR-1444167 and OPP-1955368) and the Marine Science Center at Northeastern University (416). G.J.P.N. and the thorny skate genome were funded by Lenfest Ocean Program (30884). M.P. was funded by the German Federal Ministry of Education and Research (01IS18026C). M. Malinsky was supported by an EMBO fellowship (ALTF 456-2016). The following authors’ contributions were supported by the NIH: S. Selvaraj (R44HG008118); C.V.M., S.R.F., P.V.L. (R21 DC014432/DC/NIDCD); K.D.M. (R01GM130691); H.C. (5U41HG002371-19); M.D. (U41HG007234); and B.P. (R01HG010485). D.G. was supported by the National Key Research and Development Program of China (2017YFC1201201, 2018YFC0910504 and 2017YFC0907503). F.O.A. was supported by Al-Gannas Qatari Society and The Cultural Village Foundation-Katara, Doha, State of Qatar and Monash University Malaysia. C.T. was supported by The Rockefeller University. M. Hiller was supported by the LOEWE-Centre for Translational Biodiversity Genomics (TBG) funded by the Hessen State Ministry of Higher Education, Research and the Arts (HMWK). H.C. was supported by the NHGRI (5U41HG002371-19). R.H.S.K. was funded by the Max Planck Society with computational resources at the bwUniCluster and BinAC funded by the Ministry of Science, Research and the Arts Baden-Württemberg and the Universities of the State of Baden-Württemberg, Germany (bwHPC-C5). B.V. was supported by the Biomedical Research Council of A*STAR, Singapore. T.M.-B. was funded by the European Research Council under the European Union’s Horizon 2020 research and innovation programme (864203), MINECO/FEDER, UE (BFU2017-86471-P), Unidad de Excelencia María de Maeztu, AEI (CEX2018-000792-M), a Howard Hughes International Early Career award, Obra Social “La Caixa” and Secretaria d’Universitats i Recerca and CERCA Programme del Departament d’Economia i Coneixement de la Generalitat de Catalunya (GRC 2017 SGR 880). E.C.T. was supported by the European Research Council (ERC-2012-StG311000) and an Irish Research Council Laureate Award. M.T.P.G. was supported by an ERC Consolidator Award 681396-Extinction Genomics, and a Danish National Research Foundation Center Grant (DNRF143). T.W. was supported by the NSF (1458652). J. M. Graves was supported by the Australian Research Council (CEO561477). E.W.M. was partially supported by the German Federal Ministry of Education and Research (01IS18026C). Complementary sequencing support for the Anna’s hummingbird and several genomes was provided by Pacific Biosciences, Bionano Genomics, Dovetail Genomics, Arima Genomics, Phase Genomics, 10X Genomics, NRGene, Oxford Nanopore Technologies, Illumina, and DNAnexus. All other sequencing and assembly were conducted at the Rockefeller University, Sanger Institute, and Max Planck Institute Dresden genome labs. Part of this work used the computational resources of the NIH HPC Biowulf cluster (https://hpc.nih.gov). We acknowledge funding from the Wellcome Trust (108749/Z/15/Z) and the European Molecular Biology Laboratory., With funding from the Spanish government through the "Severo Ochoa Centre of Excellence" accreditation (CEX2018-000792-M).
- Published
- 2021
- Full Text
- View/download PDF
9. Mapping non-host resistance to the stem rust pathogen in an interspecific barberry hybrid
- Author
-
Sarah B. Kingan, Yue Jin, Radhika Bartaula, Iago Hale, and Arthur T. O. Melo
- Subjects
0106 biological sciences ,0301 basic medicine ,Candidate gene ,Berberis ,Population ,Quantitative Trait Loci ,Inheritance Patterns ,Plant Science ,Quantitative trait locus ,Plant disease resistance ,Stem rust ,01 natural sciences ,Chromosomes, Plant ,03 medical and health sciences ,Genetic linkage ,Barberry ,lcsh:Botany ,education ,Disease Resistance ,Plant Diseases ,Genetics ,Durable resistance ,education.field_of_study ,biology ,Plant Stems ,Basidiomycota ,Gene Expression Profiling ,Chromosome Mapping ,Non-host resistance ,biology.organism_classification ,lcsh:QK1-989 ,030104 developmental biology ,Phenotype ,Wheat ,Hybridization, Genetic ,Reference genome ,Genome, Plant ,010606 plant biology & botany ,Research Article - Abstract
Background Non-host resistance (NHR) presents a compelling long-term plant protection strategy for global food security, yet the genetic basis of NHR remains poorly understood. For many diseases, including stem rust of wheat [causal organism Puccinia graminis (Pg)], NHR is largely unexplored due to the inherent challenge of developing a genetically tractable system within which the resistance segregates. The present study turns to the pathogen’s alternate host, barberry (Berberis spp.), to overcome this challenge. Results In this study, an interspecific mapping population derived from a cross between Pg-resistant Berberis thunbergii (Bt) and Pg-susceptible B. vulgaris was developed to investigate the Pg-NHR exhibited by Bt. To facilitate QTL analysis and subsequent trait dissection, the first genetic linkage maps for the two parental species were constructed and a chromosome-scale reference genome for Bt was assembled (PacBio + Hi-C). QTL analysis resulted in the identification of a single 13 cM region (~ 5.1 Mbp spanning 13 physical contigs) on the short arm of Bt chromosome 3. Differential gene expression analysis, combined with sequence variation analysis between the two parental species, led to the prioritization of several candidate genes within the QTL region, some of which belong to gene families previously implicated in disease resistance. Conclusions Foundational genetic and genomic resources developed for Berberis spp. enabled the identification and annotation of a QTL associated with Pg-NHR. Although subsequent validation and fine mapping studies are needed, this study demonstrates the feasibility of and lays the groundwork for dissecting Pg-NHR in the alternate host of one of agriculture’s most devastating pathogens. Electronic supplementary material The online version of this article (10.1186/s12870-019-1893-9) contains supplementary material, which is available to authorized users.
- Published
- 2019
- Full Text
- View/download PDF
10. Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle
- Author
-
Benjamin D. Rosen, Arang Rhie, Joyce V. Lee, Adam M. Phillippy, Elizabeth Tseng, Sergey Koren, Sarah B. Kingan, Michael P. Heaton, Fergal J. Martin, Jay Ghurye, Zev N. Kronenberg, Konstantinos Billis, Andy Wing Chun Pang, Derek M. Bickhart, Timothy P. L. Smith, Françoise Thibaud-Nissen, John L. Williams, Stefan Hiendleder, Wai Yee Low, Ruijie Liu, Rick Tearle, and Alex Hastie
- Subjects
0301 basic medicine ,Male ,Lineage (genetic) ,Science ,Iso-Seq ,General Physics and Astronomy ,Biology ,Subspecies ,Allelic Imbalance ,Long reads ,Genome ,Polymorphism, Single Nucleotide ,General Biochemistry, Genetics and Molecular Biology ,Article ,Structural variation ,03 medical and health sciences ,0302 clinical medicine ,Full-Length Transcriptome ,INDEL Mutation ,Animals ,Copy-number variation ,RNA, Messenger ,lcsh:Science ,Gene ,Alleles ,Animal breeding ,Repetitive Sequences, Nucleic Acid ,PacBio ,Multidisciplinary ,Base Sequence ,Haplotype ,Genetic Variation ,Molecular Sequence Annotation ,General Chemistry ,Genomics ,Chromosomes, Mammalian ,Computational biology and bioinformatics ,030104 developmental biology ,Haplotypes ,Evolutionary biology ,Genetic Loci ,lcsh:Q ,Cattle ,Female ,030217 neurology & neurosurgery ,Reference genome - Abstract
Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and that variant detection is sensitive to the specific reference genome chosen. Six genes with immune related functions have additional copies in the indicine compared with taurine lineage and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations., Taurine and indicine cattle have different desirable traits making them better adapted to different climates across the world. Here, Low et al. describe a pipeline to produce haplotype-resolved, chromosome-level genomes of Angus and Brahman cattle breeds from a crossbred individual and report on comparisons of the two genomes.
- Published
- 2020
- Full Text
- View/download PDF
11. Towards complete and error-free genome assemblies of all vertebrate species
- Author
-
Claudio V. Mello, H. William Detrich, Oliver A. Ryder, George F. Turner, Robert H. S. Kraus, Daryl Eason, Sergey Koren, Stephen J. O'Brien, Ivan Sović, Tandy Warnow, Dave W Burt, Martin Pippel, Mark Diekhans, Jonathan Wood, Sylke Winkler, Joana Damas, Benedict Paten, Shane A. McCarthy, Gregory Gedman, M. Thomas P. Gilbert, David F. Clayton, Erich D. Jarvis, Frank Grützner, Richard E. Green, Andrew J. Crawford, Federica Di Palma, Jason T. Howard, Fergal J. Martin, Brett T. Hannigan, Samantha R. Friedrich, Emma C. Teeling, David Iorns, Woori Kwak, Maximilian Wagner, Iliana Bista, Hiram Clawson, Milan Malinsky, Peter V. Lovell, Gavin J. P. Naylor, Robert S. Harris, Ekaterina Osipova, Sadye Paez, Christopher N. Balakrishnan, Eugene W. Myers, Byrappa Venkatesh, Brian P. Walenz, Warren E. Johnson, Nicholas H. Putnam, Harris A. Lewin, Hannes Svardal, Leanne Haggerty, Andreas F. Kautt, Tomas Marques-Bonet, Luis R Nassar, Maria Simbirsky, Christopher Dunn, William Chow, Marlys L. Houck, Paolo Franchini, Joanna Collins, Jinna Hoffman, Sonja C. Vernes, Alan Tracey, Siddarth Selvaraj, Sarah E. London, Ann C Misuraca, Heebal Kim, Byung June Ko, Trevor Pesout, Françoise Thibaud-Nissen, Jimin George, Jennifer A. Marshall Graves, Arang Rhie, Ying Sims, Mark Wilkinson, Robert W. Murphy, Dengfeng Guan, Axel Meyer, Richard Durbin, Arkarachai Fungtammasan, Sarah Pelan, Lindsey J. Cantin, Erik Garrison, Kerstin Howe, Farooq O. Al-Ajli, Zev N. Kronenberg, Michelle Smith, Paul Flicek, James Torrance, Guojie Zhang, J. H. Kim, Richard Hall, Tanya M. Lama, David Haussler, Matthew T. Biegler, Klaus-Peter Koepfli, Beth Shapiro, Bettina Haase, Andrew Digby, Wesley C. Warren, Alex Hastie, Adam M. Phillippy, Paul Medvedev, Marcela Uliano-Silva, Mark Mooney, Constantina Theofanopoulou, Karen Clark, Chul Hee Lee, Zemin Ning, Olivier Fedrigo, Taylor Edwards, Simona Secomandi, Joyce V. Lee, Jonas Korlach, Patrick Masterson, Jay Ghurye, Jacquelyn Mountcastle, Giulio Formenti, Yang Zhou, Kevin L. Howe, Sarah B. Kingan, and Kateryna D. Makova
- Subjects
Biodiversity conservation ,Extant taxon ,biology ,Evolutionary biology ,biology.animal ,Vertebrate ,Genomics ,Sources of error ,Genome ,Reference genome - Abstract
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are only available for a few non-microbial species1–4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling the most accurate and complete reference genomes to date. Here we summarize these developments, introduce a set of quality standards, and present lessons learned from sequencing and assembling 16 species representing major vertebrate lineages (mammals, birds, reptiles, amphibians, teleost fishes and cartilaginous fishes). We confirm that long-read sequencing technologies are essential for maximizing genome quality and that unresolved complex repeats and haplotype heterozygosity are major sources of error in assemblies. Our new assemblies identify and correct substantial errors in some of the best historical reference genomes. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an effort to generate high-quality, complete reference genomes for all ~70,000 extant vertebrate species and help enable a new era of discovery across the life sciences.
- Published
- 2020
- Full Text
- View/download PDF
12. The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum
- Author
-
Richard Hall, Daniela Puiu, Steven L. Salzberg, Bernardo J. Clavijo, Sarah B. Kingan, and Aleksey V. Zimin
- Subjects
0301 basic medicine ,0106 biological sciences ,Sequence assembly ,Health Informatics ,Genomics ,Data Note ,01 natural sciences ,Genome ,Deep sequencing ,Polyploidy ,03 medical and health sciences ,hybrid assembly ,wheat genome ,Aegilops tauschii ,Genome size ,Triticum ,030304 developmental biology ,Whole genome sequencing ,2. Zero hunger ,Genetics ,plant genomes ,0303 health sciences ,PacBio sequencing ,Contig ,biology ,Whole Genome Sequencing ,Chromosome ,food and beverages ,Molecular Sequence Annotation ,biology.organism_classification ,Computer Science Applications ,030104 developmental biology ,Evolutionary biology ,genome assembly ,Pacific biosciences ,Ploidy ,Genome, Plant ,010606 plant biology & botany - Abstract
Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall size of more than 15 billion bases. Multiple past attempts to assemble the genome have failed. Here we report the first successful assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15,344,693,583 bases and has a weighted average (N50) contig size of of 232,659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4,179,762,575 bp of T. aestivum that correspond to its D genome components.
- Published
- 2017
13. Sequence and annotation of 42 cannabis genomes reveals extensive copy number variation in cannabinoid synthesis and pathogen resistance genes
- Author
-
Biao Liu, Stephen H. McLaughlin, Heather Ebling, Alberto Riva, William B. Barbazuk, Kevin McKernan, Timothy T. Harkins, Zachary Eaton, Primo Baybayan, Mark Jordan, Sarah B. Kingan, Liam T. Kane, Lei Zhang, Yvonne Helbert, and Gregory T. Concepcion
- Subjects
Genetics ,Genetic diversity ,biology ,medicine.medical_treatment ,biology.organism_classification ,Genome ,MRNA Sequencing ,Tetrahydrocannabinolic acid ,medicine ,Cannabis ,Copy-number variation ,Cannabinoid ,Gene ,medicine.drug - Abstract
Cannabis is a diverse and polymorphic species. To better understand cannabinoid synthesis inheritance and its impact on pathogen resistance, we shotgun sequenced and assembled aCannabistrio (sibling pair and their offspring) utilizing long read single molecule sequencing. This resulted in the most contiguousCannabis sativaassemblies to date. These reference assemblies were further annotated with full-length male and female mRNA sequencing (Iso-Seq) to help inform isoform complexity, gene model predictions and identification of the Y chromosome. To further annotate the genetic diversity in the species, 40 male, female, and monoecious cannabis and hemp varietals were evaluated for copy number variation (CNV) and RNA expression. This identified multiple CNVs governing cannabinoid expression and 82 genes associated with resistance toGolovinomyces chicoracearum, the causal agent of powdery mildew in cannabis. Results indicated that breeding for plants with low tetrahydrocannabinolic acid (THCA) concentrations may result in deletion of pathogen resistance genes. Low THCA cultivars also have a polymorphism every 51 bases while dispensary grade high THCA cannabis exhibited a variant every 73 bases. A refined genetic map of the variation in cannabis can guide more stable and directed breeding efforts for desired chemotypes and pathogen-resistant cultivars.Sequence and annotation of 42 cannabis genomes reveals extensive copy number variation in cannabinoid synthesis and pathogen resistance genes
- Published
- 2020
- Full Text
- View/download PDF
14. Genomic analysis of powdery mildew resistance in a hop (Humulus lupulus L.) bi-parental population segregating for 'R6-locus'
- Author
-
Lillian K. Padgitt-Cobb, Sarah B. Kingan, and John A. Henning
- Subjects
0106 biological sciences ,0301 basic medicine ,Genetics ,education.field_of_study ,biology ,Contig ,Population ,Single-nucleotide polymorphism ,Locus (genetics) ,Plant Science ,Horticulture ,Quantitative trait locus ,biology.organism_classification ,01 natural sciences ,03 medical and health sciences ,030104 developmental biology ,education ,Agronomy and Crop Science ,Gene ,Podosphaera ,Powdery mildew ,010606 plant biology & botany - Abstract
Genetic response in hop to fungal pathogen infection has been evaluated at the chromosomal level through QTL analyses but very little information exists on the expression of genes during infection periods. Raw GBS reads and phenotypic data from a previously published QTL analysis along with a newly assembled PacBio-derived hop reference genome were used to re-evaluate resistance to races v4/v6 of powdery mildew (PM; Podosphaera humuli). QTL analyses revealed two tightly linked regions of association on a single linkage group. The three SNP markers most tightly linked to PM resistance (found on contig 000559F) were observed downstream from a putative R-gene locus for powdery mildew resistance. This 230 kb region contained a series of seven putative R-genes surrounded by seven putative peroxidase-3 genes downstream and seven putative glucan endo-1,3-beta-glucosidase upstream and an expressed F-box domain protein. RNAseq data showed all putative R-genes along with all putative glucan endo-1,3-beta-glucosidase genes were expressed under diseased conditions, while none of the peroxidase genes were expressed. The second region contained three SNPs found on contig 002916F next to two putative R-genes. RNAseq data showed complex expression of exons contained in putative isoforms of R-genes. This preliminary information will prove valuable information for development of precise markers located either within or next to genes responsible for race v4/v6 PM resistance.
- Published
- 2019
- Full Text
- View/download PDF
15. A phased, diploid assembly of the Cascade hop (Humulus lupulus) genome reveals patterns of selection and haplotype variation
- Author
-
Lillian K. Padgitt-Cobb, Paul Peluso, Daniel Moore, David A. Hendrix, Pankaj Jaiswal, Sarah B. Kingan, John A. Henning, Justin Elser, Gregory T. Concepcion, Brent A. Kronmiller, Jackson Wells, and David R. Rank
- Subjects
0106 biological sciences ,0303 health sciences ,Humulus lupulus ,biology ,Haplotype ,food and beverages ,Context (language use) ,Retrotransposon ,biology.organism_classification ,01 natural sciences ,Genome ,3. Good health ,Hop (networking) ,Structural variation ,03 medical and health sciences ,Evolutionary biology ,Gene ,030304 developmental biology ,010606 plant biology & botany - Abstract
Hop (Humulus lupulus L. var Lupulus) is a diploid, dioecious plant with a history of cultivation spanning more than one thousand years. Hop cones are valued for their use in brewing, and around the world, hop has been used in traditional medicine to treat a variety of ailments. Efforts to determine how biochemical pathways responsible for desirable traits are regulated have been challenged by the large, repetitive, and heterozygous genome of hop. We present the first report of a haplotype-phased assembly of a large plant genome. Our assembly and annotation of the Cascade cultivar genome is the most extensive to date. PacBio long-read sequences from hop were assembled with FALCON and phased with FALCON-Unzip. Using the diploid assembly to assess haplotype variation, we discovered genes under positive selection enriched for stress-response, growth, and flowering functions. Comparative analysis of haplotypes provides insight into large-scale structural variation and the selective pressures that have driven hop evolution. Previous studies estimated repeat content at around 60%. With improved resolution of long terminal retrotransposons (LTRs) due to long-read sequencing, we found that hop is nearly 78% repetitive. Our quantification of repeat content provides context for the size of the hop genome, and supports the hypothesis of whole genome duplication (WGD), rather than expansion due to LTRs. With our more complete assembly, we have identified a homolog of cannabidiolic acid synthase (CBDAS) that is expressed in multiple tissues. The approaches we developed to analyze a phased, diploid assembly serve to deepen our understanding of the genomic landscape of hop and may have broader applicability to the study of other large, complex genomes.
- Published
- 2019
- Full Text
- View/download PDF
16. Haplotype-Resolved Cattle Genomes Provide Insights Into Structural Variation and Adaptation
- Author
-
John L. Williams, Konstantinos Billis, Derek M. Bickhart, Arang Rhie, Adam M. Phillippy, Elizabeth Tseng, Jay Ghurye, Zev N. Kronenberg, Andy Wing Chun Pang, Françoise Thibaud-Nissen, Sarah B. Kingan, Alex Hastie, Timothy P L Smith, Michael P. Heaton, Jongmin Lee, Sergey Koren, Benjamin D. Rosen, Wai Yee Low, Stefan Hiendleder, Ruijie Liu, Rick Tearle, and Fergal J. Martin
- Subjects
Structural variation ,education.field_of_study ,Lineage (genetic) ,Evolutionary biology ,Population ,Haplotype ,Gene family ,Copy-number variation ,Biology ,Adaptation ,education ,Genome - Abstract
We present high quality, phased genome assemblies representative of taurine and indicine cattle, subspecies that differ markedly in productivity-related traits and environmental adaptation. We report a new haplotype-aware scaffolding and polishing pipeline using contigs generated by the trio binning method to produce haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle breeds. These assemblies were used to identify structural and copy number variants that differentiate the subspecies and we found variant detection was sensitive to the specific reference genome chosen. Six gene families with immune related functions are expanded in the indicine lineage. Assembly of the genomes of both subspecies from a single individual enabled transcripts to be phased to detect allele-specific expression, and to study genome-wide selective sweeps. An indicus-specific extra copy of fatty acid desaturase is under positive selection and may contribute to indicine adaptation to heat and drought.
- Published
- 2019
- Full Text
- View/download PDF
17. A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System
- Author
-
Anna K. Childers, Brad S. Coates, Scott M. Geib, Christine C. Lambert, Julie M. Urban, Kevin J. Hackett, Sarah B. Kingan, Primo Baybayan, Brian E. Scheffler, and Jonas Korlach
- Subjects
0106 biological sciences ,Sequence analysis ,Genome, Insect ,Sequence assembly ,Health Informatics ,Computational biology ,Biology ,Data Note ,01 natural sciences ,Genome ,DNA sequencing ,Spotted lanternfly ,03 medical and health sciences ,Animals ,Genomic library ,Gene ,Gene Library ,030304 developmental biology ,0303 health sciences ,Contig ,Diptera ,Haplotype ,Genomics ,Sequence Analysis, DNA ,Computer Science Applications ,010602 entomology ,Female ,Introduced Species ,Reference genome - Abstract
BackgroundA high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region.ResultsThe DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig.ConclusionsWe demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
- Published
- 2019
- Full Text
- View/download PDF
18. A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
- Author
-
Sarah B. Kingan, Jonas Korlach, Christine C. Lambert, Brendan Galvin, Haynes Heaton, Juliana Cudini, Mara K. N. Lawniczak, Richard Durbin, Primo Baybayan, Kingan, Sarah B [0000-0002-4900-0189], Heaton, Haynes [0000-0002-9649-525X], Durbin, Richard [0000-0002-9130-1006], Korlach, Jonas [0000-0003-3047-4250], Lawniczak, Mara KN [0000-0002-3006-2080], and Apollo - University of Cambridge Repository
- Subjects
0106 biological sciences ,0301 basic medicine ,lcsh:QH426-470 ,0206 medical engineering ,Genome, Insect ,Sequence assembly ,mosquito ,02 engineering and technology ,Computational biology ,010603 evolutionary biology ,01 natural sciences ,Genome ,Article ,03 medical and health sciences ,Contig Mapping ,Anopheles ,low-input DNA ,Genetics ,Animals ,long-read SMRT sequencing ,Gene ,Genome size ,Genetics (clinical) ,030304 developmental biology ,Comparative genomics ,0303 health sciences ,Ploidies ,Polymorphism, Genetic ,Contig ,de novo genome assembly ,Sequence Analysis, DNA ,genomic DNA ,lcsh:Genetics ,030104 developmental biology ,020602 bioinformatics ,Reference genome - Abstract
A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µ, g for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>, 90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.
- Published
- 2019
19. Gene flow mediates the role of sex chromosome meiotic drive during complex speciation
- Author
-
Colin D. Meiklejohn, Sarah B. Kingan, Thomas Rzatkiewicz, Daniel Garrigan, Kathleen E. Gordon, David L. Stern, Emily L. Landeen, Christina A. Muirhead, Daven C. Presgraves, Jeffrey P. Vedanayagam, and Anthony J. Geneva
- Subjects
Gene Flow ,Male ,0106 biological sciences ,0301 basic medicine ,X Chromosome ,Genetic Speciation ,QH301-705.5 ,Science ,Genomics ,Biology ,010603 evolutionary biology ,01 natural sciences ,Genome ,General Biochemistry, Genetics and Molecular Biology ,Gene flow ,Population genomics ,03 medical and health sciences ,Species Specificity ,Gene mapping ,Y Chromosome ,Genetic algorithm ,Animals ,Biology (General) ,Infertility, Male ,X chromosome ,030304 developmental biology ,Evolutionary Biology ,0303 health sciences ,General Immunology and Microbiology ,General Neuroscience ,Chromosome ,Genetics and Genomics ,General Medicine ,Biological Evolution ,Drosophila mauritiana ,Meiosis ,030104 developmental biology ,Meiotic drive ,speciation ,Evolutionary biology ,Medicine ,Drosophila ,Drosophila simulans ,Other ,Research Article - Abstract
During speciation, sex chromosomes often accumulate interspecific genetic incompatibilities faster than the rest of the genome. The drive theory posits that sex chromosomes are susceptible to recurrent bouts of meiotic drive and suppression, causing the evolutionary build-up of divergent cryptic sex-linked drive systems and, incidentally, genetic incompatibilities. To assess the role of drive during speciation, we combine high-resolution genetic mapping of X-linked hybrid male sterility with population genomics analyses of divergence and recent gene flow between the fruitfly species,Drosophila mauritianaandD. simulans. Our findings reveal a high density of genetic incompatibilities and a corresponding dearth of gene flow on the X chromosome. Surprisingly, we find that, rather than contributing to interspecific divergence, a known drive element has recently migrated between species, caused a strong reduction in local divergence, and undermined the evolution of hybrid sterility. Gene flow can therefore mediate the effects of selfish genetic elements during speciation.
- Published
- 2018
- Full Text
- View/download PDF
20. Improved Aedes aegypti mosquito reference genome assembly enables biological discovery and vector control
- Author
-
Hugh M. Robertson, Jill Muehling, Frederick A. Partridge, Sanjit S. Batra, Joe Turner, Noah H. Rose, William C. Black, Aviva Presser Aiden, Michael R. Murphy, Catherine A. Hill, Jacob E. Crawford, Alistair C. Darby, Erez Lieberman Aiden, Daniel E. Neafsey, Adam M. Phillippy, William J. Glassford, Alexander S. Raikhel, Richard Hall, Melissa Smith, Vinita Joardar, Louis Lambrechts, Andrew K. Jones, Margaret Herre, Paul Peluso, Allison M Weakley, Joyce Lee, Han Cao, Eric Cox, Seth Redmond, Yang Wu, Terence Murphy, Raissa G.G. Kay, Igor Filipović, Maria V. Sharakhova, Vamsi K. Kodali, Carolyn S. McBride, Karla Saavedra-Rodriguez, Shruti Sharan, Gordana Rašić, Zhijian Tu, Albin Fontaine, Gareth D. Weedall, Igor V. Sharakhov, Sarah B. Kingan, Bradley J. White, J. Spencer Johnston, Thanyalak Fansiri, Olga Dudchenko, Richard S. Mann, Jonas Korlach, Steven D. Buckingham, Omar S. Akbari, David B. Sattelle, Saki Chan, Sara N. Mitchell, Corey L. Campbell, Benjamin J. Matthews, Atashi Sharma, Li Zhao, Gareth J Lycett, Benjamin R. Evans, Jeffrey R. Powell, Sergey Koren, Andrea Gloria-Soria, Leslie B. Vosshall, Igor Antoshechkin, Arina D. Omer, Zhilei Zhao, Vidya Ramasamy, Sourav Roy, Alex Hastie, and Carlos A Brito-Sierra
- Subjects
0301 basic medicine ,education.field_of_study ,Population ,Yellow fever ,Genomics ,Aedes aegypti ,Computational biology ,Biology ,biology.organism_classification ,medicine.disease_cause ,medicine.disease ,Genome ,Dengue fever ,03 medical and health sciences ,030104 developmental biology ,medicine ,Chikungunya ,education ,Reference genome - Abstract
Female Aedes aegypti mosquitoes infect hundreds of millions of people each year with dangerous viral pathogens including dengue, yellow fever, Zika, and chikungunya. Progress in understanding the biology of this insect, and developing tools to fight it, has been slowed by the lack of a high-quality genome assembly. Here we combine diverse genome technologies to produce AaegL5, a dramatically improved and annotated assembly, and demonstrate how it accelerates mosquito science and control. We anchored the physical and cytogenetic maps, resolved the size and composition of the elusive sex-determining “M locus”, significantly increased the known members of the glutathione-S-transferase genes important for insecticide resistance, and doubled the number of chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites. Using high-resolution QTL and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. We predict that AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly arboviral vector.
- Published
- 2018
21. Improved reference genome of Aedes aegypti informs arbovirus vector control
- Author
-
Adam M. Phillippy, Richard Hall, Michael R. Murphy, Vinita Joardar, Daniel E. Neafsey, Bradley J. White, Margaret Herre, Gareth D. Weedall, Steven D. Buckingham, Louis Lambrechts, Hugh M. Robertson, Andrew K. Jones, David B. Sattelle, Han Cao, Karla Saavedra-Rodriguez, Yang Wu, Maria V. Sharakhova, Allison M Weakley, Atashi Sharma, Alexander S. Raikhel, Olga Dudchenko, Catherine A. Hill, Sourav Roy, Saki Chan, Sara N. Mitchell, Alex Hastie, Arina D. Omer, Igor Filipović, Zhilei Zhao, Raissa G.G. Kay, Melissa Smith, Joyce Lee, Eric Cox, Sergey Koren, Li Zhao, Vidya Ramasamy, Benjamin J. Matthews, Joe Turner, William C. Black, Noah H. Rose, Leslie B. Vosshall, Erez Lieberman Aiden, Corey L. Campbell, Gordana Rašić, Zhijian Tu, J. Spencer Johnston, Jill Muehling, Jonas Korlach, Igor V. Sharakhov, Seth Redmond, Thanyalak Fansiri, Andrea Gloria-Soria, Omar S. Akbari, Aviva Presser Aiden, Richard S. Mann, Igor Antoshechkin, Jacob E. Crawford, Frederick A. Partridge, Alistair C. Darby, Sanjit S. Batra, Carolyn S. McBride, Sarah B. Kingan, Jeffrey R. Powell, Benjamin R. Evans, Gareth J Lycett, William J. Glassford, Paul Peluso, Shruti Sharan, Albin Fontaine, Carlos A Brito-Sierra, Terence Murphy, Vamsi K. Kodali, Rockefeller University [New York], Howard Hughes Medical Institute [New York] (HHMI), Howard Hughes Medical Institute (HHMI)-New York University School of Medicine, NYU System (NYU)-NYU System (NYU)-Rockefeller University [New York]-Columbia University Irving Medical Center (CUIMC), Kavli Neural Systems Institute, Baylor College of Medicine (BCM), Baylor University, Rice University [Houston], Pacific Biosciences [Menlo Park], Pacific Biosciences of California, National Human Genome Research Institute (NHGRI), California Institute of Technology (CALTECH), Verily Life Sciences Inc, Columbia University [New York], Broad Institute of MIT and Harvard (BROAD INSTITUTE), Harvard Medical School [Boston] (HMS)-Massachusetts Institute of Technology (MIT)-Massachusetts General Hospital [Boston], Harvard T.H. Chan School of Public Health, Princeton University, Liverpool John Moores University (LJMU), Virginia Tech [Blacksburg], Southern Medical University [Guangzhou], Purdue University [West Lafayette], University College of London [London] (UCL), Colorado State University [Fort Collins] (CSU), Bionano Genomics Inc, National Center for Biotechnology Information (NCBI), Yale University [New Haven], Armed Forces Research Institute of Medical Sciences [Bangkok] (AFRIMS), QIMR Berghofer Medical Research Institute, Interactions Virus-Insectes - Insect-Virus Interactions (IVI), Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS), Institut de Recherche Biomédicale des Armées [Antenne Marseille] (IRBA), Vecteurs - Infections tropicales et méditerranéennes (VITROME), Institut de Recherche pour le Développement (IRD)-Aix Marseille Université (AMU)-Institut de Recherche Biomédicale des Armées [Brétigny-sur-Orge] (IRBA), Oxford Brookes University, University of California [Riverside] (UC Riverside), University of California (UC), Liverpool School of Tropical Medicine (LSTM), University of Liverpool, University of California [San Diego] (UC San Diego), Texas A&M University [College Station], Tomsk State University [Tomsk], University of Illinois [Chicago] (UIC), University of Illinois System, Howard Hughes Medical Institute [New York], This research was supported in part by federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under grant number U19AI110818 to the Broad Institute (S.N.R. and D.E.N.), USDA 2017-05741 (E.L.A.), NSF PHY-1427654 Center for Theoretical Biological Physics (E.L.A.), NIH Intramural Research Program, National Library of Medicine and National Human Genome Research Institute (A.M.P. and S.K.) and the following extramural NIH grants: R01AI101112 (J.R.P.), R35GM118336 (R.S.M. and W.J.G.), R21AI121853 (M.V.S., I.V.S. and A.S.), R01AI123338 (Z.T.), T32GM007739 (M.H.), NIH/NCATS UL1TR000043 (Rockefeller University), DP2OD008540 (E.L.A.), U01AI088647, 1R01AI121211 (W.C.B. IV), Fogarty Training Grant D43TW001130-08, U01HL130010 (E.L.A.), UM1HG009375 (E.L.A), 5K22AI113060 (O.S.A.), 1R21AI123937 (O.S.A.), and R00DC012069 (C.S.M.), Defence Advanced Research Project Agency: HR0011-17-2- 0047 (O.S.A.). Other support was provided by Jane Coffin Childs Memorial Fund (B.J.M.), Center for Theoretical Biological Physics postdoctoral fellowship (O.D.), Robertson Foundation (L.Z.), and McNair & Welch (Q-1866) Foundations (E.L.A.), French Government’s Investissement d’Avenir program, Laboratoire d’Excellence Integrative Biology of Emerging Infectious Diseases (grant ANR-10-LABX-62-IBEID to L.L.), Agence Nationale de la Recherche grant ANR-17-ERC2-0016-01 (L.L.), European Union’s Horizon 2020 research and innovation program under ZikaPLAN grant agreement no. 734584 (L.L.), Pew and Searle Scholars Programs (C.S.M.), Klingenstein-Simons Fellowship in the Neurosciences (C.S.M.). A.M.W., B.J.W., J.E.C. and S.N.M. were supported by Verily Life Sciences. L.B.V. is an investigator of the Howard Hughes Medical Institute., We thank R. Andino, S. Emrich and D. Lawson (Vectorbase), A. A. James, M. Kunitomi, C. Nusbaum, D. Severson, N. Whiteman, T. Dickinson, M. Hartley and B. Rice (Dovetail Genomics) for early participation in the AGWG, C. Bargmann, D. Botstein, E. Jarvis and E. Lander for encouragement and facilitation. N. Keivanfar, D. Jaffe and D. M. Church (10X Genomics) prepared DNA for structural-variant analysis. We thank A. Harmon of the New York Times and acknowledge generous pro bono data and analysis from our corporate collaborators, ANR-17-ERC2-0016,GxG,Base génétique de la spécificité génotype-génotype dans l'interaction naturelle entre un virus et son insecte vecteur(2017), ANR-10-LABX-0062,IBEID,Integrative Biology of Emerging Infectious Diseases(2010), European Project: no. 734584, Centre National de la Recherche Scientifique (CNRS)-Institut Pasteur [Paris], Institut de Recherche pour le Développement (IRD)-Aix Marseille Université (AMU)-Institut de Recherche Biomédicale des Armées (IRBA), University of California [Riverside] (UCR), University of California, Biochemistry, Entomology, Fralin Life Sciences Institute, Howard Hughes Medical Institute (HHMI)-Rockefeller University [New York]-Columbia University Irving Medical Center (CUIMC)-New York University School of Medicine, NYU System (NYU)-NYU System (NYU), Institut de Recherche Biomédicale des Armées (IRBA)-Institut de Recherche pour le Développement (IRD)-Aix Marseille Université (AMU), The Rockefeller University [New-York], Howard Hughes Medical Institute [Berkeley], University of California [Berkeley], Baylor College of Medicine ( BCM ), Baylor College of Medicine, National Human Genome Research Institute ( NHGRI ), California Institute of Technology ( CALTECH ), Broad Institute of MIT and Harvard ( BROAD INSTITUTE ), Harvard Medical School [Boston] ( HMS ) -Massachusetts General Hospital [Boston] ( MGH ) -Massachusetts Institute of Technology ( MIT ), Liverpool John Moore University ( ljmu ), University College of London [London] ( UCL ), Colorado State University [Fort Collins] ( CSU ), National Center for Biotechnology Information ( NCBI ), Armed Forces Research Institute of Medical Sciences [Bangkok] ( AFRIMS ), Vecteurs - Infections tropicales et méditerranéennes ( VITROME ), Institut de Recherche pour le Développement ( IRD ) -Aix Marseille Université ( AMU ) -Institut de Recherche Biomédicale des Armées ( IRBA ), Interactions Virus-Insectes - Insect-Virus Interactions ( IVI ), Institut Pasteur [Paris]-Centre National de la Recherche Scientifique ( CNRS ), Institut de Recherche Biomédicale des Armées [Antenne Marseille] ( IRBA ), University of California [Riverside] ( UCR ), Liverpool School of Tropical Medicine ( LSTM ), University of California [San Diego] ( UC San Diego ), and University of Illinois at Urbana-Champaign [Urbana]
- Subjects
0301 basic medicine ,Male ,and promotion of well-being ,[SDV]Life Sciences [q-bio] ,Genome, Insect ,Dengue virus ,medicine.disease_cause ,Dengue fever ,Insecticide Resistance ,Aedes ,Pyrethrins ,Chikungunya ,Glutathione Transferase ,Genetics ,education.field_of_study ,Multidisciplinary ,Genome ,biology ,Yellow fever ,Genomics ,Reference Standards ,3. Good health ,[ SDV.MHEP.MI ] Life Sciences [q-bio]/Human health and pathology/Infectious diseases ,Infectious Diseases ,Multigene Family ,[SDV.MP.VIR]Life Sciences [q-bio]/Microbiology and Parasitology/Virology ,Female ,Infection ,Biotechnology ,DNA Copy Number Variations ,General Science & Technology ,1.1 Normal biological development and functioning ,Population ,Aedes aegypti ,Mosquito Vectors ,Arbovirus Infections ,Arbovirus ,Insect Control ,Article ,Vaccine Related ,03 medical and health sciences ,QH301 ,Rare Diseases ,Underpinning research ,[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,medicine ,Animals ,education ,QR355 ,3.2 Interventions to alter physical and biological environmental risks ,[SDV.GEN]Life Sciences [q-bio]/Genetics ,Prevention ,Human Genome ,Genetic Variation ,Molecular Sequence Annotation ,Dengue Virus ,Sex Determination Processes ,medicine.disease ,biology.organism_classification ,Prevention of disease and conditions ,Vector-Borne Diseases ,030104 developmental biology ,Emerging Infectious Diseases ,Good Health and Well Being ,Genetics, Population ,Vector (epidemiology) ,Insect ,Arboviruses ,[SDV.EE.IEO]Life Sciences [q-bio]/Ecology, environment/Symbiosis - Abstract
Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector. National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services [U19AI110818]; USDA [2017-05741]; NIH Intramural Research Program; National Library of Medicine; National Human Genome Research Institute; NSF [PHY-1427654]; NIH [R01AI101112, R35GM118336, R21AI121853, R01AI123338, T32GM007739, NIH/NCATS UL1TR000043, DP2OD008540, U01AI088647, 1R01AI121211, D43TW001130-08, U01HL130010, UM1HG009375, 5K22AI113060, 1R21AI123937, R00DC012069]; Defence Advanced Research Project Agency [HR0011-17-2-0047]; Jane Coffin Childs Memorial Fund; Center for Theoretical Biological Physics postdoctoral fellowship; Robertson Foundation; McNair Foundation; Welch Foundation [Q-1866]; French Government's Investissement d'Avenir program, Laboratoire d'Excellence Integrative Biology of Emerging Infectious Diseases [ANR-10-LABX-62-IBEID]; Agence Nationale de la Recherche [ANR-17-ERC2-0016-01]; European Union [734584]; Pew and Searle Scholars Programs; Klingenstein-Simons Fellowship in the Neurosciences; Verily Life Sciences We thank R. Andino; S. Emrich and D. Lawson (Vectorbase); A. A. James, M. Kunitomi, C. Nusbaum, D. Severson, N. Whiteman; T. Dickinson, M. Hartley and B. Rice (Dovetail Genomics) for early participation in the AGWG; C. Bargmann, D. Botstein, E. Jarvis and E. Lander for encouragement and facilitation. N. Keivanfar, D. Jaffe and D. M. Church (10X Genomics) prepared DNA for structural-variant analysis. We thank A. Harmon of the New York Times and acknowledge generous pro bono data and analysis from our corporate collaborators. This research was supported in part by federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under grant number U19AI110818 to the Broad Institute (S.N.R. and D.E.N.); USDA 2017-05741 (E.L.A.); NSF PHY-1427654 Center for Theoretical Biological Physics (E.L.A.); NIH Intramural Research Program, National Library of Medicine and National Human Genome Research Institute (A.M.P. and S.K.) and the following extramural NIH grants: R01AI101112 (J.R.P.), R35GM118336 (R.S.M. and W.J.G.), R21AI121853 (M.V.S., I.V.S. and A. S.), R01AI123338 (Z.T.), T32GM007739 (M.H.), NIH/NCATS UL1TR000043 (Rockefeller University), DP2OD008540 (E.L.A.), U01AI088647, 1R01AI121211 (W.C.B. IV), Fogarty Training Grant D43TW001130-08, U01HL130010 (E.L.A.), UM1HG009375 (E.L.A), 5K22AI113060 (O.S.A.), 1R21AI123937 (O.S.A.), and R00DC012069 (C.S.M.); Defence Advanced Research Project Agency: HR0011-17-2-0047 (O.S.A.). Other support was provided by Jane Coffin Childs Memorial Fund (B.J.M.), Center for Theoretical Biological Physics postdoctoral fellowship (O.D.), Robertson Foundation (L.Z.), and McNair & Welch (Q-1866) Foundations (E.L.A.), French Government's Investissement d'Avenir program, Laboratoire d'Excellence Integrative Biology of Emerging Infectious Diseases (grant ANR-10-LABX-62-IBEID to L.L.), Agence Nationale de la Recherche grant ANR-17-ERC2-0016-01 (L.L.), European Union's Horizon 2020 research and innovation program under ZikaPLAN grant agreement no. 734584 (L.L.), Pew and Searle Scholars Programs (C.S.M.), Klingenstein-Simons Fellowship in the Neurosciences (C.S.M.). A.M.W., B.J.W., J.E.C. and S.N.M. were supported by Verily Life Sciences. L.B.V. is an investigator of the Howard Hughes Medical Institute.
- Published
- 2018
- Full Text
- View/download PDF
22. De novo assembly of haplotype-resolved genomes with trio binning
- Author
-
Arang Rhie, Sergey Koren, John L. Williams, Derek M. Bickhart, Adam M. Phillippy, Stefan Hiendleder, Timothy P L Smith, Alexander T. Dilthey, Sarah B. Kingan, and Brian P. Walenz
- Subjects
0301 basic medicine ,Haplotype ,Biomedical Engineering ,Sequence assembly ,Bioengineering ,Genomics ,Biology ,Applied Microbiology and Biotechnology ,Genome ,humanities ,Article ,Loss of heterozygosity ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Evolutionary biology ,Molecular Medicine ,Human genome ,Allele ,Ploidy ,030217 neurology & neurosurgery ,Biotechnology - Abstract
Complex allelic variation hampers the assembly of haplotype-resolved sequences from diploid genomes. We developed trio binning, an approach that simplifies haplotype assembly by resolving allelic variation before assembly. In contrast with prior approaches, the effectiveness of our method improved with increasing heterozygosity. Trio binning uses short reads from two parental genomes to first partition long reads from an offspring into haplotype-specific sets. Each haplotype is then assembled independently, resulting in a complete diploid reconstruction. We used trio binning to recover both haplotypes of a diploid human genome and identified complex structural variants missed by alternative approaches. We sequenced an F1 cross between the cattle subspecies Bos taurus taurus and Bos taurus indicus and completely assembled both parental haplotypes with NG50 haplotig sizes of >20 Mb and 99.998% accuracy, surpassing the quality of current cattle reference genomes. We suggest that trio binning improves diploid genome assembly and will facilitate new studies of haplotype variation and inheritance.
- Published
- 2018
23. Author response: Gene flow mediates the role of sex chromosome meiotic drive during complex speciation
- Author
-
Thomas Rzatkiewicz, Emily L. Landeen, Sarah B. Kingan, Jeffrey P. Vedanayagam, Daven C. Presgraves, Christina A. Muirhead, David L. Stern, Daniel Garrigan, Kathleen E. Gordon, Colin D. Meiklejohn, and Anthony J. Geneva
- Subjects
Genetics ,Meiotic drive ,Genetic algorithm ,Chromosome ,Biology ,Gene flow - Published
- 2018
- Full Text
- View/download PDF
24. A High-Quality, Long-Read De Novo Genome Assembly to Aid Conservation of Hawaii’s Last Remaining Crow Species
- Author
-
M. Renee Bellinger, Jolene T. Sutton, Primo Baybayan, Oliver A. Ryder, Martin Helmkampf, Richard Hall, Jonas Korlach, Jenny Gu, Cynthia C. Steiner, Sarah B. Kingan, Bryce M. Masuda, and Jill Muehling
- Subjects
0301 basic medicine ,0106 biological sciences ,Hawaiian crow ,lcsh:QH426-470 ,Population ,runs of homozygosity (ROH) ,Endangered species ,Sequence assembly ,Genomics ,SMRT sequencing ,Runs of Homozygosity ,Biology ,010603 evolutionary biology ,01 natural sciences ,Genome ,Article ,03 medical and health sciences ,Captive breeding ,Genetics ,education ,Genetics (clinical) ,Wildlife conservation ,030304 developmental biology ,Comparative genomics ,0303 health sciences ,education.field_of_study ,behavior ,fungi ,food and beverages ,15. Life on land ,biology.organism_classification ,major histocompatibility complex ,lcsh:Genetics ,030104 developmental biology ,toll-like receptors ,Evolutionary biology ,inbreeding depression - Abstract
Genome-level data can provide researchers with unprecedented precision to examine the causes and genetic consequences of population declines, which can inform conservation management. Here, we present a high-quality, long-read, de novo genome assembly for one of the world’s most endangered bird species, the ʻAlalā (Corvus hawaiiensis, Hawaiian crow). As the only remaining native crow species in Hawaiʻi, the ʻAlalā survived solely in a captive-breeding program from 2002 until 2016, at which point a long-term reintroduction program was initiated. The high-quality genome assembly was generated to lay the foundation for both comparative genomics studies and the development of population-level genomic tools that will aid conservation and recovery efforts. We illustrate how the quality of this assembly places it amongst the very best avian genomes assembled to date, comparable to intensively studied model systems. We describe the genome architecture in terms of repetitive elements and runs of homozygosity, and we show that compared with more outbred species, the ʻAlalā genome is substantially more homozygous. We also provide annotations for a subset of immunity genes that are likely to be important in conservation management, and we discuss how this genome is currently being used as a roadmap for downstream conservation applications.
- Published
- 2018
- Full Text
- View/download PDF
25. Extended haplotype phasing of de novo genome assemblies with FALCON-Phase
- Author
-
Sarah B. Kingan, John L. Williams, Zev N. Kronenberg, Gregory T. Concepcion, Olivier Fedrigo, Katherine M. Munson, Erich D. Jarvis, Stefan Hiendleder, Arang Rhie, Timothy P. L. Smith, Adam M. Phillippy, Shawn Sullivan, Richard Hall, Sergey Koren, Evan E. Eichler, and Paul Peluso
- Subjects
Loss of heterozygosity ,Haplotype ,Computational biology ,Biology ,Ploidy ,Phaser ,Zebra finch ,Phenotype ,Genome ,Chromatin - Abstract
Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. These assemblies can be created in various ways, such as use of tissues that contain single-haplotype (haploid) genomes, or by co-sequencing of parental genomes, but these approaches can be impractical in many situations. We present FALCON-Phase, which integrates long-read sequencing data and ultra-long-range Hi-C chromatin interaction data of a diploid individual to create high-quality, phased diploid genome assemblies. The method was evaluated by application to three datasets, including human, cattle, and zebra finch, for which high-quality, fully haplotype resolved assemblies were available for benchmarking. Phasing algorithm accuracy was affected by heterozygosity of the individual sequenced, with higher accuracy for cattle and zebra finch (>97%) compared to human (82%). In addition, scaffolding with the same Hi-C chromatin contact data resulted in phased chromosome-scale scaffolds.
- Published
- 2018
- Full Text
- View/download PDF
26. Complete assembly of parental haplotypes with trio binning
- Author
-
Sarah B. Kingan, Timothy P L Smith, John L. Williams, Sergey Koren, Arang Rhie, Adam M. Phillippy, Alexander T. Dilthey, Stefan Hiendleder, Brian P. Walenz, and Derek M. Bickhart
- Subjects
Loss of heterozygosity ,Evolutionary biology ,Haplotype ,Inheritance (genetic algorithm) ,Genomics ,Ploidy ,Biology ,Subspecies ,Genome ,humanities ,Reference genome - Abstract
Reference genome projects have historically selected inbred individuals to minimize heterozygosity and simplify assembly. We challenge this dogma and present a new approach designed specifically for heterozygous genomes. “Trio binning” uses short reads from two parental genomes to partition long reads from an offspring into haplotype-specific sets prior to assembly. Each haplotype is then assembled independently, resulting in a complete diploid reconstruction. On a benchmark human trio, this method achieved high accuracy and recovered complex structural variants missed by alternative approaches. To demonstrate its effectiveness on a heterozygous genome, we sequenced an F1 cross between cattle subspeciesBos taurus taurusandBos taurus indicus, and completely assembled both parental haplotypes with NG50 haplotig sizes >20 Mbp and 99.998% accuracy, surpassing the quality of current cattle reference genomes. We propose trio binning as a new best practice for diploid genome assembly that will enable new studies of haplotype variation and inheritance.
- Published
- 2018
- Full Text
- View/download PDF
27. De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads
- Author
-
Chen-Shan Chin, Erich D. Jarvis, Jonas Korlach, Jason T. Howard, Lindsey J. Cantin, Sarah B. Kingan, Gregory Gedman, and Jean-Nicolas Audet
- Subjects
0301 basic medicine ,Male ,animal structures ,brain ,Health Informatics ,Genomics ,Nerve Tissue Proteins ,Computational biology ,Biology ,Genome ,Avian Proteins ,Birds ,03 medical and health sciences ,0302 clinical medicine ,Reference genes ,long reads ,Animals ,SMRT Sequencing ,Gene ,Zebra finch ,Early Growth Response Protein 1 ,Genetics ,language ,Contig ,Research ,Haplotype ,Dual Specificity Phosphatase 1 ,Forkhead Transcription Factors ,Sequence Analysis, DNA ,de novo genome assembly ,Computer Science Applications ,030104 developmental biology ,Female ,030217 neurology & neurosurgery ,Single molecule real time sequencing - Abstract
Reference-quality genomes are expected to provide a resource for studying gene structure, function, and evolution. However, often genes of interest are not completely or accurately assembled, leading to unknown errors in analyses or additional cloning efforts for the correct sequences. A promising solution is long-read sequencing. Here we tested PacBio-based long-read sequencing and diploid assembly for potential improvements to the Sanger-based intermediate-read zebra finch reference and Illumina-based short-read Anna's hummingbird reference, 2 vocal learning avian species widely studied in neuroscience and genomics. With DNA of the same individuals used to generate the reference genomes, we generated diploid assemblies with the FALCON-Unzip assembler, resulting in contigs with no gaps in the megabase range, representing 150-fold and 200-fold improvements over the current zebra finch and hummingbird references, respectively. These long-read and phased assemblies corrected and resolved what we discovered to be numerous misassemblies in the references, including missing sequences in gaps, erroneous sequences flanking gaps, base call errors in difficult-to-sequence regions, complex repeat structure errors, and allelic differences between the 2 haplotypes. These improvements were validated by single long-genome and transcriptome reads and resulted for the first time in completely resolved protein-coding genes widely studied in neuroscience and specialized in vocal learning species. These findings demonstrate the impact of long reads, sequencing of previously difficult-to-sequence regions, and phasing of haplotypes on generating the high-quality assemblies necessary for understanding gene structure, function, and evolution.
- Published
- 2017
28. De Novo PacBio long-read and phased avian genome assemblies correct and add to genes important in neuroscience research
- Author
-
Jason T. Howard, Sarah B. Kingan, Erich D. Jarvis, Gregory Gedman, Jonas Korlach, Lindsey J. Cantin, and Chen-Shan Chin
- Subjects
Genetics ,Contig ,Reference genes ,Haplotype ,Genomics ,Computational biology ,Biology ,Genome ,Gene ,Zebra finch ,Sequence (medicine) - Abstract
Reference quality genomes are expected to provide a resource for studying gene structure and function. However, often genes of interest are not completely or accurately assembled, leading to unknown errors in analyses or additional cloning efforts for the correct sequences. A promising solution to this problem is long-read sequencing. Here we tested PacBio-based long-read sequencing and diploid assembly for potential improvements to the Sanger-based intermediate-read zebra finch reference and Illumina-based short-read Anna’s hummingbird reference, two vocal learning avian species widely studied in neuroscience and genomics. With DNA of the same individuals used to generate the reference genomes, we generated diploid assemblies with the FALCON-Unzip assembler, resulting in contigs with no gaps in the megabase range (N50s of 5.4 and 7.7 Mb, respectively), and representing 150-fold and 200-fold improvements over the current zebra finch and hummingbird references, respectively. These long-read assemblies corrected and resolved what we discovered to be misassemblies, including due to erroneous sequences flanking gaps, complex repeat structure errors in the references, base call errors in difficult to sequence regions, and inaccurate resolution of allelic differences between the two haplotypes. We analyzed protein-coding genes widely studied in neuroscience and specialized in vocal learning species, and found numerous assembly and sequence errors in the reference genes that the PacBio-based assemblies resolved completely, validated by single long genomic reads and transcriptome reads. These findings demonstrate, for the first time in non-human vocal learning species, the impact of higher quality, phased and gap-less assemblies for understanding gene structure and function.
- Published
- 2017
- Full Text
- View/download PDF
29. Molecular Evolution at a Meiosis Gene Mediates Species Differences in the Rate and Patterning of Recombination
- Author
-
M. Victoria Cattani, Emily L. Landeen, Cara L. Brand, Sarah B. Kingan, and Daven C. Presgraves
- Subjects
0301 basic medicine ,Centromere ,Population ,Gene Expression ,Population genetics ,Cell Cycle Proteins ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Chromosomal crossover ,Animals, Genetically Modified ,Evolution, Molecular ,03 medical and health sciences ,0302 clinical medicine ,Species Specificity ,Meiosis ,Molecular evolution ,Homologous chromosome ,Animals ,Drosophila Proteins ,DNA Breaks, Double-Stranded ,Amino Acid Sequence ,Crossing Over, Genetic ,Selection, Genetic ,education ,Gene ,Recombination, Genetic ,education.field_of_study ,Genetic hitchhiking ,Drosophila melanogaster ,030104 developmental biology ,Evolutionary biology ,Drosophila ,General Agricultural and Biological Sciences ,030217 neurology & neurosurgery - Abstract
Crossing over between homologous chromosomes during meiosis repairs programmed DNA double-strand breaks, ensures proper segregation at meiosis I [1], shapes the genomic distribution of nucleotide variability in populations, and enhances the efficacy of natural selection among genetically linked sites [2]. Between closely related Drosophila species, large differences exist in the rate and chromosomal distribution of crossing over. Little, however, is known about the molecular genetic changes or population genetic forces that mediate evolved differences in recombination between species [3, 4]. Here, we show that a meiosis gene with a history of rapid evolution acts as a trans-acting modifier of species differences in crossing over. In transgenic flies, the dicistronic gene, mei-217/mei-218, recapitulates a large part of the species differences in the rate and chromosomal distribution of crossing over. These phenotypic differences appear to result from changes in protein sequence not gene expression. Our population genetics analyses show that the protein-coding sequence of mei-218, but not mei-217, has a history of recurrent positive natural selection. By modulating the intensity of centromeric and telomeric suppression of crossing over, evolution at mei-217/-218 has incidentally shaped gross differences in the chromosomal distribution of nucleotide variability between species. We speculate that recurrent bouts of adaptive evolution at mei-217/-218 might reflect a history of coevolution with selfish genetic elements.
- Published
- 2018
- Full Text
- View/download PDF
30. Recurrent Selection on the Winters sex-ratio Genes in Drosophila simulans
- Author
-
Daniel Garrigan, Daniel L. Hartl, and Sarah B. Kingan
- Subjects
Male ,Aging ,Linkage disequilibrium ,Time Factors ,X Chromosome ,Population ,Gene Conversion ,Genes, Insect ,Single-nucleotide polymorphism ,Investigations ,Biology ,California ,Linkage Disequilibrium ,Coalescent theory ,Evolution, Molecular ,INDEL Mutation ,Genetic variation ,Genetics ,Animals ,Sex Ratio ,Selection, Genetic ,Allele ,education ,Alleles ,education.field_of_study ,Polymorphism, Genetic ,Nucleotides ,Haplotype ,Meiotic drive ,Haplotypes ,Genetic Loci ,Evolutionary biology ,Drosophila ,Female - Abstract
Selfish genes, such as meiotic drive elements, propagate themselves through a population without increasing the fitness of host organisms. X-linked (or Y-linked) meiotic drive elements reduce the transmission of the Y (X) chromosome and skew progeny and population sex ratios, leading to intense conflict among genomic compartments. Drosophila simulans is unusual in having a least three distinct systems of X chromosome meiotic drive. Here, we characterize naturally occurring genetic variation at the Winters sex-ratio driver (Distorter on the X or Dox), its progenitor gene (Mother of Dox or MDox), and its suppressor gene (Not Much Yang or Nmy), which have been previously mapped and characterized. We survey three North American populations as well as 13 globally distributed strains and present molecular polymorphism data at the three loci. We find that all three genes show signatures of selection in North America, judging from levels of polymorphism and skews in the site-frequency spectrum. These signatures likely result from the biased transmission of the driver and selection on the suppressor for the maintenance of equal sex ratios. Coalescent modeling indicates that the timing of selection is more recent than the age of the alleles, suggesting that the driver and suppressor are coevolving under an evolutionary “arms race.” None of the Winters sex-ratio genes are fixed in D. simulans, and at all loci we find ancestral alleles, which lack the gene insertions and exhibit high levels of nucleotide polymorphism compared to the derived alleles. In addition, we find several “null” alleles that have mutations on the derived Dox background, which result in loss of drive function. We discuss the possible causes of the maintenance of presence–absence polymorphism in the Winters sex-ratio genes.
- Published
- 2010
- Full Text
- View/download PDF
31. Archaic Human Admixture
- Author
-
Daniel Garrigan and Sarah B. Kingan
- Subjects
Archeology ,education.field_of_study ,Nuclear gene ,Population ,Genetic variants ,Neanderthal genome project ,Reproductive isolation ,Biology ,biology.organism_classification ,Genome ,Genealogy ,Anthropology ,Anatomically modern human ,Human genome ,education - Abstract
The long‐standing debate over admixture between the anatomically modern human population and archaic forms of Homo has recently resurfaced, fueled by new evolutionary studies of the nuclear genome. Several genetic studies arrive at conflicting conclusions regarding the tempo at which reproductive barriers between primate species evolve, but new studies are uncovering indirect genetic evidence that refutes the existence of such barriers among hominins. An ambitious project of sequencing the Neanderthal genome promises to provide a detailed but not necessarily conclusive perspective on the admixture problem. Working under the assumption that members of the genus Homo could produce viable hybrid offspring, several new models have emerged detailing how archaic populations may have become extinct while still leaving behind their genetic footprints in the modern human genome. One intriguing hypothesis is that the expanding anatomically modern human population acquired locally adapted genetic variants from endem...
- Published
- 2007
- Full Text
- View/download PDF
32. Global patterns of human mitochondrial DNA and Y-chromosome structure are not influenced by higher migration rates of females versus males
- Author
-
Jason A. Wilder, Sarah B. Kingan, Maya Metni Pilkington, Zahra Mobasher, and Michael F. Hammer
- Subjects
Male ,Mitochondrial DNA ,Molecular Sequence Data ,Population Dynamics ,Population ,Alu element ,Biology ,Y chromosome ,DNA, Mitochondrial ,Human mitochondrial genetics ,Alu Elements ,Genetic variation ,Genetics ,Humans ,Genetic variability ,education ,Family Characteristics ,Sex Characteristics ,education.field_of_study ,Chromosomes, Human, Y ,Models, Genetic ,Genetic Variation ,Emigration and Immigration ,Genetics, Population ,Genetic structure ,Female - Abstract
Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (Phi(ST)) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.
- Published
- 2004
- Full Text
- View/download PDF
33. Reduced Polymorphism in the Chimpanzee Semen Coagulating Protein, Semenogelin I
- Author
-
Sarah B. Kingan, David M. Rand, and Marc Tatar
- Subjects
Male ,Genetics ,Gorilla gorilla ,Polymorphism, Genetic ,Base Sequence ,Pan troglodytes ,Locus (genetics) ,Semen ,Pan paniscus ,Biology ,Seminal Vesicle Secretory Proteins ,Sperm ,Evolution, Molecular ,Semenogelin I ,Animals ,Humans ,Mating plug ,Selective sweep ,Sequence Alignment ,Molecular Biology ,Sperm competition ,Ecology, Evolution, Behavior and Systematics ,Semenogelin - Abstract
The semen of many primate species coagulates into a mating plug believed to prevent the sperm of subsequent mating events from accessing the ova. The texture of the coagulum varies among species: from a semisoft mass in humans to a firm plug in chimpanzees. In humans, a component of the coagulum, semenogelin I, also inhibits sperm motility. We tested the hypothesis that polymorphism and divergence at semenogelin I differ among hominoid species with different mating systems. Sequence data for the semenogelin I locus were obtained from 12 humans, 10 chimpanzees, 7 gorillas, and 1 bonobo. Mitochondrial D-loop data were collected from a subset of individuals to assess levels of variation at an unlinked locus. HKA tests using D-loop sequence data revealed a significant reduction of polymorphism at semenogelin I in chimpanzees, consistent with predictions of a selective sweep at this locus. This result was supported by independent HKA tests using polymorphism data from a putatively neutral locus from the literature. Humans show a similar trend toward reduced polymorphism, although HKA tests were only marginally significant. Gorilla sequence data show evidence of functional loss at the semenogelin I locus, indicated by stop codons within the putative open reading frame as well as high levels of polymorphism. Elevated K a/K s ratios within the Pan–Homo clade suggest a history of positive selection at semenogelin I. Our results suggest that there is a positive relationship between the intensity of sperm competition in a species and the strength of positive Darwinian selection on the seminal protein semenogelin I.
- Published
- 2003
- Full Text
- View/download PDF
34. A selective sweep across species boundaries in Drosophila
- Author
-
Cara L. Brand, Long Jun Wu, Sarah B. Kingan, and Daniel Garrigan
- Subjects
Gene Flow ,Male ,Linkage disequilibrium ,Genetic Speciation ,Introgression ,Generalist and specialist species ,Drosophila sechellia ,Linkage Disequilibrium ,Gene flow ,Evolution, Molecular ,Species Specificity ,Genetics ,Animals ,Selection, Genetic ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,Phylogeny ,Discoveries ,biology ,Haplotype ,fungi ,Chromosome Mapping ,Genetic Variation ,biology.organism_classification ,Chromosomes, Insect ,Haplotypes ,Evolutionary biology ,Mutation ,Drosophila ,Female ,Selective sweep - Abstract
Adaptive mutations that accumulate during species divergence are likely to contribute to reproductive incompatibilities and hinder gene flow; however, there may also be a class of mutations that are generally advantageous and can spread across species boundaries. In this study, we characterize a 15 kb region on chromosome 3R that has introgressed from the cosmopolitan generalist species Drosophila simulans into the island endemic D. sechellia, which is an ecological specialist. The introgressed haplotype is fixed in D. sechellia over almost the entirety of the resequenced region, whereas a core region of the introgressed haplotype occurs at high frequency in D. simulans. The observed patterns of nucleotide variation and linkage disequilibrium are consistent with a recently completed selective sweep in D. sechellia and an incomplete sweep in D. simulans. Independent estimates of both the time to the introgression and sweep events are all close to 10,000 years before the present. Interestingly, the most likely target of selection is a highly occupied transcription factor binding region. This work confirms that it is possible for mutations to be globally advantageous, despite their occurrence in divergent genomic and ecological backgrounds.
- Published
- 2013
35. Genome sequencing reveals complex speciation in the Drosophila simulans clade
- Author
-
Kevin R. Thornton, Anthony J. Geneva, Peter Andolfatto, Sarah B. Kingan, Daven C. Presgraves, Andrew G. Clark, and Daniel Garrigan
- Subjects
Polytomy ,Gene Flow ,Reproductive Isolation ,Genetic Speciation ,Genome, Insect ,Drosophila sechellia ,Gene flow ,Evolution, Molecular ,Genetics ,Animals ,Selection, Genetic ,Clade ,Mauritiana ,Genetics (clinical) ,Phylogeny ,Population Density ,biology ,Phylogenetic tree ,Base Sequence ,Research ,Reproductive isolation ,Sequence Analysis, DNA ,biology.organism_classification ,Chromosomes, Insect ,Haplotypes ,Evolutionary biology ,Drosophila ,Sequence Alignment - Abstract
The three species of the Drosophila simulans clade—the cosmopolitan species, D. simulans, and the two island endemic species, D. mauritiana and D. sechellia—are important models in speciation genetics, but some details of their phylogenetic and speciation history remain unresolved. The order and timing of speciation are disputed, and the existence, magnitude, and timing of gene flow among the three species remain unclear. Here we report on the analysis of a whole-genome four-species sequence alignment that includes all three D. simulans clade species as well as the D. melanogaster reference sequence. The alignment comprises novel, paired short-read sequence data from a single highly inbred line each from D. simulans, D. mauritiana, and D. sechellia. We are unable to reject a species phylogeny with a basal polytomy; the estimated age of the polytomy is 242,000 yr before the present. However, we also find that up to 4.6% of autosomal and 2.2% of X-linked regions have evolutionary histories consistent with recent gene flow between the mainland species (D. simulans) and the two island endemic species (D. mauritiana and D. sechellia). Our findings thus show that gene flow has occurred throughout the genomes of the D. simulans clade species despite considerable geographic, ecological, and intrinsic reproductive isolation. Last, our analysis of lineage-specific changes confirms that the D. sechellia genome has experienced a significant excess of slightly deleterious changes and a dearth of presumed favorable changes. The relatively reduced efficacy of natural selection in D. sechellia is consistent with its derived, persistently reduced historical effective population size.
- Published
- 2012
36. Cis- and trans-acting genetic factors contribute to heterogeneity in the rate of crossing over between the Drosophila simulans clade species
- Author
-
Daven C. Presgraves, Sarah B. Kingan, and M. V. Cattani
- Subjects
Genetics ,X Chromosome ,Genetic Speciation ,Introgression ,Chromosome Mapping ,Genetic Variation ,Biology ,biology.organism_classification ,Drosophila mauritiana ,Drosophila sechellia ,Chromosomal crossover ,Genetic variation ,Animals ,Drosophila ,Female ,Crossing Over, Genetic ,Drosophila (subgenus) ,Ecology, Evolution, Behavior and Systematics ,Recombination ,X chromosome - Abstract
In the genus Drosophila, variation in recombination rates has been found within and between species. Genetic variation for both cis- and trans-acting factors has been shown to affect recombination rates within species, but little is known about the genetic factors that affect differences between species. Here, we estimate rates of crossing over for seven segments that tile across the euchromatic length of the X chromosome in the genetic backgrounds of three closely related Drosophila species. We first generated a set of Drosophila mauritiana lines each having two semidominant visible markers on the X chromosome and then introgressed these doubly marked segments into the genetic backgrounds of its sibling species, Drosophila simulans and Drosophila sechellia. Using these 21 lines (seven segments, three genetic backgrounds), we tested whether recombination rates within the doubly marked intervals differed depending on genetic background. We find significant heterogeneity among intervals and among species backgrounds. Our results suggest that a combination of both cis- and trans-acting factors have evolved among the three D. simulans clade species and interact to affect recombination rate.
- Published
- 2012
37. Sex chromosome-specific regulation in the Drosophila male germline but little evidence for chromosomal dosage compensation or meiotic inactivation
- Author
-
Jodi M. Cook, Colin D. Meiklejohn, Emily L. Landeen, Daven C. Presgraves, and Sarah B. Kingan
- Subjects
Male ,QH301-705.5 ,DNA transcription ,General Biochemistry, Genetics and Molecular Biology ,Germline ,X hyperactivation ,Transcriptomes ,03 medical and health sciences ,0302 clinical medicine ,Model Organisms ,X Chromosome Inactivation ,Genome Analysis Tools ,Dosage Compensation, Genetic ,Testis ,Genetics ,Animals ,Biology (General) ,Spermatogenesis ,Skewed X-inactivation ,Biology ,X chromosome ,X-linked recessive inheritance ,030304 developmental biology ,0303 health sciences ,Evolutionary Biology ,Dosage compensation ,Autosome ,Sex Chromosomes ,General Immunology and Microbiology ,biology ,Chromosome Biology ,General Neuroscience ,Drosophila Melanogaster ,Genomic Evolution ,Genomics ,Animal Models ,biology.organism_classification ,Meiosis ,Germ Cells ,Drosophila ,Female ,Gene expression ,Drosophila melanogaster ,General Agricultural and Biological Sciences ,Genome Expression Analysis ,030217 neurology & neurosurgery ,Research Article - Abstract
Suppression of X-linked transgene reporters versus normal expression of endogenous X-linked genes suggest a novel form of X chromosome-specific regulation in Drosophila testes, instead of sex chromosome dosage compensation or meiotic inactivation., The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation—the equalization of X chromosome gene expression in males and females— and meiotic sex chromosome inactivation (MSCI)—the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila., Author Summary Many species have heteromorphic sex chromosomes (XY males or ZW females) where one sex chromosome (the Y or W) has degenerated. In the somatic cells of mammals, worms, and flies, the X-to-autosome ratio of expression is equalized between the sexes by dedicated sex chromosome-specific dosage compensation systems. In the germline cells of male mammals and worms, however, the X chromosome is transcriptionally silenced early in meiosis. Here we have analyzed gene expression in Drosophila testes and show that the X chromosome lacks both of these types of chromosomal regulation. We find that X chromosome-wide dosage compensation is absent from most cells in the Drosophila male germline, and there is little or no evidence for X chromosome-specific inactivation during meiosis. However, another kind of sex-chromosome-specific regulation occurs. Testes-specific transgene reporters show much weaker expression when inserted on the X chromosome versus the autosomes, suggesting that some other, uncharacterized mechanism limits their expression from the X during spermatogenesis. The strong suppression of X-linked transgenes—but not X-linked endogenous genes—suggests that endogenous X-linked testes-specific promoters might have adapted to a suppressive X chromosome environment in the Drosophila male germline.
- Published
- 2010
38. Testing for archaic hominin admixture on the X chromosome: model likelihoods for the modern human RRM2P4 region from summaries of genealogical topology under the structured coalescent
- Author
-
Tatiana M. Karafet, Maya Metni Pilkington, Michael F. Hammer, Murray P. Cox, Giovanni Destro-Bisol, Fernando L. Mendez, Beverly I. Strassmann, and Sarah B. Kingan
- Subjects
Most recent common ancestor ,Range (biology) ,molecular-clock ,Pseudogene ,rrm2p4 region ,Biology ,Investigations ,Genome ,Coalescent theory ,Genetics ,Humans ,Clade ,Phylogeny ,Demography ,Chromosomes, Human, X ,Likelihood Functions ,Models, Genetic ,x chromosome ,Genetic Variation ,Sequence Analysis, DNA ,biology.organism_classification ,Anatomically modern human ,DNA, Intergenic ,Homo erectus ,Genealogy and Heraldry - Abstract
A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of ∼2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.
- Published
- 2008
39. Contrasting signatures of population growth for mitochondrial DNA and Y chromosomes among human populations in Africa
- Author
-
Zahra Mobasher, Himla Soodyall, Murray P. Cox, Giovanni Destro-Bisol, Chiara Batini, Jason A. Wilder, Beverly I. Strassmann, August E. Woerner, Fernando L. Mendez, Maya Metni Pilkington, Michael F. Hammer, Sarah B. Kingan, and Thiep Angui
- Subjects
Genetics ,Male ,Population Density ,Mitochondrial DNA ,education.field_of_study ,Chromosomes, Human, Y ,Polymorphism, Genetic ,Population size ,Population ,Locus (genetics) ,Biology ,Y chromosome ,DNA, Mitochondrial ,Coalescent theory ,Electron Transport Complex IV ,Population bottleneck ,Genetics, Population ,Effective population size ,Africa ,Humans ,education ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,hunter-gatherers ,mtdna ,population growth ,sub-saharan africa ,homo sapiens ,nry ,hunter-gatherer - Abstract
A history of Pleistocene population expansion has been inferred from the frequency spectrum of polymorphism in the mitochondrial DNA (mtDNA) of many human populations. Similar patterns are not typically observed for autosomal and X-linked loci. One explanation for this discrepancy is a recent population bottleneck, with different rates of recovery for haploid and autosomal loci as a result of their different effective population sizes. This hypothesis predicts that mitochondrial and Y chromosomal DNA will show a similar skew in the frequency spectrum in populations that have experienced a recent increase in effective population size. We test this hypothesis by resequencing 6.6 kb of noncoding Y chromosomal DNA and 780 basepairs of the mtDNA cytochrome c oxidase subunit III (COIII) gene in 172 males from 5 African populations. Four tests of population expansion are employed for each locus in each population: Fu's F s statistic, the R 2 statistic, coalescent simulations, and the mismatch distribution. Consistent with previous results, patterns of mtDNA polymorphism better fit a model of constant population size for food-gathering populations and a model of population expansion for food-producing populations. In contrast, none of the tests reveal evidence of Y chromosome growth for either food-gatherers or food-producers. The distinct mtDNA and Y chromosome polymorphism patterns most likely reflect sex-biased demographic processes in the recent history of African populations. We hypothesize that males experienced smaller effective population sizes and/or lower rates of migration during the Bantu expansion, which occurred over the last 5,000 years.
- Published
- 2008
40. A sex-ratio meiotic drive system in Drosophila simulans. II: an X-linked distorter
- Author
-
Luciana O. Araripe, Yeyan Ke, Sarah B. Kingan, Hailian Xiao, Daniel L. Hartl, and Yun Tao
- Subjects
0106 biological sciences ,Genetics ,0303 health sciences ,General Immunology and Microbiology ,Human evolutionary genetics ,QH301-705.5 ,General Neuroscience ,Chromosome ,Biology ,Y chromosome ,biology.organism_classification ,010603 evolutionary biology ,01 natural sciences ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Meiotic drive ,Meiosis ,Evolutionary biology ,Drosophila melanogaster ,Biology (General) ,General Agricultural and Biological Sciences ,Heterogametic sex ,X chromosome ,030304 developmental biology - Abstract
The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de nova. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.
- Published
- 2007
41. Inferring human population sizes, divergence times and rates of gene flow from mitochondrial, X and Y chromosome resequencing data
- Author
-
Jason A. Wilder, Andrea Novelletto, Maya Metni Pilkington, Jonathan S. Friedlaender, Sarah B. Kingan, Peter de Knijff, Daniel Garrigan, Murray P. Cox, Giovanni Destro-Bisol, Himla Soodyall, Michael F. Hammer, and Beverly I. Strassmann
- Subjects
Population Dynamics ,Gene flow ,Coalescent theory ,Effective population size ,genetic variability ,mitochondrion ,comparative study ,X chromosome ,Genetics ,education.field_of_study ,Continental Population Groups ,article ,mitochondrial dna ,gene isolation ,Markov Chains ,Mitochondrial ,Europe ,Monte Carlo method ,priority journal ,history ,population size ,Gene Flow ,Mitochondrial DNA ,Asia ,gene locus ,probability ,Population ,gene sequence ,Investigations ,Biology ,Y chromosome ,DNA, Mitochondrial ,Chromosomes ,Humans ,controlled study ,x chromosome ,y chromosome ,human ,education ,Population Density ,Chromosomes, Human, X ,Chromosomes, Human, Y ,Base Sequence ,Racial Groups ,population genetics ,DNA ,Settore BIO/18 - Genetica ,Genetics, Population ,Population bottleneck ,mitochondrial DNA ,Africa ,gene flow ,genetic model ,population growth - Abstract
We estimate parameters of a general isolation-with-migration model using resequence data from mitochondrial DNA (mtDNA), the Y chromosome, and two loci on the X chromosome in samples of 25–50 individuals from each of 10 human populations. Application of a coalescent-based Markov chain Monte Carlo technique allows simultaneous inference of divergence times, rates of gene flow, as well as changes in effective population size. Results from comparisons between sub-Saharan African and Eurasian populations estimate that 1500 individuals founded the ancestral Eurasian population ∼40 thousand years ago (KYA). Furthermore, these small Eurasian founding populations appear to have grown much more dramatically than either African or Oceanian populations. Analyses of sub-Saharan African populations provide little evidence for a history of population bottlenecks and suggest that they began diverging from one another upward of 50 KYA. We surmise that ancestral African populations had already been geographically structured prior to the founding of ancestral Eurasian populations. African populations are shown to experience low levels of mitochondrial DNA gene flow, but high levels of Y chromosome gene flow. In particular, Y chromosome gene flow appears to be asymmetric, i.e., from the Bantu-speaking population into other African populations. Conversely, mitochondrial gene flow is more extensive between non-African populations, but appears to be absent between European and Asian populations.
- Published
- 2007
42. Genome Diversity and Divergence in Drosophila mauritiana: Multiple Signatures of Faster X Evolution
- Author
-
Jeffrey P. Vedanayagam, Anthony J. Geneva, Sarah B. Kingan, Daniel Garrigan, and Daven C. Presgraves
- Subjects
Male ,0106 biological sciences ,Genetic Speciation ,Genome, Insect ,satellite DNA ,adaptation ,Drosophila mauritiana ,010603 evolutionary biology ,01 natural sciences ,Drosophila sechellia ,Nucleotide diversity ,selective sweep ,X chromosome ,Evolution, Molecular ,03 medical and health sciences ,Genetics ,Animals ,14. Life underwater ,Mauritiana ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,0303 health sciences ,Genome ,Autosome ,Models, Genetic ,biology ,Reproduction ,Genetic Variation ,Reproductive isolation ,biology.organism_classification ,Chromosomes, Insect ,Evolutionary biology ,Drosophila ,Female ,Corrigendum ,Selective sweep ,Research Article - Abstract
Drosophila mauritiana is an Indian Ocean island endemic species that diverged from its two sister species, Drosophila simulans and Drosophila sechellia, approximately 240,000 years ago. Multiple forms of incomplete reproductive isolation have evolved among these species, including sexual, gametic, ecological, and intrinsic postzygotic barriers, with crosses among all three species conforming to Haldane's rule: F(1) hybrid males are sterile and F(1) hybrid females are fertile. Extensive genetic resources and the fertility of hybrid females have made D. mauritiana, in particular, an important model for speciation genetics. Analyses between D. mauritiana and both of its siblings have shown that the X chromosome makes a disproportionate contribution to hybrid male sterility. But why the X plays a special role in the evolution of hybrid sterility in these, and other, species remains an unsolved problem. To complement functional genetic analyses, we have investigated the population genomics of D. mauritiana, giving special attention to differences between the X and the autosomes. We present a de novo genome assembly of D. mauritiana annotated with RNAseq data and a whole-genome analysis of polymorphism and divergence from ten individuals. Our analyses show that, relative to the autosomes, the X chromosome has reduced nucleotide diversity but elevated nucleotide divergence; an excess of recurrent adaptive evolution at its protein-coding genes; an excess of recent, strong selective sweeps; and a large excess of satellite DNA. Interestingly, one of two centimorgan-scale selective sweeps on the D. mauritiana X chromosome spans a region containing two sex-ratio meiotic drive elements and a high concentration of satellite DNA. Furthermore, genes with roles in reproduction and chromosome biology are enriched among genes that have histories of recurrent adaptive protein evolution. Together, these genome-wide analyses suggest that genetic conflict and frequent positive natural selection on the X chromosome have shaped the molecular evolutionary history of D. mauritiana, refining our understanding of the possible causes of the large X-effect in speciation.
- Published
- 2015
- Full Text
- View/download PDF
43. Deep haplotype divergence and long-range linkage disequilibrium at xp21.1 provide evidence that humans descend from a structured ancestral population
- Author
-
Daniel Garrigan, Zahra Mobasher, Sarah B. Kingan, Michael F. Hammer, and Jason A. Wilder
- Subjects
Genetics ,Panmixia ,Linkage disequilibrium ,education.field_of_study ,Chromosomes, Human, X ,Fossils ,Haplotype ,Population ,Locus (genetics) ,Biology ,Investigations ,Biological Evolution ,Linkage Disequilibrium ,Genetics, Population ,Human evolution ,Haplotypes ,Evolutionary biology ,Homo sapiens ,Humans ,education ,Null hypothesis - Abstract
Fossil evidence links human ancestry with populations that evolved from modern gracile morphology in Africa 130,000–160,000 years ago. Yet fossils alone do not provide clear answers to the question of whether the ancestors of all modern Homo sapiens comprised a single African population or an amalgamation of distinct archaic populations. DNA sequence data have consistently supported a single-origin model in which anatomically modern Africans expanded and completely replaced all other archaic hominin populations. Aided by a novel experimental design, we present the first genetic evidence that statistically rejects the null hypothesis that our species descends from a single, historically panmictic population. In a global sample of 42 X chromosomes, two African individuals carry a lineage of noncoding 17.5-kb sequence that has survived for >1 million years without any clear traces of ongoing recombination with other lineages at this locus. These patterns of deep haplotype divergence and long-range linkage disequilibrium are best explained by a prolonged period of ancestral population subdivision followed by relatively recent interbreeding. This inference supports human evolution models that incorporate admixture between divergent African branches of the genus Homo.
- Published
- 2005
44. A new method to scan genomes for introgression in a secondary contact model.
- Author
-
Anthony J Geneva, Christina A Muirhead, Sarah B Kingan, and Daniel Garrigan
- Subjects
Medicine ,Science - Abstract
Secondary contact between divergent populations or incipient species may result in the exchange and introgression of genomic material. We develop a simple DNA sequence measure, called Gmin, which is designed to identify genomic regions experiencing introgression in a secondary contact model. Gmin is defined as the ratio of the minimum between-population number of nucleotide differences in a genomic window to the average number of between-population differences. Although it is conceptually simple, one advantage of Gmin is that it is computationally inexpensive relative to model-based methods for detecting gene flow and it scales easily to the level of whole-genome analysis. We compare the sensitivity and specificity of Gmin to those of the widely used index of population differentiation, FST, and suggest a simple statistical test for identifying genomic outliers. Extensive computer simulations demonstrate that Gmin has both greater sensitivity and specificity for detecting recent introgression than does FST. Furthermore, we find that the sensitivity of Gmin is robust with respect to both the population mutation and recombination rates. Finally, a scan of Gmin across the X chromosome of Drosophila melanogaster identifies candidate regions of introgression between sub-Saharan African and cosmopolitan populations that were previously missed by other methods. These results show that Gmin is a biologically straightforward, yet powerful, alternative to FST, as well as to more computationally intensive model-based methods for detecting gene flow.
- Published
- 2015
- Full Text
- View/download PDF
45. Sex chromosome-specific regulation in the Drosophila male germline but little evidence for chromosomal dosage compensation or meiotic inactivation.
- Author
-
Colin D Meiklejohn, Emily L Landeen, Jodi M Cook, Sarah B Kingan, and Daven C Presgraves
- Subjects
Biology (General) ,QH301-705.5 - Abstract
The evolution of heteromorphic sex chromosomes (e.g., XY in males or ZW in females) has repeatedly elicited the evolution of two kinds of chromosome-specific regulation: dosage compensation--the equalization of X chromosome gene expression in males and females--and meiotic sex chromosome inactivation (MSCI)--the transcriptional silencing and heterochromatinization of the X during meiosis in the male (or Z in the female) germline. How the X chromosome is regulated in the Drosophila melanogaster male germline is unclear. Here we report three new findings concerning gene expression from the X in Drosophila testes. First, X chromosome-wide dosage compensation appears to be absent from most of the Drosophila male germline. Second, microarray analysis provides no evidence for X chromosome-specific inactivation during meiosis. Third, we confirm the previous discovery that the expression of transgene reporters driven by autosomal spermatogenesis-specific promoters is strongly reduced when inserted on the X chromosome versus the autosomes; but we show that this chromosomal difference in expression is established in premeiotic cells and persists in meiotic cells. The magnitude of the X-autosome difference in transgene expression cannot be explained by the absence of dosage compensation, suggesting that a previously unrecognized mechanism limits expression from the X during spermatogenesis in Drosophila. These findings help to resolve several previously conflicting reports and have implications for patterns of genome evolution and speciation in Drosophila.
- Published
- 2011
- Full Text
- View/download PDF
46. A sex-ratio meiotic drive system in Drosophila simulans. II: an X-linked distorter.
- Author
-
Yun Tao, Luciana Araripe, Sarah B Kingan, Yeyan Ke, Hailian Xiao, and Daniel L Hartl
- Subjects
Biology (General) ,QH301-705.5 - Abstract
The evolution of heteromorphic sex chromosomes creates a genetic condition favoring the invasion of sex-ratio meiotic drive elements, resulting in the biased transmission of one sex chromosome over the other, in violation of Mendel's first law. The molecular mechanisms of sex-ratio meiotic drive may therefore help us to understand the evolutionary forces shaping the meiotic behavior of the sex chromosomes. Here we characterize a sex-ratio distorter on the X chromosome (Dox) in Drosophila simulans by genetic and molecular means. Intriguingly, Dox has very limited coding capacity. It evolved from another X-linked gene, which also evolved de nova. Through retrotransposition, Dox also gave rise to an autosomal suppressor, not much yang (Nmy). An RNA interference mechanism seems to be involved in the suppression of the Dox distorter by the Nmy suppressor. Double mutant males of the genotype dox; nmy are normal for both sex-ratio and spermatogenesis. We postulate that recurrent bouts of sex-ratio meiotic drive and its subsequent suppression might underlie several common features observed in the heterogametic sex, including meiotic sex chromosome inactivation and achiasmy.
- Published
- 2007
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.