22 results on '"Hasenfeld, Patrick"'
Search Results
2. Cell-type-specific consequences of mosaic structural variants in hematopoietic stem and progenitor cells
- Author
-
Grimes, Karen, Jeong, Hyobin, Amoah, Amanda, Xu, Nuo, Niemann, Julian, Raeder, Benjamin, Hasenfeld, Patrick, Stober, Catherine, Rausch, Tobias, Benito, Eva, Jann, Johann-Christoph, Nowak, Daniel, Emini, Ramiz, Hoenicka, Markus, Liebold, Andreas, Ho, Anthony, Shuai, Shimin, Geiger, Hartmut, Sanders, Ashley D., and Korbel, Jan O.
- Published
- 2024
- Full Text
- View/download PDF
3. Gaps and complex structurally variant loci in phased genome assemblies
- Author
-
Porubsky, David, Vollger, Mitchell R, Harvey, William T, Rozanski, Allison N, Ebert, Peter, Hickey, Glenn, Hasenfeld, Patrick, Sanders, Ashley D, Stober, Catherine, Consortium, Human Pangenome Reference, Korbel, Jan O, Paten, Benedict, Marschall, Tobias, Eichler, Evan E, Abel, Haley J, Antonacci-Fulton, Lucinda L, Asri, Mobin, Baid, Gunjan, Baker, Carl A, Belyaeva, Anastasiya, Billis, Konstantinos, Bourque, Guillaume, Buonaiuto, Silvia, Carroll, Andrew, Chaisson, Mark JP, Chang, Pi-Chuan, Chang, Xian H, Cheng, Haoyu, Chu, Justin, Cody, Sarah, Colonna, Vincenza, Cook, Daniel E, Cook-Deegan, Robert M, Cornejo, Omar E, Diekhans, Mark, Doerr, Daniel, Ebler, Jana, Eizenga, Jordan M, Fairley, Susan, Fedrigo, Olivier, Felsenfeld, Adam L, Feng, Xiaowen, Fischer, Christian, Flicek, Paul, Formenti, Giulio, Frankish, Adam, Fulton, Robert S, Gao, Yan, Garg, Shilpa, Garrison, Erik, Garrison, Nanibaa’ A, Giron, Carlos Garcia, Green, Richard E, Groza, Cristian, Guarracino, Andrea, Haggerty, Leanne, Hall, Ira M, Haukness, Marina, Haussler, David, Heumos, Simon, Hoekzema, Kendra, Hourlier, Thibaut, Howe, Kerstin, Jain, Miten, Jarvis, Erich D, Ji, Hanlee P, Kenny, Eimear E, Koenig, Barbara A, Kolesnikov, Alexey, Kordosky, Jennifer, Koren, Sergey, Lee, HoJoon, Lewis, Alexandra P, Li, Heng, Liao, Wen-Wei, Lu, Shuangjia, Lu, Tsung-Yu, Lucas, Julian K, Magalhães, Hugo, Marco-Sola, Santiago, Marijon, Pierre, Markello, Charles, Martin, Fergal J, McCartney, Ann, McDaniel, Jennifer, Miga, Karen H, Mitchell, Matthew W, Monlong, Jean, Mountcastle, Jacquelyn, Munson, Katherine M, Mwaniki, Moses Njagi, Nattestad, Maria, Novak, Adam M, and Nurk, Sergey
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Humans ,DNA ,Satellite ,Polymorphism ,Genetic ,Haplotypes ,Segmental Duplications ,Genomic ,Sequence Analysis ,DNA ,Human Pangenome Reference Consortium ,Medical and Health Sciences ,Bioinformatics - Abstract
There has been tremendous progress in phased genome assembly production by combining long-read data with parental information or linked-read data. Nevertheless, a typical phased genome assembly generated by trio-hifiasm still generates more than 140 gaps. We perform a detailed analysis of gaps, assembly breaks, and misorientations from 182 haploid assemblies obtained from a diversity panel of 77 unique human samples. Although trio-based approaches using HiFi are the current gold standard, chromosome-wide phasing accuracy is comparable when using Strand-seq instead of parental data. Importantly, the majority of assembly gaps cluster near the largest and most identical repeats (including segmental duplications [35.4%], satellite DNA [22.3%], or regions enriched in GA/AT-rich DNA [27.4%]). Consequently, 1513 protein-coding genes overlap assembly gaps in at least one haplotype, and 231 are recurrently disrupted or missing from five or more haplotypes. Furthermore, we estimate that 6-7 Mbp of DNA are misorientated per haplotype irrespective of whether trio-free or trio-based approaches are used. Of these misorientations, 81% correspond to bona fide large inversion polymorphisms in the human species, most of which are flanked by large segmental duplications. We also identify large-scale alignment discontinuities consistent with 11.9 Mbp of deletions and 161.4 Mbp of insertions per haploid genome. Although 99% of this variation corresponds to satellite DNA, we identify 230 regions of euchromatic DNA with frequent expansions and contractions, nearly half of which overlap with 197 protein-coding genes. Such variable and incompletely assembled regions are important targets for future algorithmic development and pangenome representation.
- Published
- 2023
4. Semi-automated assembly of high-quality diploid human reference genomes
- Author
-
Jarvis, Erich D, Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R, Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A, Carnevali, Paolo, Chaisson, Mark JP, Chin, Chen-Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S, Fulton, Lucinda L, Garg, Shilpa, Gerton, Jennifer L, Ghurye, Jay, Granat, Anastasiya, Green, Richard E, Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B, Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O, Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W, McDaniel, Jennifer, Nie, Fan, Olsen, Hugh E, Olson, Nathan D, Pesout, Trevor, Potapova, Tamara, Puiu, Daniela, Regier, Allison, Ruan, Jue, Salzberg, Steven L, Sanders, Ashley D, Schatz, Michael C, Schmitt, Anthony, Schneider, Valerie A, Selvaraj, Siddarth, Shafin, Kishwar, Shumate, Alaina, Stitziel, Nathan O, Stober, Catherine, Torrance, James, Wagner, Justin, Wang, Jianxin, Wenger, Aaron, Xiao, Chuanle, Zimin, Aleksey V, Zhang, Guojie, Wang, Ting, Li, Heng, Garrison, Erik, Haussler, David, Hall, Ira, Zook, Justin M, Eichler, Evan E, Phillippy, Adam M, Paten, Benedict, Howe, Kerstin, and Miga, Karen H
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Generic health relevance ,Humans ,Chromosome Mapping ,Diploidy ,Genome ,Human ,Haplotypes ,High-Throughput Nucleotide Sequencing ,Sequence Analysis ,DNA ,Reference Standards ,Genomics ,Chromosomes ,Human ,Genetic Variation ,Human Pangenome Reference Consortium ,General Science & Technology - Abstract
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.
- Published
- 2022
5. Assembly of 43 human Y chromosomes reveals extensive complexity and variation
- Author
-
Hallast, Pille, Ebert, Peter, Loftus, Mark, Yilmaz, Feyza, Audano, Peter A., Logsdon, Glennis A., Bonder, Marc Jan, Zhou, Weichen, Höps, Wolfram, Kim, Kwondo, Li, Chong, Hoyt, Savannah J., Dishuck, Philip C., Porubsky, David, Tsetsos, Fotios, Kwon, Jee Young, Zhu, Qihui, Munson, Katherine M., Hasenfeld, Patrick, Harvey, William T., Lewis, Alexandra P., Kordosky, Jennifer, Hoekzema, Kendra, O’Neill, Rachel J., Korbel, Jan O., Tyler-Smith, Chris, Eichler, Evan E., Shi, Xinghua, Beck, Christine R., Marschall, Tobias, Konkel, Miriam K., and Lee, Charles
- Published
- 2023
- Full Text
- View/download PDF
6. Inverted triplications formed by iterative template switches generate structural variant diversity at genomic disorder loci
- Author
-
Grochowski, Christopher M., Bengtsson, Jesse D., Du, Haowei, Gandhi, Mira, Lun, Ming Yin, Mehaffey, Michele G., Park, KyungHee, Höps, Wolfram, Benito, Eva, Hasenfeld, Patrick, Korbel, Jan O., Mahmoud, Medhat, Paulin, Luis F., Jhangiani, Shalini N., Hwang, James Paul, Bhamidipati, Sravya V., Muzny, Donna M., Fatih, Jawid M., Gibbs, Richard A., Pendleton, Matthew, Harrington, Eoghan, Juul, Sissel, Lindstrand, Anna, Sedlazeck, Fritz J., Pehlivan, Davut, Lupski, James R., and Carvalho, Claudia M.B.
- Published
- 2024
- Full Text
- View/download PDF
7. Functional analysis of structural variants in single cells using Strand-seq
- Author
-
Jeong, Hyobin, Grimes, Karen, Rauwolf, Kerstin K., Bruch, Peter-Martin, Rausch, Tobias, Hasenfeld, Patrick, Benito, Eva, Roider, Tobias, Sabarinathan, Radhakrishnan, Porubsky, David, Herbst, Sophie A., Erarslan-Uysal, Büşra, Jann, Johann-Christoph, Marschall, Tobias, Nowak, Daniel, Bourquin, Jean-Pierre, Kulozik, Andreas E., Dietrich, Sascha, Bornhauser, Beat, Sanders, Ashley D., and Korbel, Jan O.
- Published
- 2023
- Full Text
- View/download PDF
8. Inversion polymorphism in a complete human genome assembly
- Author
-
Porubsky, David, Harvey, William T., Rozanski, Allison N., Ebler, Jana, Höps, Wolfram, Ashraf, Hufsah, Hasenfeld, Patrick, Paten, Benedict, Sanders, Ashley D., Marschall, Tobias, Korbel, Jan O., and Eichler, Evan E.
- Published
- 2023
- Full Text
- View/download PDF
9. Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders
- Author
-
Porubsky, David, Höps, Wolfram, Ashraf, Hufsah, Hsieh, PingHsun, Rodriguez-Martin, Bernardo, Yilmaz, Feyza, Ebler, Jana, Hallast, Pille, Maria Maggiolini, Flavia Angela, Harvey, William T., Henning, Barbara, Audano, Peter A., Gordon, David S., Ebert, Peter, Hasenfeld, Patrick, Benito, Eva, Zhu, Qihui, Lee, Charles, Antonacci, Francesca, Steinrücken, Matthias, Beck, Christine R., Sanders, Ashley D., Marschall, Tobias, Eichler, Evan E., and Korbel, Jan O.
- Published
- 2022
- Full Text
- View/download PDF
10. Break-induced replication underlies formation of inverted triplications and generates unexpected diversity in haplotype structures
- Author
-
Grochowski, Christopher M., primary, Bengtsson, Jesse D., additional, Du, Haowei, additional, Gandhi, Mira, additional, Lun, Ming Yin, additional, Mehaffey, Michele G., additional, Park, KyungHee, additional, Höps, Wolfram, additional, Benito-Garagorri, Eva, additional, Hasenfeld, Patrick, additional, Korbel, Jan O., additional, Mahmoud, Medhat, additional, Paulin, Luis F., additional, Jhangiani, Shalini N., additional, Muzny, Donna M., additional, Fatih, Jawid M., additional, Gibbs, Richard A., additional, Pendleton, Matthew, additional, Harrington, Eoghan, additional, Juul, Sissel, additional, Lindstrand, Anna, additional, Sedlazeck, Fritz J., additional, Pehlivan, Davut, additional, Lupski, James R., additional, and Carvalho, Claudia M.B., additional
- Published
- 2023
- Full Text
- View/download PDF
11. Functional analysis of structural variants in single cells using Strand-seq
- Author
-
Jeong, Hyobin; https://orcid.org/0000-0002-1526-3343, Grimes, Karen; https://orcid.org/0000-0002-4805-2366, Rauwolf, Kerstin K, Bruch, Peter-Martin; https://orcid.org/0000-0002-9992-3109, Rausch, Tobias; https://orcid.org/0000-0001-5773-5620, Hasenfeld, Patrick; https://orcid.org/0000-0003-2319-2482, Benito, Eva, Roider, Tobias; https://orcid.org/0000-0002-6973-3531, Sabarinathan, Radhakrishnan, Porubsky, David; https://orcid.org/0000-0001-8414-8966, Herbst, Sophie A; https://orcid.org/0000-0001-8502-0366, Erarslan-Uysal, Büşra, Jann, Johann-Christoph; https://orcid.org/0000-0003-2200-0151, Marschall, Tobias; https://orcid.org/0000-0002-9376-1030, Nowak, Daniel; https://orcid.org/0000-0001-7130-7921, Bourquin, Jean-Pierre, Kulozik, Andreas E; https://orcid.org/0000-0003-1953-0848, Dietrich, Sascha, Bornhauser, Beat; https://orcid.org/0000-0003-2890-3191, Sanders, Ashley D; https://orcid.org/0000-0003-3945-0677, Korbel, Jan O; https://orcid.org/0000-0002-2798-3794, Jeong, Hyobin; https://orcid.org/0000-0002-1526-3343, Grimes, Karen; https://orcid.org/0000-0002-4805-2366, Rauwolf, Kerstin K, Bruch, Peter-Martin; https://orcid.org/0000-0002-9992-3109, Rausch, Tobias; https://orcid.org/0000-0001-5773-5620, Hasenfeld, Patrick; https://orcid.org/0000-0003-2319-2482, Benito, Eva, Roider, Tobias; https://orcid.org/0000-0002-6973-3531, Sabarinathan, Radhakrishnan, Porubsky, David; https://orcid.org/0000-0001-8414-8966, Herbst, Sophie A; https://orcid.org/0000-0001-8502-0366, Erarslan-Uysal, Büşra, Jann, Johann-Christoph; https://orcid.org/0000-0003-2200-0151, Marschall, Tobias; https://orcid.org/0000-0002-9376-1030, Nowak, Daniel; https://orcid.org/0000-0001-7130-7921, Bourquin, Jean-Pierre, Kulozik, Andreas E; https://orcid.org/0000-0003-1953-0848, Dietrich, Sascha, Bornhauser, Beat; https://orcid.org/0000-0003-2890-3191, Sanders, Ashley D; https://orcid.org/0000-0003-3945-0677, and Korbel, Jan O; https://orcid.org/0000-0002-2798-3794
- Abstract
Somatic structural variants (SVs) are widespread in cancer, but their impact on disease evolution is understudied due to a lack of methods to directly characterize their functional consequences. We present a computational method, scNOVA, which uses Strand-seq to perform haplotype-aware integration of SV discovery and molecular phenotyping in single cells by using nucleosome occupancy to infer gene expression as a readout. Application to leukemias and cell lines identifies local effects of copy-balanced rearrangements on gene deregulation, and consequences of SVs on aberrant signaling pathways in subclones. We discovered distinct SV subclones with dysregulated Wnt signaling in a chronic lymphocytic leukemia patient. We further uncovered the consequences of subclonal chromothripsis in T cell acute lymphoblastic leukemia, which revealed c-Myb activation, enrichment of a primitive cell state and informed successful targeting of the subclone in cell culture, using a Notch inhibitor. By directly linking SVs to their functional effects, scNOVA enables systematic single-cell multiomic studies of structural variation in heterogeneous cell populations.
- Published
- 2023
12. Additional file 1 of Inversion polymorphism in a complete human genome assembly
- Author
-
Porubsky, David, Harvey, William T., Rozanski, Allison N., Ebler, Jana, Höps, Wolfram, Ashraf, Hufsah, Hasenfeld, Patrick, Paten, Benedict, Sanders, Ashley D., Marschall, Tobias, Korbel, Jan O., and Eichler, Evan E.
- Abstract
Additional file 1: Figure S1. T2T-CHM13 inversion callset summary and comparison to GRCh38 (n = 373). Figure S2. Differences between GRCh38 and T2T-CHM13 callsets. Figure S3. Inversion callset summary with respect to T2T-CHM13 reference. Figure S4. Nonsyntenic and likely novel sites in T2T-CHM13 inversion calls. Figure S5. Enrichment of inversions in pericentromeric regions. Figure S6. Sequence composition of inversions from pericentromeric regions. Figure S7. Novel pericentromeric inversion on chromosome 1. Figure S8. Complete assemblies of chromosome 1 centromeric region. Figure S9. Relative position of alpha satellite array and novel pericentromeric inversion on chromosome 1. Figure S10. Inversion phasing at pericentromeric region of chromosome 7. Figure S11. Evaluation of putative misorients in GRCh38 with respect to T2T-CHM13. Figure S12. Evaluation of inversion differences between GRCh38 and T2T-CHM13 references. Figure S13. Examples of minor and misoriented alleles at chromosome 16. Figure S14. Structural differences at Xq28 between GRCh38 and T2T-CHM13. Figure S15. Diverse structural haplotypes at the Xq28 region. Figure S16. Structural differences at 16p12.2 between GRCh38 and T2T-CHM13. Figure S17. Topological differences at 16p12.2 between GRCh38 and T2T-CHM13. Figure S18. Rare inversions at disease relevant loci. Figure S19. Diverse structural haplotypes at 15q25.2 region. Figure S20. Assembled inversion breakpoints at 15q25.2 and inversion breakpoint mapping. Figure S21. Example of long-lasting misorientation errors in previous human genome references. Supplementary Notes. Consortia.
- Published
- 2023
- Full Text
- View/download PDF
13. Additional file 3 of Inversion polymorphism in a complete human genome assembly
- Author
-
Porubsky, David, Harvey, William T., Rozanski, Allison N., Ebler, Jana, Höps, Wolfram, Ashraf, Hufsah, Hasenfeld, Patrick, Paten, Benedict, Sanders, Ashley D., Marschall, Tobias, Korbel, Jan O., and Eichler, Evan E.
- Abstract
Additional file 3. Review history.
- Published
- 2023
- Full Text
- View/download PDF
14. Functional analysis of structural variants in single cells using Strand-seq
- Author
-
Jeong, Hyobin, primary, Grimes, Karen, additional, Rauwolf, Kerstin K., additional, Bruch, Peter-Martin, additional, Rausch, Tobias, additional, Hasenfeld, Patrick, additional, Benito, Eva, additional, Roider, Tobias, additional, Sabarinathan, Radhakrishnan, additional, Porubsky, David, additional, Herbst, Sophie A., additional, Erarslan-Uysal, Büşra, additional, Jann, Johann-Christoph, additional, Marschall, Tobias, additional, Nowak, Daniel, additional, Bourquin, Jean-Pierre, additional, Kulozik, Andreas E., additional, Dietrich, Sascha, additional, Bornhauser, Beat, additional, Sanders, Ashley D., additional, and Korbel, Jan O., additional
- Published
- 2022
- Full Text
- View/download PDF
15. A high-resolution map of small-scale inversions in the gibbon genome
- Author
-
Mercuri, Ludovica, primary, Palmisano, Donato, additional, L'Abbate, Alberto, additional, D'Addabbo, Pietro, additional, Montinaro, Francesco, additional, Catacchio, Claudia Rita, additional, Hasenfeld, Patrick, additional, Ventura, Mario, additional, Korbel, Jan O., additional, Sanders, Ashley D., additional, Maggiolini, Flavia Angela Maria, additional, and Antonacci, Francesca, additional
- Published
- 2022
- Full Text
- View/download PDF
16. Semi-automated assembly of high-quality diploid human reference genomes
- Author
-
Jarvis, Erich D., Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R., Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A., Carnevali, Paolo, Chaisson, Mark J.P., Chin, Chen Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S., Fulton, Lucinda L., Garg, Shilpa, Gerton, Jennifer L., Ghurye, Jay, Granat, Anastasiya, Green, Richard E., Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B., Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O., Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W., McDaniel, Jennifer, Nie, Fan, Zhang, Guojie, Li, Heng, Jarvis, Erich D., Formenti, Giulio, Rhie, Arang, Guarracino, Andrea, Yang, Chentao, Wood, Jonathan, Tracey, Alan, Thibaud-Nissen, Francoise, Vollger, Mitchell R., Porubsky, David, Cheng, Haoyu, Asri, Mobin, Logsdon, Glennis A., Carnevali, Paolo, Chaisson, Mark J.P., Chin, Chen Shan, Cody, Sarah, Collins, Joanna, Ebert, Peter, Escalona, Merly, Fedrigo, Olivier, Fulton, Robert S., Fulton, Lucinda L., Garg, Shilpa, Gerton, Jennifer L., Ghurye, Jay, Granat, Anastasiya, Green, Richard E., Harvey, William, Hasenfeld, Patrick, Hastie, Alex, Haukness, Marina, Jaeger, Erich B., Jain, Miten, Kirsche, Melanie, Kolmogorov, Mikhail, Korbel, Jan O., Koren, Sergey, Korlach, Jonas, Lee, Joyce, Li, Daofeng, Lindsay, Tina, Lucas, Julian, Luo, Feng, Marschall, Tobias, Mitchell, Matthew W., McDaniel, Jennifer, Nie, Fan, Zhang, Guojie, and Li, Heng
- Abstract
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent–child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.
- Published
- 2022
17. Haplotype-aware single-cell multiomics uncovers functional effects of somatic structural variation
- Author
-
Jeong, Hyobin, primary, Grimes, Karen, additional, Bruch, Peter-Martin, additional, Rausch, Tobias, additional, Hasenfeld, Patrick, additional, Sabarinathan, Radhakrishnan, additional, Porubsky, David, additional, Herbst, Sophie A., additional, Erarslan-Uysal, Büşra, additional, Jann, Johann-Christoph, additional, Marschall, Tobias, additional, Nowak, Daniel, additional, Bourquin, Jean-Pierre, additional, Kulozik, Andreas E., additional, Dietrich, Sascha, additional, Bornhauser, Beat, additional, Sanders, Ashley D., additional, and Korbel, Jan O., additional
- Published
- 2021
- Full Text
- View/download PDF
18. Haplotype-resolved diverse human genomes and integrated analysis of structural variation
- Author
-
Ebert, Peter, primary, Audano, Peter A., additional, Zhu, Qihui, additional, Rodriguez-Martin, Bernardo, additional, Porubsky, David, additional, Bonder, Marc Jan, additional, Sulovari, Arvis, additional, Ebler, Jana, additional, Zhou, Weichen, additional, Serra Mari, Rebecca, additional, Yilmaz, Feyza, additional, Zhao, Xuefang, additional, Hsieh, PingHsun, additional, Lee, Joyce, additional, Kumar, Sushant, additional, Lin, Jiadong, additional, Rausch, Tobias, additional, Chen, Yu, additional, Ren, Jingwen, additional, Santamarina, Martin, additional, Höps, Wolfram, additional, Ashraf, Hufsah, additional, Chuang, Nelson T., additional, Yang, Xiaofei, additional, Munson, Katherine M., additional, Lewis, Alexandra P., additional, Fairley, Susan, additional, Tallon, Luke J., additional, Clarke, Wayne E., additional, Basile, Anna O., additional, Byrska-Bishop, Marta, additional, Corvelo, André, additional, Evani, Uday S., additional, Lu, Tsung-Yu, additional, Chaisson, Mark J. P., additional, Chen, Junjie, additional, Li, Chong, additional, Brand, Harrison, additional, Wenger, Aaron M., additional, Ghareghani, Maryam, additional, Harvey, William T., additional, Raeder, Benjamin, additional, Hasenfeld, Patrick, additional, Regier, Allison A., additional, Abel, Haley J., additional, Hall, Ira M., additional, Flicek, Paul, additional, Stegle, Oliver, additional, Gerstein, Mark B., additional, Tubio, Jose M. C., additional, Mu, Zepeng, additional, Li, Yang I., additional, Shi, Xinghua, additional, Hastie, Alex R., additional, Ye, Kai, additional, Chong, Zechen, additional, Sanders, Ashley D., additional, Zody, Michael C., additional, Talkowski, Michael E., additional, Mills, Ryan E., additional, Devine, Scott E., additional, Lee, Charles, additional, Korbel, Jan O., additional, Marschall, Tobias, additional, and Eichler, Evan E., additional
- Published
- 2021
- Full Text
- View/download PDF
19. De novo assembly of 64 haplotype-resolved human genomes of diverse ancestry and integrated analysis of structural variation
- Author
-
Ebert, Peter, primary, Audano, Peter A., additional, Zhu, Qihui, additional, Rodriguez-Martin, Bernardo, additional, Porubsky, David, additional, Bonder, Marc Jan, additional, Sulovari, Arvis, additional, Ebler, Jana, additional, Zhou, Weichen, additional, Mari, Rebecca Serra, additional, Yilmaz, Feyza, additional, Zhao, Xuefang, additional, Hsieh, PingHsun, additional, Lee, Joyce, additional, Kumar, Sushant, additional, Lin, Jiadong, additional, Rausch, Tobias, additional, Chen, Yu, additional, Ren, Jingwen, additional, Santamarina, Martin, additional, Höps, Wolfram, additional, Ashraf, Hufsah, additional, Chuang, Nelson T., additional, Yang, Xiaofei, additional, Munson, Katherine M., additional, Lewis, Alexandra P., additional, Fairley, Susan, additional, Tallon, Luke J., additional, Clarke, Wayne E., additional, Basile, Anna O., additional, Byrska-Bishop, Marta, additional, Corvelo, André, additional, Chaisson, Mark J.P., additional, Chen, Junjie, additional, Li, Chong, additional, Brand, Harrison, additional, Wenger, Aaron M., additional, Ghareghani, Maryam, additional, Harvey, William T., additional, Raeder, Benjamin, additional, Hasenfeld, Patrick, additional, Regier, Allison, additional, Abel, Haley, additional, Hall, Ira, additional, Flicek, Paul, additional, Stegle, Oliver, additional, Gerstein, Mark B., additional, Tubio, Jose M.C., additional, Mu, Zepeng, additional, Li, Yang I., additional, Shi, Xinghua, additional, Hastie, Alex R., additional, Ye, Kai, additional, Chong, Zechen, additional, Sanders, Ashley D., additional, Zody, Michael C., additional, Talkowski, Michael E., additional, Mills, Ryan E., additional, Devine, Scott E., additional, Lee, Charles, additional, Korbel, Jan O., additional, Marschall, Tobias, additional, and Eichler, Evan E., additional
- Published
- 2020
- Full Text
- View/download PDF
20. Haplotype-resolved inversion landscape reveals hotspots of mutational recurrence associated with genomic disorders
- Author
-
Porubsky, David, Höps, Wolfram, Ashraf, Hufsah, Hsieh, PingHsun, Rodriguez-Martin, Bernardo, Yilmaz, Feyza, Ebler, Jana, Hallast, Pille, Maria Maggiolini, Flavia Angela, Harvey, William T., Henning, Barbara, Audano, Peter A., Gordon, David S., Ebert, Peter, Hasenfeld, Patrick, Benito, Eva, Zhu, Qihui, Lee, Charles, Antonacci, Francesca, Steinrücken, Matthias, Beck, Christine R., Sanders, Ashley D., Marschall, Tobias, Eichler, Evan E., and Korbel, Jan O.
- Abstract
Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions -4 per locus and generation. Recurrent inversions exhibit a sex- chromosomal bias, and significantly co-localize to the critical regions of genomic disorders. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes to disease- causing CNVs.
- Full Text
- View/download PDF
21. Complex genetic variation in nearly complete human genomes.
- Author
-
Logsdon GA, Ebert P, Audano PA, Loftus M, Porubsky D, Ebler J, Yilmaz F, Hallast P, Prodanov T, Yoo D, Paisie CA, Harvey WT, Zhao X, Martino GV, Henglin M, Munson KM, Rabbani K, Chin CS, Gu B, Ashraf H, Austine-Orimoloye O, Balachandran P, Bonder MJ, Cheng H, Chong Z, Crabtree J, Gerstein M, Guethlein LA, Hasenfeld P, Hickey G, Hoekzema K, Hunt SE, Jensen M, Jiang Y, Koren S, Kwon Y, Li C, Li H, Li J, Norman PJ, Oshima KK, Paten B, Phillippy AM, Pollock NR, Rausch T, Rautiainen M, Scholz S, Song Y, Söylev A, Sulovari A, Surapaneni L, Tsapalou V, Zhou W, Zhou Y, Zhu Q, Zody MC, Mills RE, Devine SE, Shi X, Talkowski ME, Chaisson MJP, Dilthey AT, Konkel MK, Korbel JO, Lee C, Beck CR, Eichler EE, and Marschall T
- Abstract
Diverse sets of complete human genomes are required to construct a pangenome reference and to understand the extent of complex structural variation. Here, we sequence 65 diverse human genomes and build 130 haplotype-resolved assemblies (130 Mbp median continuity), closing 92% of all previous assembly gaps
1,2 and reaching telomere-to-telomere (T2T) status for 39% of the chromosomes. We highlight complete sequence continuity of complex loci, including the major histocompatibility complex (MHC), SMN1 / SMN2 , NBPF8 , and AMY1/AMY2 , and fully resolve 1,852 complex structural variants (SVs). In addition, we completely assemble and validate 1,246 human centromeres. We find up to 30-fold variation in α-satellite high-order repeat (HOR) array length and characterize the pattern of mobile element insertions into α-satellite HOR arrays. While most centromeres predict a single site of kinetochore attachment, epigenetic analysis suggests the presence of two hypomethylated regions for 7% of centromeres. Combining our data with the draft pangenome reference1 significantly enhances genotyping accuracy from short-read data, enabling whole-genome inference3 to a median quality value (QV) of 45. Using this approach, 26,115 SVs per sample are detected, substantially increasing the number of SVs now amenable to downstream disease association studies., Competing Interests: Competing Interests E.E.E. is a scientific advisory board member of Variant Bio, Inc. C. Lee is a scientific advisory board member of Nabsys and Genome Insight. S.K. has received travel funds to speak at events hosted by Oxford Nanopore Technologies. The following authors have previously disclosed a patent application (No. EP19169090) relevant to Strand-seq: J.O.K., T.M., and D.P. The other authors declare no competing interests.- Published
- 2024
- Full Text
- View/download PDF
22. Break-induced replication underlies formation of inverted triplications and generates unexpected diversity in haplotype structures.
- Author
-
Grochowski CM, Bengtsson JD, Du H, Gandhi M, Lun MY, Mehaffey MG, Park K, Höps W, Benito-Garagorri E, Hasenfeld P, Korbel JO, Mahmoud M, Paulin LF, Jhangiani SN, Muzny DM, Fatih JM, Gibbs RA, Pendleton M, Harrington E, Juul S, Lindstrand A, Sedlazeck FJ, Pehlivan D, Lupski JR, and Carvalho CMB
- Abstract
Background: The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a type of complex genomic rearrangement (CGR) hypothesized to result from replicative repair of DNA due to replication fork collapse. It is often mediated by a pair of inverted low-copy repeats (LCR) followed by iterative template switches resulting in at least two breakpoint junctions in cis . Although it has been identified as an important mutation signature of pathogenicity for genomic disorders and cancer genomes, its architecture remains unresolved and is predicted to display at least four structural variation (SV) haplotypes., Results: Here we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the genomic DNA of 24 patients with neurodevelopmental disorders identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted SV haplotypes. Using a combination of short-read genome sequencing (GS), long- read GS, optical genome mapping and StrandSeq the haplotype structure was resolved in 18 samples. This approach refined the point of template switching between inverted LCRs in 4 samples revealing a DNA segment of ∼2.2-5.5 kb of 100% nucleotide similarity. A prediction model was developed to infer the LCR used to mediate the non-allelic homology repair., Conclusions: These data provide experimental evidence supporting the hypothesis that inverted LCRs act as a recombinant substrate in replication-based repair mechanisms. Such inverted repeats are particularly relevant for formation of copy-number associated inversions, including the DUP-TRP/INV-DUP structures. Moreover, this type of CGR can result in multiple conformers which contributes to generate diverse SV haplotypes in susceptible loci .
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.