757 results on '"Flicek, P."'
Search Results
2. MR imaging of benign vulvar lesions: a pictorial essay
- Author
-
Jang, Samuel, Kim, Bohyun, K. Venkatesh, Sudhakar, and T. Flicek, Kristina
- Published
- 2024
- Full Text
- View/download PDF
3. DNA methylation patterns of transcription factor binding regions characterize their functional and evolutionary contexts
- Author
-
Rimoldi, Martina, Wang, Ning, Zhang, Jilin, Villar, Diego, Odom, Duncan T., Taipale, Jussi, Flicek, Paul, and Roller, Maša
- Published
- 2024
- Full Text
- View/download PDF
4. Pictorial essay: improving diagnostic effectiveness of colorectal cancer at CT
- Author
-
Flicek, Kristina T. and Johnson, C. Dan
- Published
- 2024
- Full Text
- View/download PDF
5. Gaps and complex structurally variant loci in phased genome assemblies
- Author
-
Porubsky, David, Vollger, Mitchell R, Harvey, William T, Rozanski, Allison N, Ebert, Peter, Hickey, Glenn, Hasenfeld, Patrick, Sanders, Ashley D, Stober, Catherine, Consortium, Human Pangenome Reference, Korbel, Jan O, Paten, Benedict, Marschall, Tobias, Eichler, Evan E, Abel, Haley J, Antonacci-Fulton, Lucinda L, Asri, Mobin, Baid, Gunjan, Baker, Carl A, Belyaeva, Anastasiya, Billis, Konstantinos, Bourque, Guillaume, Buonaiuto, Silvia, Carroll, Andrew, Chaisson, Mark JP, Chang, Pi-Chuan, Chang, Xian H, Cheng, Haoyu, Chu, Justin, Cody, Sarah, Colonna, Vincenza, Cook, Daniel E, Cook-Deegan, Robert M, Cornejo, Omar E, Diekhans, Mark, Doerr, Daniel, Ebler, Jana, Eizenga, Jordan M, Fairley, Susan, Fedrigo, Olivier, Felsenfeld, Adam L, Feng, Xiaowen, Fischer, Christian, Flicek, Paul, Formenti, Giulio, Frankish, Adam, Fulton, Robert S, Gao, Yan, Garg, Shilpa, Garrison, Erik, Garrison, Nanibaa’ A, Giron, Carlos Garcia, Green, Richard E, Groza, Cristian, Guarracino, Andrea, Haggerty, Leanne, Hall, Ira M, Haukness, Marina, Haussler, David, Heumos, Simon, Hoekzema, Kendra, Hourlier, Thibaut, Howe, Kerstin, Jain, Miten, Jarvis, Erich D, Ji, Hanlee P, Kenny, Eimear E, Koenig, Barbara A, Kolesnikov, Alexey, Kordosky, Jennifer, Koren, Sergey, Lee, HoJoon, Lewis, Alexandra P, Li, Heng, Liao, Wen-Wei, Lu, Shuangjia, Lu, Tsung-Yu, Lucas, Julian K, Magalhães, Hugo, Marco-Sola, Santiago, Marijon, Pierre, Markello, Charles, Martin, Fergal J, McCartney, Ann, McDaniel, Jennifer, Miga, Karen H, Mitchell, Matthew W, Monlong, Jean, Mountcastle, Jacquelyn, Munson, Katherine M, Mwaniki, Moses Njagi, Nattestad, Maria, Novak, Adam M, and Nurk, Sergey
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Humans ,DNA ,Satellite ,Polymorphism ,Genetic ,Haplotypes ,Segmental Duplications ,Genomic ,Sequence Analysis ,DNA ,Human Pangenome Reference Consortium ,Medical and Health Sciences ,Bioinformatics - Abstract
There has been tremendous progress in phased genome assembly production by combining long-read data with parental information or linked-read data. Nevertheless, a typical phased genome assembly generated by trio-hifiasm still generates more than 140 gaps. We perform a detailed analysis of gaps, assembly breaks, and misorientations from 182 haploid assemblies obtained from a diversity panel of 77 unique human samples. Although trio-based approaches using HiFi are the current gold standard, chromosome-wide phasing accuracy is comparable when using Strand-seq instead of parental data. Importantly, the majority of assembly gaps cluster near the largest and most identical repeats (including segmental duplications [35.4%], satellite DNA [22.3%], or regions enriched in GA/AT-rich DNA [27.4%]). Consequently, 1513 protein-coding genes overlap assembly gaps in at least one haplotype, and 231 are recurrently disrupted or missing from five or more haplotypes. Furthermore, we estimate that 6-7 Mbp of DNA are misorientated per haplotype irrespective of whether trio-free or trio-based approaches are used. Of these misorientations, 81% correspond to bona fide large inversion polymorphisms in the human species, most of which are flanked by large segmental duplications. We also identify large-scale alignment discontinuities consistent with 11.9 Mbp of deletions and 161.4 Mbp of insertions per haploid genome. Although 99% of this variation corresponds to satellite DNA, we identify 230 regions of euchromatic DNA with frequent expansions and contractions, nearly half of which overlap with 197 protein-coding genes. Such variable and incompletely assembled regions are important targets for future algorithmic development and pangenome representation.
- Published
- 2023
6. The Type 2 Diabetes Knowledge Portal: An open access genetic resource dedicated to type 2 diabetes and related traits
- Author
-
Costanzo, Maria C, von Grotthuss, Marcin, Massung, Jeffrey, Jang, Dongkeun, Caulkins, Lizz, Koesterer, Ryan, Gilbert, Clint, Welch, Ryan P, Kudtarkar, Parul, Hoang, Quy, Boughton, Andrew P, Singh, Preeti, Sun, Ying, Duby, Marc, Moriondo, Annie, Nguyen, Trang, Smadbeck, Patrick, Alexander, Benjamin R, Brandes, MacKenzie, Carmichael, Mary, Dornbos, Peter, Green, Todd, Huellas-Bruskiewicz, Kenneth C, Ji, Yue, Kluge, Alexandria, McMahon, Aoife C, Mercader, Josep M, Ruebenacker, Oliver, Sengupta, Sebanti, Spalding, Dylan, Taliun, Daniel, Consortium, AMP-T2D, Abecasis, Gonçalo, Akolkar, Beena, Allred, Nicholette D, Altshuler, David, Below, Jennifer E, Bergman, Richard, Beulens, Joline WJ, Blangero, John, Boehnke, Michael, Bokvist, Krister, Bottinger, Erwin, Bowden, Donald, Brosnan, M Julia, Brown, Christopher, Bruskiewicz, Kenneth, Burtt, Noël P, Cebola, Inês, Chambers, John, Chen, Yii-Der Ida, Cherkas, Andriy, Chu, Audrey Y, Clark, Christopher, Claussnitzer, Melina, Cox, Nancy J, Hoed, Marcel den, Dong, Duc, Duggirala, Ravindranath, Dupuis, Josée, Elders, Petra JM, Engreitz, Jesse M, Fauman, Eric, Ferrer, Jorge, Flannick, Jason, Flicek, Paul, Flickinger, Matthew, Florez, Jose C, Fox, Caroline S, Frayling, Timothy M, Frazer, Kelly A, Gaulton, Kyle J, Gloyn, Anna L, Hanis, Craig L, Hanson, Robert, Hattersley, Andrew T, Im, Hae Kyung, Iqbal, Sidra, Jacobs, Suzanne BR, Jang, Dong-Keun, Jordan, Tad, Kamphaus, Tania, Karpe, Fredrik, Keane, Thomas M, Kim, Seung K, Lage, Kasper, Lange, Leslie A, and Lazar, Mitchell
- Subjects
Genetics ,Diabetes ,Human Genome ,Metabolic and endocrine ,Good Health and Well Being ,Humans ,Diabetes Mellitus ,Type 2 ,Access to Information ,Prospective Studies ,Genomics ,Phenotype ,AMP-T2D Consortium ,CMDKP ,GWAS ,T2DKP ,data sharing ,diabetes ,effector genes ,genetic associations ,genetic support ,genomics ,portal ,Biochemistry and Cell Biology ,Medical Biochemistry and Metabolomics ,Endocrinology & Metabolism - Abstract
Associations between human genetic variation and clinical phenotypes have become a foundation of biomedical research. Most repositories of these data seek to be disease-agnostic and therefore lack disease-focused views. The Type 2 Diabetes Knowledge Portal (T2DKP) is a public resource of genetic datasets and genomic annotations dedicated to type 2 diabetes (T2D) and related traits. Here, we seek to make the T2DKP more accessible to prospective users and more useful to existing users. First, we evaluate the T2DKP's comprehensiveness by comparing its datasets with those of other repositories. Second, we describe how researchers unfamiliar with human genetic data can begin using and correctly interpreting them via the T2DKP. Third, we describe how existing users can extend their current workflows to use the full suite of tools offered by the T2DKP. We finally discuss the lessons offered by the T2DKP toward the goal of democratizing access to complex disease genetic results.
- Published
- 2023
7. DNA methylation patterns of transcription factor binding regions characterize their functional and evolutionary contexts
- Author
-
Martina Rimoldi, Ning Wang, Jilin Zhang, Diego Villar, Duncan T. Odom, Jussi Taipale, Paul Flicek, and Maša Roller
- Subjects
DNA methylation ,Transcription factor binding ,Evolution ,Mammals ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Background DNA methylation is an important epigenetic modification which has numerous roles in modulating genome function. Its levels are spatially correlated across the genome, typically high in repressed regions but low in transcription factor (TF) binding sites and active regulatory regions. However, the mechanisms establishing genome-wide and TF binding site methylation patterns are still unclear. Results Here we use a comparative approach to investigate the association of DNA methylation to TF binding evolution in mammals. Specifically, we experimentally profile DNA methylation and combine this with published occupancy profiles of five distinct TFs (CTCF, CEBPA, HNF4A, ONECUT1, FOXA1) in the liver of five mammalian species (human, macaque, mouse, rat, dog). TF binding sites are lowly methylated, but they often also have intermediate methylation levels. Furthermore, biding sites are influenced by the methylation status of CpGs in their wider binding regions even when CpGs are absent from the core binding motif. Employing a classification and clustering approach, we extract distinct and species-conserved patterns of DNA methylation levels at TF binding regions. CEBPA, HNF4A, ONECUT1, and FOXA1 share the same methylation patterns, while CTCF's differ. These patterns characterize alternative functions and chromatin landscapes of TF-bound regions. Leveraging our phylogenetic framework, we find DNA methylation gain upon evolutionary loss of TF occupancy, indicating coordinated evolution. Furthermore, each methylation pattern has its own evolutionary trajectory reflecting its genomic contexts. Conclusions Our epigenomic analyses indicate a role for DNA methylation in TF binding changes across species including that specific DNA methylation profiles characterize TF binding and are associated with their regulatory activity, chromatin contexts, and evolutionary trajectories.
- Published
- 2024
- Full Text
- View/download PDF
8. Clinical and Radiographic Characteristics in Segmental Colitis Associated With Diverticulosis, Diverticulitis, and Crohn’s Disease
- Author
-
Siri A. Urquhart, Matthew W. Ewy, Kristina T. Flicek, Jeff L. Fidler, Shannon P. Sheedy, William S. Harmsen, Victor G. Chedid, and Nayantara Coelho-Prabhu
- Subjects
Segmental Colitis Associated With Diverticulosis ,SCAD ,Diverticulitis ,Inflammatory Bowel Disease ,Crohn’s Disease ,Diseases of the digestive system. Gastroenterology ,RC799-869 - Abstract
Background and Aims: Segmental colitis associated with diverticulosis (SCAD) is an underrecognized disease characterized by chronic segmental inflammation surrounding colonic diverticula. SCAD is recognized as an autonomous entity, although shares similar pathogenic and therapeutic aspects to inflammatory bowel disease and may be considered a complication of diverticulitis. We aimed to characterize the clinical, endoscopic, and radiographic findings of SCAD and its potential overlap with diverticulitis and inflammatory bowel disease. Methods: All patients with suspected diagnosis of SCAD were identified using a bioinformatics search tool from January 1996 to October 2021 at our institution. Relevant demographic, clinical, endoscopic, and radiographic data were abstracted. Disease-related outcomes and radiographic characteristics were determined. Results: Seventy-five patients with SCAD were included (48.0% female) with average age at diagnosis 62.5 years. Thirty-seven (49.3%) had a prior episode of diverticulitis. The most common presenting symptoms were abdominal pain (33.3%) and hematochezia (22.7%). Antibiotics (42.7%) and mesalamine (36.0%) were most used as first-line treatment options. Twenty (26.7%) required surgical intervention. The most common initial endoscopic finding was isolated sigmoid inflammation (86.7%). Fifty-one patients with confirmed SCAD, 72 with diverticulitis, and 12 with Crohn’s disease (CD) had imaging available for review. Penetrating disease was seen in 7 (13.7%) with SCAD compared to 7 (9.7%) and 2 (16.6%) with diverticulitis and CD, respectively (P = .14). Blinded radiologists diagnosed SCAD, CD, or diverticulitis correctly in 43.8%, 8.3%, and 27.1%, respectively. Conclusion: SCAD should be considered when isolated sigmoid colon inflammation is seen on cross-sectional imaging. Penetrating disease is not a specific radiologic feature for either SCAD or diverticulitis. Further prospective studies are needed to correlate imaging characteristics with endoscopic findings to better describe radiographic features in SCAD.
- Published
- 2024
- Full Text
- View/download PDF
9. Standardized annotation of translated open reading frames
- Author
-
Mudge, Jonathan M, Ruiz-Orera, Jorge, Prensner, John R, Brunet, Marie A, Calvet, Ferriol, Jungreis, Irwin, Gonzalez, Jose Manuel, Magrane, Michele, Martinez, Thomas F, Schulz, Jana Felicitas, Yang, Yucheng T, Albà, M Mar, Aspden, Julie L, Baranov, Pavel V, Bazzini, Ariel A, Bruford, Elspeth, Martin, Maria Jesus, Calviello, Lorenzo, Carvunis, Anne-Ruxandra, Chen, Jin, Couso, Juan Pablo, Deutsch, Eric W, Flicek, Paul, Frankish, Adam, Gerstein, Mark, Hubner, Norbert, Ingolia, Nicholas T, Kellis, Manolis, Menschaert, Gerben, Moritz, Robert L, Ohler, Uwe, Roucou, Xavier, Saghatelian, Alan, Weissman, Jonathan S, and van Heesch, Sebastiaan
- Subjects
Molecular Sequence Annotation ,Open Reading Frames ,Protein Biosynthesis ,Ribosomes - Published
- 2022
10. Author Correction: Perspectives on ENCODE
- Author
-
Snyder, Michael P, Gingeras, Thomas R, Moore, Jill E, Weng, Zhiping, Gerstein, Mark B, Ren, Bing, Hardison, Ross C, Stamatoyannopoulos, John A, Graveley, Brenton R, Feingold, Elise A, Pazin, Michael J, Pagan, Michael, Gilchrist, Daniel A, Hitz, Benjamin C, Cherry, J Michael, Bernstein, Bradley E, Mendenhall, Eric M, Zerbino, Daniel R, Frankish, Adam, Flicek, Paul, and Myers, Richard M
- Subjects
ENCODE Project Consortium ,General Science & Technology - Abstract
In this Article, the authors Rizi Ai (Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA) and Shantao Li (Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA) were mistakenly omitted from the ENCODE Project Consortium author list. The original Article has been corrected online.
- Published
- 2022
11. The Human Pangenome Project: a global resource to map genomic diversity
- Author
-
Wang, Ting, Antonacci-Fulton, Lucinda, Howe, Kerstin, Lawson, Heather A, Lucas, Julian K, Phillippy, Adam M, Popejoy, Alice B, Asri, Mobin, Carson, Caryn, Chaisson, Mark JP, Chang, Xian, Cook-Deegan, Robert, Felsenfeld, Adam L, Fulton, Robert S, Garrison, Erik P, Garrison, Nanibaa’ A, Graves-Lindsay, Tina A, Ji, Hanlee, Kenny, Eimear E, Koenig, Barbara A, Li, Daofeng, Marschall, Tobias, McMichael, Joshua F, Novak, Adam M, Purushotham, Deepak, Schneider, Valerie A, Schultz, Baergen I, Smith, Michael W, Sofia, Heidi J, Weissman, Tsachy, Flicek, Paul, Li, Heng, Miga, Karen H, Paten, Benedict, Jarvis, Erich D, Hall, Ira M, Eichler, Evan E, and Haussler, David
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Human Genome ,Generic health relevance ,Genome ,Human ,Genomics ,Haplotypes ,High-Throughput Nucleotide Sequencing ,Humans ,Sequence Analysis ,DNA ,Human Pangenome Reference Consortium ,General Science & Technology - Abstract
The human reference genome is the most widely used resource in human genetics and is due for a major update. Its current structure is a linear composite of merged haplotypes from more than 20 people, with a single individual comprising most of the sequence. It contains biases and errors within a framework that does not represent global human genomic variation. A high-quality reference with global representation of common variants, including single-nucleotide variants, structural variants and functional elements, is needed. The Human Pangenome Reference Consortium aims to create a more sophisticated and complete human reference genome with a graph-based, telomere-to-telomere representation of global genomic diversity. Here we leverage innovations in technology, study design and global partnerships with the goal of constructing the highest-possible quality human pangenome reference. Our goal is to improve data representation and streamline analyses to enable routine assembly of complete diploid genomes. With attention to ethical frameworks, the human pangenome reference will contain a more accurate and diverse representation of global genomic variation, improve gene-disease association studies across populations, expand the scope of genomics research to the most repetitive and polymorphic regions of the genome, and serve as the ultimate genetic resource for future biomedical research and precision medicine.
- Published
- 2022
12. GET_PANGENES: calling pangenes from plant genome alignments confirms presence-absence variation
- Author
-
Bruno Contreras-Moreira, Shradha Saraf, Guy Naamati, Ana M. Casas, Sandeep S. Amberkar, Paul Flicek, Andrew R. Jones, and Sarah Dyer
- Subjects
Pangene ,Plant genome ,Gene annotation ,Collinearity ,Whole genome alignment ,Presence-absence variation ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Crop pangenomes made from individual cultivar assemblies promise easy access to conserved genes, but genome content variability and inconsistent identifiers hamper their exploration. To address this, we define pangenes, which summarize a species coding potential and link back to original annotations. The protocol get_pangenes performs whole genome alignments (WGA) to call syntenic gene models based on coordinate overlaps. A benchmark with small and large plant genomes shows that pangenes recapitulate phylogeny-based orthologies and produce complete soft-core gene sets. Moreover, WGAs support lift-over and help confirm gene presence-absence variation. Source code and documentation: https://github.com/Ensembl/plant-scripts .
- Published
- 2023
- Full Text
- View/download PDF
13. The complete sequence of a human Y chromosome
- Author
-
Rhie, Arang, Nurk, Sergey, Cechova, Monika, Hoyt, Savannah J., Taylor, Dylan J., Altemose, Nicolas, Hook, Paul W., Koren, Sergey, Rautiainen, Mikko, Alexandrov, Ivan A., Allen, Jamie, Asri, Mobin, Bzikadze, Andrey V., Chen, Nae-Chyun, Chin, Chen-Shan, Diekhans, Mark, Flicek, Paul, Formenti, Giulio, Fungtammasan, Arkarachai, Garcia Giron, Carlos, Garrison, Erik, Gershman, Ariel, Gerton, Jennifer L., Grady, Patrick G. S., Guarracino, Andrea, Haggerty, Leanne, Halabian, Reza, Hansen, Nancy F., Harris, Robert, Hartley, Gabrielle A., Harvey, William T., Haukness, Marina, Heinz, Jakob, Hourlier, Thibaut, Hubley, Robert M., Hunt, Sarah E., Hwang, Stephen, Jain, Miten, Kesharwani, Rupesh K., Lewis, Alexandra P., Li, Heng, Logsdon, Glennis A., Lucas, Julian K., Makalowski, Wojciech, Markovic, Christopher, Martin, Fergal J., Mc Cartney, Ann M., McCoy, Rajiv C., McDaniel, Jennifer, McNulty, Brandy M., Medvedev, Paul, Mikheenko, Alla, Munson, Katherine M., Murphy, Terence D., Olsen, Hugh E., Olson, Nathan D., Paulin, Luis F., Porubsky, David, Potapova, Tamara, Ryabov, Fedor, Salzberg, Steven L., Sauria, Michael E. G., Sedlazeck, Fritz J., Shafin, Kishwar, Shepelev, Valery A., Shumate, Alaina, Storer, Jessica M., Surapaneni, Likhitha, Taravella Oill, Angela M., Thibaud-Nissen, Françoise, Timp, Winston, Tomaszkiewicz, Marta, Vollger, Mitchell R., Walenz, Brian P., Watwood, Allison C., Weissensteiner, Matthias H., Wenger, Aaron M., Wilson, Melissa A., Zarate, Samantha, Zhu, Yiming, Zook, Justin M., Eichler, Evan E., O’Neill, Rachel J., Schatz, Michael C., Miga, Karen H., Makova, Kateryna D., and Phillippy, Adam M.
- Published
- 2023
- Full Text
- View/download PDF
14. Future directions in imaging pouches
- Author
-
Baker, Mark E, Fletcher, Joel G, Church, James, Hull, Tracy, Dane, Bari, Huang, Chenchen, Flicek, Kristina, and Ream, Justin
- Published
- 2023
- Full Text
- View/download PDF
15. Continent ileostomies: what the radiologist needs to know
- Author
-
Flicek, Kristina T. and Shawki, Sherief F.
- Published
- 2023
- Full Text
- View/download PDF
16. Standards recommendations for the Earth BioGenome Project
- Author
-
Lawniczak, Mara KN, Durbin, Richard, Flicek, Paul, Lindblad-Toh, Kerstin, Wei, Xiaofeng, Archibald, John M, Baker, William J, Belov, Katherine, Blaxter, Mark L, Bonet, Tomas Marques, Childers, Anna K, Coddington, Jonathan A, Crandall, Keith A, Crawford, Andrew J, Davey, Robert P, Di Palma, Federica, Fang, Qi, Haerty, Wilfried, Hall, Neil, Hoff, Katharina J, Howe, Kerstin, Jarvis, Erich D, Johnson, Warren E, Johnson, Rebecca N, Kersey, Paul J, Liu, Xin, Lopez, Jose Victor, Myers, Eugene W, Pettersson, Olga Vinnere, Phillippy, Adam M, Poelchau, Monica F, Pruitt, Kim D, Rhie, Arang, Castilla-Rubio, Juan Carlos, Sahu, Sunil Kumar, Salmon, Nicholas A, Soltis, Pamela S, Swarbreck, David, Thibaud-Nissen, Françoise, Wang, Sibo, Wegrzyn, Jill L, Zhang, Guojie, Zhang, He, Lewin, Harris A, and Richards, Stephen
- Subjects
Human Genome ,Genetics ,Animals ,Base Sequence ,Biodiversity ,Eukaryota ,Genomics ,Humans ,Reference Standards ,Reference Values ,Sequence Analysis ,DNA ,Earth BioGenome Project ,genomics ,ethics ,genome assembly - Abstract
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.
- Published
- 2022
17. Analysis of genome-wide knockout mouse database identifies candidate ciliopathy genes
- Author
-
Higgins, Kendall, Moore, Bret A, Berberovic, Zorana, Adissu, Hibret A, Eskandarian, Mohammad, Flenniken, Ann M, Shao, Andy, Imai, Denise M, Clary, Dave, Lanoue, Louise, Newbigging, Susan, Nutter, Lauryl MJ, Adams, David J, Bosch, Fatima, Braun, Robert E, Brown, Steve DM, Dickinson, Mary E, Dobbie, Michael, Flicek, Paul, Gao, Xiang, Galande, Sanjeev, Grobler, Anne, Heaney, Jason D, Herault, Yann, de Angelis, Martin Hrabe, Chin, Hsian-Jean Genie, Mammano, Fabio, Qin, Chuan, Shiroishi, Toshihiko, Sedlacek, Radislav, Seong, J-K, Xu, Ying, Lloyd, KC Kent, McKerlie, Colin, and Moshiri, Ala
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Biomedical and Clinical Sciences ,Genetics ,Biotechnology ,Rare Diseases ,2.1 Biological and endogenous factors ,Mice ,Animals ,Mice ,Knockout ,Ciliopathies ,Gene Knockout Techniques ,Cilia ,Databases ,Factual ,Nerve Tissue Proteins ,Cell Cycle Proteins ,IMPC Consortium - Abstract
We searched a database of single-gene knockout (KO) mice produced by the International Mouse Phenotyping Consortium (IMPC) to identify candidate ciliopathy genes. We first screened for phenotypes in mouse lines with both ocular and renal or reproductive trait abnormalities. The STRING protein interaction tool was used to identify interactions between known cilia gene products and those encoded by the genes in individual knockout mouse strains in order to generate a list of "candidate ciliopathy genes." From this list, 32 genes encoded proteins predicted to interact with known ciliopathy proteins. Of these, 25 had no previously described roles in ciliary pathobiology. Histological and morphological evidence of phenotypes found in ciliopathies in knockout mouse lines are presented as examples (genes Abi2, Wdr62, Ap4e1, Dync1li1, and Prkab1). Phenotyping data and descriptions generated on IMPC mouse line are useful for mechanistic studies, target discovery, rare disease diagnosis, and preclinical therapeutic development trials. Here we demonstrate the effective use of the IMPC phenotype data to uncover genes with no previous role in ciliary biology, which may be clinically relevant for identification of novel disease genes implicated in ciliopathies.
- Published
- 2022
18. GA4GH: International policies and standards for data sharing across genomic research and healthcare
- Author
-
Rehm, Heidi L, Page, Angela JH, Smith, Lindsay, Adams, Jeremy B, Alterovitz, Gil, Babb, Lawrence J, Barkley, Maxmillian P, Baudis, Michael, Beauvais, Michael JS, Beck, Tim, Beckmann, Jacques S, Beltran, Sergi, Bernick, David, Bernier, Alexander, Bonfield, James K, Boughtwood, Tiffany F, Bourque, Guillaume, Bowers, Sarion R, Brookes, Anthony J, Brudno, Michael, Brush, Matthew H, Bujold, David, Burdett, Tony, Buske, Orion J, Cabili, Moran N, Cameron, Daniel L, Carroll, Robert J, Casas-Silva, Esmeralda, Chakravarty, Debyani, Chaudhari, Bimal P, Chen, Shu Hui, Cherry, J Michael, Chung, Justina, Cline, Melissa, Clissold, Hayley L, Cook-Deegan, Robert M, Courtot, Mélanie, Cunningham, Fiona, Cupak, Miro, Davies, Robert M, Denisko, Danielle, Doerr, Megan J, Dolman, Lena I, Dove, Edward S, Dursi, L Jonathan, Dyke, Stephanie OM, Eddy, James A, Eilbeck, Karen, Ellrott, Kyle P, Fairley, Susan, Fakhro, Khalid A, Firth, Helen V, Fitzsimons, Michael S, Fiume, Marc, Flicek, Paul, Fore, Ian M, Freeberg, Mallory A, Freimuth, Robert R, Fromont, Lauren A, Fuerth, Jonathan, Gaff, Clara L, Gan, Weiniu, Ghanaim, Elena M, Glazer, David, Green, Robert C, Griffith, Malachi, Griffith, Obi L, Grossman, Robert L, Groza, Tudor, Auvil, Jaime M Guidry, Guigó, Roderic, Gupta, Dipayan, Haendel, Melissa A, Hamosh, Ada, Hansen, David P, Hart, Reece K, Hartley, Dean Mitchell, Haussler, David, Hendricks-Sturrup, Rachele M, Ho, Calvin WL, Hobb, Ashley E, Hoffman, Michael M, Hofmann, Oliver M, Holub, Petr, Hsu, Jacob Shujui, Hubaux, Jean-Pierre, Hunt, Sarah E, Husami, Ammar, Jacobsen, Julius O, Jamuar, Saumya S, Janes, Elizabeth L, Jeanson, Francis, Jené, Aina, Johns, Amber L, Joly, Yann, Jones, Steven JM, Kanitz, Alexander, Kato, Kazuto, Keane, Thomas M, and Kekesi-Lafrance, Kristina
- Subjects
Biological Sciences ,Genetics ,Human Genome ,Clinical Research ,Health Services ,Generic health relevance ,Good Health and Well Being - Abstract
The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits.
- Published
- 2021
19. Strategies for improving colorectal cancer detection with routine computed tomography
- Author
-
Johnson, C. Daniel, Flicek, Kristina T., Mead-Harvey, Carolyn, and Quillen, Jaxon K.
- Published
- 2023
- Full Text
- View/download PDF
20. A draft human pangenome reference
- Author
-
Liao, Wen-Wei, Asri, Mobin, Ebler, Jana, Doerr, Daniel, Haukness, Marina, Hickey, Glenn, Lu, Shuangjia, Lucas, Julian K., Monlong, Jean, Abel, Haley J., Buonaiuto, Silvia, Chang, Xian H., Cheng, Haoyu, Chu, Justin, Colonna, Vincenza, Eizenga, Jordan M., Feng, Xiaowen, Fischer, Christian, Fulton, Robert S., Garg, Shilpa, Groza, Cristian, Guarracino, Andrea, Harvey, William T., Heumos, Simon, Howe, Kerstin, Jain, Miten, Lu, Tsung-Yu, Markello, Charles, Martin, Fergal J., Mitchell, Matthew W., Munson, Katherine M., Mwaniki, Moses Njagi, Novak, Adam M., Olsen, Hugh E., Pesout, Trevor, Porubsky, David, Prins, Pjotr, Sibbesen, Jonas A., Sirén, Jouni, Tomlinson, Chad, Villani, Flavia, Vollger, Mitchell R., Antonacci-Fulton, Lucinda L., Baid, Gunjan, Baker, Carl A., Belyaeva, Anastasiya, Billis, Konstantinos, Carroll, Andrew, Chang, Pi-Chuan, Cody, Sarah, Cook, Daniel E., Cook-Deegan, Robert M., Cornejo, Omar E., Diekhans, Mark, Ebert, Peter, Fairley, Susan, Fedrigo, Olivier, Felsenfeld, Adam L., Formenti, Giulio, Frankish, Adam, Gao, Yan, Garrison, Nanibaa’ A., Giron, Carlos Garcia, Green, Richard E., Haggerty, Leanne, Hoekzema, Kendra, Hourlier, Thibaut, Ji, Hanlee P., Kenny, Eimear E., Koenig, Barbara A., Kolesnikov, Alexey, Korbel, Jan O., Kordosky, Jennifer, Koren, Sergey, Lee, HoJoon, Lewis, Alexandra P., Magalhães, Hugo, Marco-Sola, Santiago, Marijon, Pierre, McCartney, Ann, McDaniel, Jennifer, Mountcastle, Jacquelyn, Nattestad, Maria, Nurk, Sergey, Olson, Nathan D., Popejoy, Alice B., Puiu, Daniela, Rautiainen, Mikko, Regier, Allison A., Rhie, Arang, Sacco, Samuel, Sanders, Ashley D., Schneider, Valerie A., Schultz, Baergen I., Shafin, Kishwar, Smith, Michael W., Sofia, Heidi J., Abou Tayoun, Ahmad N., Thibaud-Nissen, Françoise, Tricomi, Francesca Floriana, Wagner, Justin, Walenz, Brian, Wood, Jonathan M. D., Zimin, Aleksey V., Bourque, Guillaume, Chaisson, Mark J. P., Flicek, Paul, Phillippy, Adam M., Zook, Justin M., Eichler, Evan E., Haussler, David, Wang, Ting, Jarvis, Erich D., Miga, Karen H., Garrison, Erik, Marschall, Tobias, Hall, Ira M., Li, Heng, and Paten, Benedict
- Published
- 2023
- Full Text
- View/download PDF
21. Chemotherapy-associated liver morphological changes in hepatic metastases (CALMCHeM)
- Author
-
Matthew C. Pope, Michael C. Olson, Kristina T. Flicek, Neema J. Patel, Candice W. Bolan, Christine O. Menias, Zhen Wang, and Sudhakar K. Venkatesh
- Subjects
liver metastases ,pseudocirrhosis ,liver tumor burden ,Medical physics. Medical radiology. Nuclear medicine ,R895-920 - Abstract
PURPOSETo review imaging findings in chemotherapy-associated liver morphological changes in hepatic metastases (CALMCHeM) on computed tomography (CT)/magnetic resonance imaging (MRI) and its association with tumor burden.METHODSWe performed a retrospective chart review to identify patients with hepatic metastases who received chemotherapy and subsequent follow-up imaging where CT or MRI showed morphological changes in the liver. The morphological changes searched for were nodularity, capsular retraction, hypodense fibrotic bands, lobulated outline, atrophy or hypertrophy of segments or lobes, widened fissures, and one or more features of portal hypertension (splenomegaly/venous collaterals/ascites). The inclusion criteria were as follows: a) no known chronic liver disease; b) availability of CT or MRI images before chemotherapy that showed no morphological signs of chronic liver disease; c) at least one follow-up CT or MRI image demonstrating CALMCHeM after chemotherapy. Two radiologists in consensus graded the initial hepatic metastases tumor burden according to number (≤10 and >10), lobe distribution (single or both lobes), and liver parenchyma volume affected (10 in 64.4% of patients. The volume of liver involved was
- Published
- 2023
- Full Text
- View/download PDF
22. GET_PANGENES: calling pangenes from plant genome alignments confirms presence-absence variation
- Author
-
Contreras-Moreira, Bruno, Saraf, Shradha, Naamati, Guy, Casas, Ana M., Amberkar, Sandeep S., Flicek, Paul, Jones, Andrew R., and Dyer, Sarah
- Published
- 2023
- Full Text
- View/download PDF
23. Towards complete and error-free genome assemblies of all vertebrate species
- Author
-
Rhie, Arang, McCarthy, Shane A, Fedrigo, Olivier, Damas, Joana, Formenti, Giulio, Koren, Sergey, Uliano-Silva, Marcela, Chow, William, Fungtammasan, Arkarachai, Kim, Juwan, Lee, Chul, Ko, Byung June, Chaisson, Mark, Gedman, Gregory L, Cantin, Lindsey J, Thibaud-Nissen, Francoise, Haggerty, Leanne, Bista, Iliana, Smith, Michelle, Haase, Bettina, Mountcastle, Jacquelyn, Winkler, Sylke, Paez, Sadye, Howard, Jason, Vernes, Sonja C, Lama, Tanya M, Grutzner, Frank, Warren, Wesley C, Balakrishnan, Christopher N, Burt, Dave, George, Julia M, Biegler, Matthew T, Iorns, David, Digby, Andrew, Eason, Daryl, Robertson, Bruce, Edwards, Taylor, Wilkinson, Mark, Turner, George, Meyer, Axel, Kautt, Andreas F, Franchini, Paolo, Detrich, H William, Svardal, Hannes, Wagner, Maximilian, Naylor, Gavin JP, Pippel, Martin, Malinsky, Milan, Mooney, Mark, Simbirsky, Maria, Hannigan, Brett T, Pesout, Trevor, Houck, Marlys, Misuraca, Ann, Kingan, Sarah B, Hall, Richard, Kronenberg, Zev, Sović, Ivan, Dunn, Christopher, Ning, Zemin, Hastie, Alex, Lee, Joyce, Selvaraj, Siddarth, Green, Richard E, Putnam, Nicholas H, Gut, Ivo, Ghurye, Jay, Garrison, Erik, Sims, Ying, Collins, Joanna, Pelan, Sarah, Torrance, James, Tracey, Alan, Wood, Jonathan, Dagnew, Robel E, Guan, Dengfeng, London, Sarah E, Clayton, David F, Mello, Claudio V, Friedrich, Samantha R, Lovell, Peter V, Osipova, Ekaterina, Al-Ajli, Farooq O, Secomandi, Simona, Kim, Heebal, Theofanopoulou, Constantina, Hiller, Michael, Zhou, Yang, Harris, Robert S, Makova, Kateryna D, Medvedev, Paul, Hoffman, Jinna, Masterson, Patrick, Clark, Karen, Martin, Fergal, Howe, Kevin, Flicek, Paul, Walenz, Brian P, Kwak, Woori, and Clawson, Hiram
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Biotechnology ,Generic health relevance ,Animals ,Birds ,Gene Library ,Genome ,Genome Size ,Genome ,Mitochondrial ,Genomics ,Haplotypes ,High-Throughput Nucleotide Sequencing ,Molecular Sequence Annotation ,Sequence Alignment ,Sequence Analysis ,DNA ,Sex Chromosomes ,Vertebrates ,General Science & Technology - Abstract
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
- Published
- 2021
24. Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research.
- Author
-
Kern, Colin, Wang, Ying, Xu, Xiaoqin, Pan, Zhangyuan, Halstead, Michelle, Chanthavixay, Ganrea, Saelao, Perot, Waters, Susan, Xiang, Ruidong, Chamberlain, Amanda, Korf, Ian, Delany, Mary E, Cheng, Hans H, Medrano, Juan F, Van Eenennaam, Alison L, Tuggle, Chris K, Ernst, Catherine, Flicek, Paul, Quon, Gerald, Ross, Pablo, and Zhou, Huaijun
- Subjects
Animals ,Animals ,Domestic ,Chickens ,Cattle ,Swine ,Mice ,Transcription Factors ,Phylogeny ,Organ Specificity ,Gene Expression Regulation ,Epigenesis ,Genetic ,Amino Acid Motifs ,Regulatory Sequences ,Nucleic Acid ,Polymorphism ,Single Nucleotide ,Genome ,Enhancer Elements ,Genetic ,Genome-Wide Association Study ,Epigenomics ,Chromatin Immunoprecipitation Sequencing - Abstract
Gene regulatory elements are central drivers of phenotypic variation and thus of critical importance towards understanding the genetics of complex traits. The Functional Annotation of Animal Genomes consortium was formed to collaboratively annotate the functional elements in animal genomes, starting with domesticated animals. Here we present an expansive collection of datasets from eight diverse tissues in three important agricultural species: chicken (Gallus gallus), pig (Sus scrofa), and cattle (Bos taurus). Comparative analysis of these datasets and those from the human and mouse Encyclopedia of DNA Elements projects reveal that a core set of regulatory elements are functionally conserved independent of divergence between species, and that tissue-specific transcription factor occupancy at regulatory elements and their predicted target genes are also conserved. These datasets represent a unique opportunity for the emerging field of comparative epigenomics, as well as the agricultural research community, including species that are globally important food resources.
- Published
- 2021
25. The Deep Genome Project
- Author
-
Lloyd, KC Kent, Adams, David J, Baynam, Gareth, Beaudet, Arthur L, Bosch, Fatima, Boycott, Kym M, Braun, Robert E, Caulfield, Mark, Cohn, Ronald, Dickinson, Mary E, Dobbie, Michael S, Flenniken, Ann M, Flicek, Paul, Galande, Sanjeev, Gao, Xiang, Grobler, Anne, Heaney, Jason D, Herault, Yann, de Angelis, Martin Hrabě, Lupski, James R, Lyonnet, Stanislas, Mallon, Ann-Marie, Mammano, Fabio, MacRae, Calum A, McInnes, Roderick, McKerlie, Colin, Meehan, Terrence F, Murray, Stephen A, Nutter, Lauryl MJ, Obata, Yuichi, Parkinson, Helen, Pepper, Michael S, Sedlacek, Radislav, Seong, Je Kyung, Shiroishi, Toshihiko, Smedley, Damian, Tocchini-Valentini, Glauco, Valle, David, Wang, Chi-Kuang Leo, Wells, Sara, White, Jacqueline, Wurst, Wolfgang, Xu, Ying, and Brown, Steve DM
- Subjects
Animals ,Genes ,Genome ,Humans ,Mice ,Mutation ,Phenotype ,Proteins ,Environmental Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Published
- 2020
26. Perspectives on ENCODE
- Author
-
Snyder, Michael P, Gingeras, Thomas R, Moore, Jill E, Weng, Zhiping, Gerstein, Mark B, Ren, Bing, Hardison, Ross C, Stamatoyannopoulos, John A, Graveley, Brenton R, Feingold, Elise A, Pazin, Michael J, Pagan, Michael, Gilchrist, Daniel A, Hitz, Benjamin C, Cherry, J Michael, Bernstein, Bradley E, Mendenhall, Eric M, Zerbino, Daniel R, Frankish, Adam, Flicek, Paul, and Myers, Richard M
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Human Genome ,1.1 Normal biological development and functioning ,Animals ,Binding Sites ,Chromatin ,DNA Methylation ,Databases ,Genetic ,Gene Expression Regulation ,Genome ,Genome ,Human ,Genomics ,Histones ,Humans ,Mice ,Molecular Sequence Annotation ,Quality Control ,Regulatory Sequences ,Nucleic Acid ,Transcription Factors ,ENCODE Project Consortium ,General Science & Technology - Abstract
The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated with gene regulation (for example, transcription factor binding sites, open chromatin, and histone marks) and transcript isoforms. The marks serve as sites for candidate cis-regulatory elements (cCREs) that may serve functional roles in regulating gene expression1. The project has been extended to model organisms, particularly the mouse. In the third phase of ENCODE, nearly a million and more than 300,000 cCRE annotations have been generated for human and mouse, respectively, and these have provided a valuable resource for the scientific community.
- Published
- 2020
27. Perspectives on ENCODE.
- Author
-
ENCODE Project Consortium, Snyder, Michael P, Gingeras, Thomas R, Moore, Jill E, Weng, Zhiping, Gerstein, Mark B, Ren, Bing, Hardison, Ross C, Stamatoyannopoulos, John A, Graveley, Brenton R, Feingold, Elise A, Pazin, Michael J, Pagan, Michael, Gilchrist, Daniel A, Hitz, Benjamin C, Cherry, J Michael, Bernstein, Bradley E, Mendenhall, Eric M, Zerbino, Daniel R, Frankish, Adam, Flicek, Paul, and Myers, Richard M
- Subjects
ENCODE Project Consortium ,Chromatin ,Animals ,Humans ,Mice ,Histones ,Transcription Factors ,Genomics ,DNA Methylation ,Gene Expression Regulation ,Binding Sites ,Regulatory Sequences ,Nucleic Acid ,Genome ,Genome ,Human ,Quality Control ,Databases ,Genetic ,Molecular Sequence Annotation ,Human Genome ,Vaccine Related ,Biotechnology ,Genetics ,Immunization ,Vaccine Related (AIDS) ,Prevention ,1.1 Normal biological development and functioning ,Generic health relevance ,General Science & Technology - Abstract
The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated with gene regulation (for example, transcription factor binding sites, open chromatin, and histone marks) and transcript isoforms. The marks serve as sites for candidate cis-regulatory elements (cCREs) that may serve functional roles in regulating gene expression1. The project has been extended to model organisms, particularly the mouse. In the third phase of ENCODE, nearly a million and more than 300,000 cCRE annotations have been generated for human and mouse, respectively, and these have provided a valuable resource for the scientific community.
- Published
- 2020
28. Soft windowing application to improve analysis of high-throughput phenotyping data
- Author
-
Haselimashhadi, Hamed, Mason, Jeremy C, Munoz-Fuentes, Violeta, López-Gómez, Federico, Babalola, Kolawole, Acar, Elif F, Kumar, Vivek, White, Jacqui, Flenniken, Ann M, King, Ruairidh, Straiton, Ewan, Seavitt, John Richard, Gaspero, Angelina, Garza, Arturo, Christianson, Audrey E, Hsu, Chih-Wei, Reynolds, Corey L, Lanza, Denise G, Lorenzo, Isabel, Green, Jennie R, Gallegos, Juan J, Bohat, Ritu, Samaco, Rodney C, Veeraragavan, Surabi, Kim, Jong Kyoung, Miller, Gregor, Fuchs, Helmult, Garrett, Lillian, Becker, Lore, Kang, Yeon Kyung, Clary, David, Cho, Soo Young, Tamura, Masaru, Tanaka, Nobuhiko, Soo, Kyung Dong, Bezginov, Alexandr, About, Ghina Bou, Champy, Marie-France, Vasseur, Laurent, Leblanc, Sophie, Meziane, Hamid, Selloum, Mohammed, Reilly, Patrick T, Spielmann, Nadine, Maier, Holger, Gailus-Durner, Valerie, Sorg, Tania, Hiroshi, Masuya, Yuichi, Obata, Heaney, Jason D, Dickinson, Mary E, Wolfgang, Wurst, Tocchini-Valentini, Glauco P, Lloyd, Kevin C Kent, McKerlie, Colin, Seong, Je Kyung, Yann, Herault, de Angelis, Martin Hrabé, Brown, Steve DM, Smedley, Damian, Flicek, Paul, Mallon, Ann-Marie, Parkinson, Helen, and Meehan, Terrence F
- Subjects
Biological Sciences ,Genetics ,Animals ,Genetic Association Studies ,Humans ,Mice ,Phenotype ,Population Health ,Software ,Mathematical Sciences ,Information and Computing Sciences ,Bioinformatics ,Biological sciences ,Information and computing sciences ,Mathematical sciences - Abstract
MotivationHigh-throughput phenomic projects generate complex data from small treatment and large control groups that increase the power of the analyses but introduce variation over time. A method is needed to utlize a set of temporally local controls that maximizes analytic power while minimizing noise from unspecified environmental factors.ResultsHere we introduce 'soft windowing', a methodological approach that selects a window of time that includes the most appropriate controls for analysis. Using phenotype data from the International Mouse Phenotyping Consortium (IMPC), adaptive windows were applied such that control data collected proximally to mutants were assigned the maximal weight, while data collected earlier or later had less weight. We applied this method to IMPC data and compared the results with those obtained from a standard non-windowed approach. Validation was performed using a resampling approach in which we demonstrate a 10% reduction of false positives from 2.5 million analyses. We applied the method to our production analysis pipeline that establishes genotype-phenotype associations by comparing mutant versus control data. We report an increase of 30% in significant P-values, as well as linkage to 106 versus 99 disease models via phenotype overlap with the soft-windowed and non-windowed approaches, respectively, from a set of 2082 mutant mouse lines. Our method is generalizable and can benefit large-scale human phenomic projects such as the UK Biobank and the All of Us resources.Availability and implementationThe method is freely available in the R package SmoothWin, available on CRAN http://CRAN.R-project.org/package=SmoothWin.Supplementary informationSupplementary data are available at Bioinformatics online.
- Published
- 2020
29. Mouse mutant phenotyping at scale reveals novel genes controlling bone mineral density
- Author
-
Swan, Anna L, Schütt, Christine, Rozman, Jan, del Mar Muñiz Moreno, Maria, Brandmaier, Stefan, Simon, Michelle, Leuchtenberger, Stefanie, Griffiths, Mark, Brommage, Robert, Keskivali-Bond, Piia, Grallert, Harald, Werner, Thomas, Teperino, Raffaele, Becker, Lore, Miller, Gregor, Moshiri, Ala, Seavitt, John R, Cissell, Derek D, Meehan, Terrence F, Acar, Elif F, Lelliott, Christopher J, Flenniken, Ann M, Champy, Marie-France, Sorg, Tania, Ayadi, Abdel, Braun, Robert E, Cater, Heather, Dickinson, Mary E, Flicek, Paul, Gallegos, Juan, Ghirardello, Elena J, Heaney, Jason D, Jacquot, Sylvie, Lally, Connor, Logan, John G, Teboul, Lydia, Mason, Jeremy, Spielmann, Nadine, McKerlie, Colin, Murray, Stephen A, Nutter, Lauryl MJ, Odfalk, Kristian F, Parkinson, Helen, Prochazka, Jan, Reynolds, Corey L, Selloum, Mohammed, Spoutil, Frantisek, Svenson, Karen L, Vales, Taylor S, Wells, Sara E, White, Jacqueline K, Sedlacek, Radislav, Wurst, Wolfgang, Lloyd, KC Kent, Croucher, Peter I, Fuchs, Helmut, Williams, Graham R, Bassett, JH Duncan, Gailus-Durner, Valerie, Herault, Yann, Mallon, Ann-Marie, Brown, Steve DM, Mayer-Kuckuk, Philipp, and Hrabe de Angelis, Martin
- Subjects
Biological Sciences ,Genetics ,Osteoporosis ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Aetiology ,Underpinning research ,Musculoskeletal ,Animals ,Bone Density ,Female ,Gene Expression Regulation ,Gene Ontology ,Genetic Pleiotropy ,Genome-Wide Association Study ,Genotype ,Male ,Mice ,Mice ,Transgenic ,Mutation ,Osteoblasts ,Osteoclasts ,Phenotype ,Promoter Regions ,Genetic ,Protein Interaction Maps ,Sex Characteristics ,Transcriptome ,IMPC Consortium ,Developmental Biology - Abstract
The genetic landscape of diseases associated with changes in bone mineral density (BMD), such as osteoporosis, is only partially understood. Here, we explored data from 3,823 mutant mouse strains for BMD, a measure that is frequently altered in a range of bone pathologies, including osteoporosis. A total of 200 genes were found to significantly affect BMD. This pool of BMD genes comprised 141 genes with previously unknown functions in bone biology and was complementary to pools derived from recent human studies. Nineteen of the 141 genes also caused skeletal abnormalities. Examination of the BMD genes in osteoclasts and osteoblasts underscored BMD pathways, including vesicle transport, in these cells and together with in silico bone turnover studies resulted in the prioritization of candidate genes for further investigation. Overall, the results add novel pathophysiological and molecular insight into bone health and disease.
- Published
- 2020
30. Analysis of genome-wide knockout mouse database identifies candidate ciliopathy genes
- Author
-
Kendall Higgins, Bret A. Moore, Zorana Berberovic, Hibret A. Adissu, Mohammad Eskandarian, Ann M. Flenniken, Andy Shao, Denise M. Imai, Dave Clary, Louise Lanoue, Susan Newbigging, Lauryl M. J. Nutter, David J. Adams, Fatima Bosch, Robert E. Braun, Steve D. M. Brown, Mary E. Dickinson, Michael Dobbie, Paul Flicek, Xiang Gao, Sanjeev Galande, Anne Grobler, Jason D. Heaney, Yann Herault, Martin Hrabe de Angelis, Hsian-Jean Genie Chin, Fabio Mammano, Chuan Qin, Toshihiko Shiroishi, Radislav Sedlacek, J.-K. Seong, Ying Xu, The IMPC Consortium, K. C. Kent Lloyd, Colin McKerlie, and Ala Moshiri
- Subjects
Medicine ,Science - Abstract
Abstract We searched a database of single-gene knockout (KO) mice produced by the International Mouse Phenotyping Consortium (IMPC) to identify candidate ciliopathy genes. We first screened for phenotypes in mouse lines with both ocular and renal or reproductive trait abnormalities. The STRING protein interaction tool was used to identify interactions between known cilia gene products and those encoded by the genes in individual knockout mouse strains in order to generate a list of “candidate ciliopathy genes.” From this list, 32 genes encoded proteins predicted to interact with known ciliopathy proteins. Of these, 25 had no previously described roles in ciliary pathobiology. Histological and morphological evidence of phenotypes found in ciliopathies in knockout mouse lines are presented as examples (genes Abi2, Wdr62, Ap4e1, Dync1li1, and Prkab1). Phenotyping data and descriptions generated on IMPC mouse line are useful for mechanistic studies, target discovery, rare disease diagnosis, and preclinical therapeutic development trials. Here we demonstrate the effective use of the IMPC phenotype data to uncover genes with no previous role in ciliary biology, which may be clinically relevant for identification of novel disease genes implicated in ciliopathies.
- Published
- 2022
- Full Text
- View/download PDF
31. Comparative analysis of repeat content in plant genomes, large and small
- Author
-
Joris Argentin, Dan Bolser, Paul J. Kersey, and Paul Flicek
- Subjects
transposons ,plants ,pipeline ,annotation ,comparative analysis ,Plant culture ,SB1-1110 - Abstract
The DNA Features pipeline is the analysis pipeline at EMBL-EBI that annotates repeat elements, including transposable elements. With Ensembl’s goal to stay at the cutting edge of genome annotation, we proved that this pipeline needed an update. We then created a new analysis that allowed the Ensembl database to store the repeat classification from the PGSB repeat classification (Recat). This new dataset was then fetched using Perl scripts and used to prove that the pipeline modification induced a gain in sensitivity. Finally, we performed a comparative analysis of transposable element distribution in all plant species available, raising new questions about transposable elements in certain branches of the taxonomic tree.
- Published
- 2023
- Full Text
- View/download PDF
32. Multi-platform discovery of haplotype-resolved structural variation in human genomes.
- Author
-
Chaisson, Mark JP, Sanders, Ashley D, Zhao, Xuefang, Malhotra, Ankit, Porubsky, David, Rausch, Tobias, Gardner, Eugene J, Rodriguez, Oscar L, Guo, Li, Collins, Ryan L, Fan, Xian, Wen, Jia, Handsaker, Robert E, Fairley, Susan, Kronenberg, Zev N, Kong, Xiangmeng, Hormozdiari, Fereydoun, Lee, Dillon, Wenger, Aaron M, Hastie, Alex R, Antaki, Danny, Anantharaman, Thomas, Audano, Peter A, Brand, Harrison, Cantsilieris, Stuart, Cao, Han, Cerveira, Eliza, Chen, Chong, Chen, Xintong, Chin, Chen-Shan, Chong, Zechen, Chuang, Nelson T, Lambert, Christine C, Church, Deanna M, Clarke, Laura, Farrell, Andrew, Flores, Joey, Galeev, Timur, Gorkin, David U, Gujral, Madhusudan, Guryev, Victor, Heaton, William Haynes, Korlach, Jonas, Kumar, Sushant, Kwon, Jee Young, Lam, Ernest T, Lee, Jong Eun, Lee, Joyce, Lee, Wan-Ping, Lee, Sau Peng, Li, Shantao, Marks, Patrick, Viaud-Martinez, Karine, Meiers, Sascha, Munson, Katherine M, Navarro, Fabio CP, Nelson, Bradley J, Nodzak, Conor, Noor, Amina, Kyriazopoulou-Panagiotopoulou, Sofia, Pang, Andy WC, Qiu, Yunjiang, Rosanio, Gabriel, Ryan, Mallory, Stütz, Adrian, Spierings, Diana CJ, Ward, Alistair, Welch, AnneMarie E, Xiao, Ming, Xu, Wei, Zhang, Chengsheng, Zhu, Qihui, Zheng-Bradley, Xiangqun, Lowy, Ernesto, Yakneen, Sergei, McCarroll, Steven, Jun, Goo, Ding, Li, Koh, Chong Lek, Ren, Bing, Flicek, Paul, Chen, Ken, Gerstein, Mark B, Kwok, Pui-Yan, Lansdorp, Peter M, Marth, Gabor T, Sebat, Jonathan, Shi, Xinghua, Bashir, Ali, Ye, Kai, Devine, Scott E, Talkowski, Michael E, Mills, Ryan E, Marschall, Tobias, Korbel, Jan O, Eichler, Evan E, and Lee, Charles
- Subjects
Humans ,Chromosome Mapping ,Genomics ,Haplotypes ,Genome ,Human ,Algorithms ,Databases ,Genetic ,INDEL Mutation ,Genomic Structural Variation ,High-Throughput Nucleotide Sequencing ,Whole Genome Sequencing ,Genome ,Human ,Databases ,Genetic - Abstract
The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (
- Published
- 2019
33. Publisher Correction: Federated discovery and sharing of genomic data using Beacons
- Author
-
Fiume, Marc, Cupak, Miroslav, Keenan, Stephen, Rambla, Jordi, de la Torre, Sabela, Dyke, Stephanie OM, Brookes, Anthony J, Carey, Knox, Lloyd, David, Goodhand, Peter, Haeussler, Maximilian, Baudis, Michael, Stockinger, Heinz, Dolman, Lena, Lappalainen, Ilkka, Törnroos, Juha, Linden, Mikael, Spalding, J Dylan, Ur-Rehman, Saif, Page, Angela, Flicek, Paul, Sherry, Stephen, Haussler, David, Varma, Susheel, Saunders, Gary, and Scollen, Serena
- Subjects
Information and Computing Sciences ,Historical Studies ,History ,Heritage and Archaeology - Abstract
In the version of this article initially published, Lena Dolman's second affiliation was given as Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK. The correct second affiliation is Ontario Institute for Cancer Research, Toronto, Ontario, Canada. The error has been corrected in the HTML and PDF versions of the article.
- Published
- 2019
34. Federated discovery and sharing of genomic data using Beacons
- Author
-
Fiume, Marc, Cupak, Miroslav, Keenan, Stephen, Rambla, Jordi, de la Torre, Sabela, Dyke, Stephanie OM, Brookes, Anthony J, Carey, Knox, Lloyd, David, Goodhand, Peter, Haeussler, Maximilian, Baudis, Michael, Stockinger, Heinz, Dolman, Lena, Lappalainen, Ilkka, Törnroos, Juha, Linden, Mikael, Spalding, J Dylan, Ur-Rehman, Saif, Page, Angela, Flicek, Paul, Sherry, Stephen, Haussler, David, Varma, Susheel, Saunders, Gary, and Scollen, Serena
- Subjects
Genomics ,Humans ,Information Dissemination ,Information Storage and Retrieval - Published
- 2019
35. Author Correction: Identification of genes required for eye development by high-throughput screening of mouse knockouts
- Author
-
Moore, Bret A, Leonard, Brian C, Sebbag, Lionel, Edwards, Sydney G, Cooper, Ann, Imai, Denise M, Straiton, Ewan, Santos, Luis, Reilly, Christopher, Griffey, Stephen M, Bower, Lynette, Clary, David, Mason, Jeremy, Roux, Michel J, Meziane, Hamid, Herault, Yann, McKerlie, Colin, Flenniken, Ann M, Nutter, Lauryl MJ, Berberovic, Zorana, Owen, Celeste, Newbigging, Susan, Adissu, Hibret, Eskandarian, Mohammed, Hsu, Chih-Wei, Kalaga, Sowmya, Udensi, Uchechukwu, Asomugha, Chinwe, Bohat, Ritu, Gallegos, Juan J, Seavitt, John R, Heaney, Jason D, Beaudet, Arthur L, Dickinson, Mary E, Justice, Monica J, Philip, Vivek, Kumar, Vivek, Svenson, Karen L, Braun, Robert E, Wells, Sara, Cater, Heather, Stewart, Michelle, Clementson-Mobbs, Sharon, Joynson, Russell, Gao, Xiang, Suzuki, Tomohiro, Wakana, Shigeharu, Smedley, Damian, Seong, JK, Tocchini-Valentini, Glauco, Moore, Mark, Fletcher, Colin, Karp, Natasha, Ramirez-Solis, Ramiro, White, Jacqueline K, de Angelis, Martin Hrabe, Wurst, Wolfgang, Thomasy, Sara M, Flicek, Paul, Parkinson, Helen, Brown, Steve DM, Meehan, Terrence F, Nishina, Patsy M, Murray, Stephen A, Krebs, Mark P, Mallon, Ann-Marie, Kent Lloyd, KC, Murphy, Christopher J, and Moshiri, Ala
- Subjects
Biomedical and Clinical Sciences ,Genetics ,International Mouse Phenotyping Consortium ,Biological sciences ,Biomedical and clinical sciences - Abstract
[This corrects the article DOI: 10.1038/s42003-018-0226-0.].
- Published
- 2019
36. Erratum: Author Correction: Identification of genes required for eye development by high-throughput screening of mouse knockouts.
- Author
-
Moore, Bret A, Leonard, Brian C, Sebbag, Lionel, Edwards, Sydney G, Cooper, Ann, Imai, Denise M, Straiton, Ewan, Santos, Luis, Reilly, Christopher, Griffey, Stephen M, Bower, Lynette, Clary, David, Mason, Jeremy, Roux, Michel J, Meziane, Hamid, Herault, Yann, International Mouse Phenotyping Consortium, McKerlie, Colin, Flenniken, Ann M, Nutter, Lauryl MJ, Berberovic, Zorana, Owen, Celeste, Newbigging, Susan, Adissu, Hibret, Eskandarian, Mohammed, Hsu, Chih-Wei, Kalaga, Sowmya, Udensi, Uchechukwu, Asomugha, Chinwe, Bohat, Ritu, Gallegos, Juan J, Seavitt, John R, Heaney, Jason D, Beaudet, Arthur L, Dickinson, Mary E, Justice, Monica J, Philip, Vivek, Kumar, Vivek, Svenson, Karen L, Braun, Robert E, Wells, Sara, Cater, Heather, Stewart, Michelle, Clementson-Mobbs, Sharon, Joynson, Russell, Gao, Xiang, Suzuki, Tomohiro, Wakana, Shigeharu, Smedley, Damian, Seong, JK, Tocchini-Valentini, Glauco, Moore, Mark, Fletcher, Colin, Karp, Natasha, Ramirez-Solis, Ramiro, White, Jacqueline K, de Angelis, Martin Hrabe, Wurst, Wolfgang, Thomasy, Sara M, Flicek, Paul, Parkinson, Helen, Brown, Steve DM, Meehan, Terrence F, Nishina, Patsy M, Murray, Stephen A, Krebs, Mark P, Mallon, Ann-Marie, Kent Lloyd, KC, Murphy, Christopher J, and Moshiri, Ala
- Subjects
International Mouse Phenotyping Consortium ,Genetics - Abstract
[This corrects the article DOI: 10.1038/s42003-018-0226-0.].
- Published
- 2019
37. A joint NCBI and EMBL-EBI transcript set for clinical genomics and research
- Author
-
Morales, Joannella, Pujar, Shashikant, Loveland, Jane E., Astashyn, Alex, Bennett, Ruth, Berry, Andrew, Cox, Eric, Davidson, Claire, Ermolaeva, Olga, Farrell, Catherine M., Fatima, Reham, Gil, Laurent, Goldfarb, Tamara, Gonzalez, Jose M., Haddad, Diana, Hardy, Matthew, Hunt, Toby, Jackson, John, Joardar, Vinita S., Kay, Michael, Kodali, Vamsi K., McGarvey, Kelly M., McMahon, Aoife, Mudge, Jonathan M., Murphy, Daniel N., Murphy, Michael R., Rajput, Bhanu, Rangwala, Sanjida H., Riddick, Lillian D., Thibaud-Nissen, Françoise, Threadgold, Glen, Vatsan, Anjana R., Wallin, Craig, Webb, David, Flicek, Paul, Birney, Ewan, Pruitt, Kim D., Frankish, Adam, Cunningham, Fiona, and Murphy, Terence D.
- Published
- 2022
- Full Text
- View/download PDF
38. Mako: A Graph-based Pattern Growth Approach to Detect Complex Structural Variants
- Author
-
Jiadong Lin, Xiaofei Yang, Walter Kosters, Tun Xu, Yanyan Jia, Songbo Wang, Qihui Zhu, Mallory Ryan, Li Guo, Chengsheng Zhang, Charles Lee, Scott E. Devine, Evan E. Eichler, Kai Ye, Mark B. Gerstein, Ashley D. Sanders, Micheal C. Zody, Michael E. Talkowski, Ryan E. Mills, Jan O. Korbel, Tobias Marschall, Peter Ebert, Peter A. Audano, Bernardo Rodriguez-Martin, David Porubsky, Marc Jan Bonder, Arvis Sulovari, Jana Ebler, Weichen Zhou, Rebecca Serra Mari, Feyza Yilmaz, Xuefang Zhao, PingHsun Hsieh, Joyce Lee, Sushant Kumar, Tobias Rausch, Yu Chen, Zechen Chong, Katherine M. Munson, Mark J.P. Chaisson, Junjie Chen, Xinghua Shi, Aaron M. Wenger, William T. Harvey, Patrick Hansenfeld, Allison Regier, Ira M. Hall, Paul Flicek, Alex R. Hastie, and Susan Fairely
- Subjects
Next-generation sequencing ,Complex structural variant ,Pattern growth ,Graph mining ,Formation mechanism ,Biology (General) ,QH301-705.5 ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Complex structural variants (CSVs) are genomic alterations that have more than two breakpoints and are considered as the simultaneous occurrence of simple structural variants. However, detecting the compounded mutational signals of CSVs is challenging through a commonly used model-match strategy. As a result, there has been limited progress for CSV discovery compared with simple structural variants. Here, we systematically analyzed the multi-breakpoint connection feature of CSVs, and proposed Mako, utilizing a bottom-up guided model-free strategy, to detect CSVs from paired-end short-read sequencing. Specifically, we implemented a graph-based pattern growth approach, where the graph depicts potential breakpoint connections, and pattern growth enables CSV detection without pre-defined models. Comprehensive evaluations on both simulated and real datasets revealed that Mako outperformed other algorithms. Notably, validation rates of CSVs on real data based on experimental and computational validations as well as manual inspections are around 70%, where the medians of experimental and computational breakpoint shift are 13 bp and 26 bp, respectively. Moreover, the Mako CSV subgraph effectively characterized the breakpoint connections of a CSV event and uncovered a total of 15 CSV types, including two novel types of adjacent segment swap and tandem dispersed duplication. Further analysis of these CSVs also revealed the impact of sequence homology on the formation of CSVs. Mako is publicly available at https://github.com/xjtu-omics/Mako.
- Published
- 2022
- Full Text
- View/download PDF
39. Registered access: authorizing data access
- Author
-
Dyke, Stephanie OM, Linden, Mikael, Lappalainen, Ilkka, De Argila, Jordi Rambla, Carey, Knox, Lloyd, David, Spalding, J Dylan, Cabili, Moran N, Kerry, Giselle, Foreman, Julia, Cutts, Tim, Shabani, Mahsa, Rodriguez, Laura L, Haeussler, Maximilian, Walsh, Brian, Jiang, Xiaoqian, Wang, Shuang, Perrett, Daniel, Boughtwood, Tiffany, Matern, Andreas, Brookes, Anthony J, Cupak, Miro, Fiume, Marc, Pandya, Ravi, Tulchinsky, Ilia, Scollen, Serena, Törnroos, Juha, Das, Samir, Evans, Alan C, Malin, Bradley A, Beck, Stephan, Brenner, Steven E, Nyrönen, Tommi, Blomberg, Niklas, Firth, Helen V, Hurles, Matthew, Philippakis, Anthony A, Rätsch, Gunnar, Brudno, Michael, Boycott, Kym M, Rehm, Heidi L, Baudis, Michael, Sherry, Stephen T, Kato, Kazuto, Knoppers, Bartha M, Baker, Dixie, and Flicek, Paul
- Subjects
Biological Sciences ,Biomedical and Clinical Sciences ,Clinical Sciences ,Genetics ,8.3 Policy ,ethics ,and research governance ,Health and social care services research ,Generic health relevance ,Good Health and Well Being ,Access to Information ,Genetics ,Medical ,Genomics ,Humans ,Information Dissemination ,Licensure ,Practice Guidelines as Topic ,Genetics & Heredity ,Clinical sciences - Abstract
The Global Alliance for Genomics and Health (GA4GH) proposes a data access policy model-"registered access"-to increase and improve access to data requiring an agreement to basic terms and conditions, such as the use of DNA sequence and health data in research. A registered access policy would enable a range of categories of users to gain access, starting with researchers and clinical care professionals. It would also facilitate general use and reuse of data but within the bounds of consent restrictions and other ethical obligations. In piloting registered access with the Scientific Demonstration data sharing projects of GA4GH, we provide additional ethics, policy and technical guidance to facilitate the implementation of this access model in an international setting.
- Published
- 2018
40. Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci
- Author
-
Lilue, Jingtao, Doran, Anthony G, Fiddes, Ian T, Abrudan, Monica, Armstrong, Joel, Bennett, Ruth, Chow, William, Collins, Joanna, Collins, Stephan, Czechanski, Anne, Danecek, Petr, Diekhans, Mark, Dolle, Dirk-Dominik, Dunn, Matt, Durbin, Richard, Earl, Dent, Ferguson-Smith, Anne, Flicek, Paul, Flint, Jonathan, Frankish, Adam, Fu, Beiyuan, Gerstein, Mark, Gilbert, James, Goodstadt, Leo, Harrow, Jennifer, Howe, Kerstin, Ibarra-Soria, Ximena, Kolmogorov, Mikhail, Lelliott, Chris J, Logan, Darren W, Loveland, Jane, Mathews, Clayton E, Mott, Richard, Muir, Paul, Nachtweide, Stefanie, Navarro, Fabio CP, Odom, Duncan T, Park, Naomi, Pelan, Sarah, Pham, Son K, Quail, Mike, Reinholdt, Laura, Romoth, Lars, Shirley, Lesley, Sisu, Cristina, Sjoberg-Herrera, Marcela, Stanke, Mario, Steward, Charles, Thomas, Mark, Threadgold, Glen, Thybert, David, Torrance, James, Wong, Kim, Wood, Jonathan, Yalcin, Binnaz, Yang, Fengtang, Adams, David J, Paten, Benedict, and Keane, Thomas M
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Human Genome ,1.1 Normal biological development and functioning ,2.1 Biological and endogenous factors ,Animals ,Animals ,Laboratory ,Chromosome Mapping ,Genetic Loci ,Genome ,Haplotypes ,Mice ,Mice ,Inbred BALB C ,Mice ,Inbred C3H ,Mice ,Inbred C57BL ,Mice ,Inbred CBA ,Mice ,Inbred DBA ,Mice ,Inbred NOD ,Mice ,Inbred Strains ,Molecular Sequence Annotation ,Phylogeny ,Polymorphism ,Single Nucleotide ,Species Specificity ,Medical and Health Sciences ,Developmental Biology ,Agricultural biotechnology ,Bioinformatics and computational biology - Abstract
We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.
- Published
- 2018
41. Chromosome assembly of large and complex genomes using multiple references
- Author
-
Kolmogorov, Mikhail, Armstrong, Joel, Raney, Brian J, Streeter, Ian, Dunn, Matthew, Yang, Fengtang, Odom, Duncan, Flicek, Paul, Keane, Thomas M, Thybert, David, Paten, Benedict, and Pham, Son
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Generic health relevance ,Animals ,Contig Mapping ,Mice ,Reference Standards ,Whole Genome Sequencing ,Medical and Health Sciences ,Bioinformatics - Abstract
Despite the rapid development of sequencing technologies, the assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout 2, a reference-assisted assembly tool that works for large and complex genomes. By taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout 2 infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. By using Ragout 2, we transformed NGS assemblies of 16 laboratory mouse strains into sets of complete chromosomes, leaving
- Published
- 2018
42. The International Mouse Phenotyping Consortium (IMPC): a functional catalogue of the mammalian genome that informs conservation
- Author
-
Muñoz-Fuentes, Violeta, Cacheiro, Pilar, Meehan, Terrence F, Aguilar-Pimentel, Juan Antonio, Brown, Steve DM, Flenniken, Ann M, Flicek, Paul, Galli, Antonella, Mashhadi, Hamed Haseli, Hrabě de Angelis, Martin, Kim, Jong Kyoung, Lloyd, KC Kent, McKerlie, Colin, Morgan, Hugh, Murray, Stephen A, Nutter, Lauryl MJ, Reilly, Patrick T, Seavitt, John R, Seong, Je Kyung, Simon, Michelle, Wardle-Jones, Hannah, Mallon, Ann-Marie, Smedley, Damian, Parkinson, Helen E, and the IMPC consortium
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Generic health relevance ,Life on Land ,Cheetah ,Endangered species ,Loss-of-function ,Non-model species ,Panda ,Polar bear ,Phenotype ,Wolf ,Essential genes ,IMPC ,Knockout ,Mouse ,IMPC consortium ,Environmental Sciences ,Evolutionary Biology ,Biological sciences ,Environmental sciences - Abstract
The International Mouse Phenotyping Consortium (IMPC) is building a catalogue of mammalian gene function by producing and phenotyping a knockout mouse line for every protein-coding gene. To date, the IMPC has generated and characterised 5186 mutant lines. One-third of the lines have been found to be non-viable and over 300 new mouse models of human disease have been identified thus far. While current bioinformatics efforts are focused on translating results to better understand human disease processes, IMPC data also aids understanding genetic function and processes in other species. Here we show, using gorilla genomic data, how genes essential to development in mice can be used to help assess the potentially deleterious impact of gene variants in other species. This type of analyses could be used to select optimal breeders in endangered species to maintain or increase fitness and avoid variants associated to impaired-health phenotypes or loss-of-function mutations in genes of critical importance. We also show, using selected examples from various mammal species, how IMPC data can aid in the identification of candidate genes for studying a condition of interest, deliver information about the mechanisms involved, or support predictions for the function of genes that may play a role in adaptation. With genotyping costs decreasing and the continued improvements of bioinformatics tools, the analyses we demonstrate can be routinely applied.
- Published
- 2018
43. Identification of genetic elements in metabolism by high-throughput mouse phenotyping
- Author
-
Rozman, Jan, Rathkolb, Birgit, Oestereicher, Manuela A, Schütt, Christine, Ravindranath, Aakash Chavan, Leuchtenberger, Stefanie, Sharma, Sapna, Kistler, Martin, Willershäuser, Monja, Brommage, Robert, Meehan, Terrence F, Mason, Jeremy, Haselimashhadi, Hamed, IMPC Consortium, Hough, Tertius, Mallon, Ann-Marie, Wells, Sara, Santos, Luis, Lelliott, Christopher J, White, Jacqueline K, Sorg, Tania, Champy, Marie-France, Bower, Lynette R, Reynolds, Corey L, Flenniken, Ann M, Murray, Stephen A, Nutter, Lauryl MJ, Svenson, Karen L, West, David, Tocchini-Valentini, Glauco P, Beaudet, Arthur L, Bosch, Fatima, Braun, Robert B, Dobbie, Michael S, Gao, Xiang, Herault, Yann, Moshiri, Ala, Moore, Bret A, Kent Lloyd, KC, McKerlie, Colin, Masuya, Hiroshi, Tanaka, Nobuhiko, Flicek, Paul, Parkinson, Helen E, Sedlacek, Radislav, Seong, Je Kyung, Wang, Chi-Kuang Leo, Moore, Mark, Brown, Steve D, Tschöp, Matthias H, Wurst, Wolfgang, Klingenspor, Martin, Wolf, Eckhard, Beckers, Johannes, Machicao, Fausto, Peter, Andreas, Staiger, Harald, Häring, Hans-Ulrich, Grallert, Harald, Campillos, Monica, Maier, Holger, Fuchs, Helmut, Gailus-Durner, Valerie, Werner, Thomas, and Hrabe de Angelis, Martin
- Subjects
Biochemistry and Cell Biology ,Bioinformatics and Computational Biology ,Genetics ,Biological Sciences ,Human Genome ,Nutrition ,2.1 Biological and endogenous factors ,Generic health relevance ,Animals ,Area Under Curve ,Basal Metabolism ,Blood Glucose ,Body Weight ,Diabetes Mellitus ,Type 2 ,Gene Regulatory Networks ,Genome-Wide Association Study ,High-Throughput Screening Assays ,Humans ,Metabolic Diseases ,Mice ,Mice ,Knockout ,Obesity ,Oxygen Consumption ,Phenotype ,Triglycerides ,IMPC Consortium - Abstract
Metabolic diseases are a worldwide problem but the underlying genetic factors and their relevance to metabolic disease remain incompletely understood. Genome-wide research is needed to characterize so-far unannotated mammalian metabolic genes. Here, we generate and analyze metabolic phenotypic data of 2016 knockout mouse strains under the aegis of the International Mouse Phenotyping Consortium (IMPC) and find 974 gene knockouts with strong metabolic phenotypes. 429 of those had no previous link to metabolism and 51 genes remain functionally completely unannotated. We compared human orthologues of these uncharacterized genes in five GWAS consortia and indeed 23 candidate genes are associated with metabolic disease. We further identify common regulatory elements in promoters of candidate genes. As each regulatory element is composed of several transcription factor binding sites, our data reveal an extensive metabolic phenotype-associated network of co-regulated genes. Our systematic mouse phenotype analysis thus paves the way for full functional annotation of the genome.
- Published
- 2018
44. Identification of genes required for eye development by high-throughput screening of mouse knockouts
- Author
-
Moore, Bret A, Leonard, Brian C, Sebbag, Lionel, Edwards, Sydney G, Cooper, Ann, Imai, Denise M, Straiton, Ewan, Santos, Luis, Reilly, Christopher, Griffey, Stephen M, Bower, Lynette, Clary, David, Mason, Jeremy, Roux, Michel J, Meziane, Hamid, Herault, Yann, McKerlie, Colin, Flenniken, Ann M, Nutter, Lauryl MJ, Berberovic, Zorana, Owen, Celeste, Newbigging, Susan, Adissu, Hibret, Eskandarian, Mohammed, Hsu, Chih-Wei, Kalaga, Sowmya, Udensi, Uchechukwu, Asomugha, Chinwe, Bohat, Ritu, Gallegos, Juan J, Seavitt, John R, Heaney, Jason D, Beaudet, Arthur L, Dickinson, Mary E, Justice, Monica J, Philip, Vivek, Kumar, Vivek, Svenson, Karen L, Braun, Robert E, Wells, Sara, Cater, Heather, Stewart, Michelle, Clementson-Mobbs, Sharon, Joynson, Russell, Gao, Xiang, Suzuki, Tomohiro, Wakana, Shigeharu, Smedley, Damian, Seong, JK, Tocchini-Valentini, Glauco, Moore, Mark, Fletcher, Colin, Karp, Natasha, Ramirez-Solis, Ramiro, White, Jacqueline K, de Angelis, Martin Hrabe, Wurst, Wolfgang, Thomasy, Sara M, Flicek, Paul, Parkinson, Helen, Brown, Steve DM, Meehan, Terrence F, Nishina, Patsy M, Murray, Stephen A, Krebs, Mark P, Mallon, Ann-Marie, Lloyd, KC Kent, Murphy, Christopher J, and Moshiri, Ala
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Biomedical and Clinical Sciences ,Genetics ,Ophthalmology and Optometry ,Human Genome ,Biotechnology ,Eye Disease and Disorders of Vision ,Aetiology ,2.1 Biological and endogenous factors ,Eye ,International Mouse Phenotyping Consortium ,Biological sciences ,Biomedical and clinical sciences - Abstract
Despite advances in next generation sequencing technologies, determining the genetic basis of ocular disease remains a major challenge due to the limited access and prohibitive cost of human forward genetics. Thus, less than 4,000 genes currently have available phenotype information for any organ system. Here we report the ophthalmic findings from the International Mouse Phenotyping Consortium, a large-scale functional genetic screen with the goal of generating and phenotyping a null mutant for every mouse gene. Of 4364 genes evaluated, 347 were identified to influence ocular phenotypes, 75% of which are entirely novel in ocular pathology. This discovery greatly increases the current number of genes known to contribute to ophthalmic disease, and it is likely that many of the genes will subsequently prove to be important in human ocular development and disease.
- Published
- 2018
45. Genetic variation and gene expression across multiple tissues and developmental stages in a nonhuman primate.
- Author
-
Jasinska, Anna J, Zelaya, Ivette, Service, Susan K, Peterson, Christine B, Cantor, Rita M, Choi, Oi-Wa, DeYoung, Joseph, Eskin, Eleazar, Fairbanks, Lynn A, Fears, Scott, Furterer, Allison E, Huang, Yu S, Ramensky, Vasily, Schmitt, Christopher A, Svardal, Hannes, Jorgensen, Matthew J, Kaplan, Jay R, Villar, Diego, Aken, Bronwen L, Flicek, Paul, Nag, Rishi, Wong, Emily S, Blangero, John, Dyer, Thomas D, Bogomolov, Marina, Benjamini, Yoav, Weinstock, George M, Dewar, Ken, Sabatti, Chiara, Wilson, Richard K, Jentsch, J David, Warren, Wesley, Coppola, Giovanni, Woods, Roger P, and Freimer, Nelson B
- Subjects
Brain ,Animals ,Humans ,Gene Expression Profiling ,Genotype ,Phenotype ,Polymorphism ,Single Nucleotide ,Quantitative Trait Loci ,Genetic Variation ,Genome-Wide Association Study ,Chlorocebus aethiops ,Genetics ,Biotechnology ,Human Genome ,Biological Sciences ,Medical and Health Sciences ,Developmental Biology - Abstract
By analyzing multitissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalog of expression quantitative trait loci (eQTLs) in a nonhuman primate model. This catalog contains more genome-wide significant eQTLs per sample than comparable human resources and identifies sex- and age-related expression patterns. Findings include a master regulatory locus that likely has a role in immune function and a locus regulating hippocampal long noncoding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders.
- Published
- 2017
46. Correction: Corrigendum: High-throughput discovery of novel developmental phenotypes
- Author
-
Dickinson, Mary E, Flenniken, Ann M, Ji, Xiao, Teboul, Lydia, Wong, Michael D, White, Jacqueline K, Meehan, Terrence F, Weninger, Wolfgang J, Westerberg, Henrik, Adissu, Hibret, Baker, Candice N, Bower, Lynette, Brown, James M, Caddle, L Brianna, Chiani, Francesco, Clary, Dave, Cleak, James, Daly, Mark J, Denegre, James M, Doe, Brendan, Dolan, Mary E, Edie Helmut Fuchs, Sarah M, Gailus-Durner, Valerie, Galli, Antonella, Gambadoro, Alessia, Gallegos, Juan, Guo, Shiying, Horner, Neil R, Hsu, Chih-Wei, Johnson, Sara J, Kalaga, Sowmya, Keith, Lance C, Lanoue, Louise, Lawson, Thomas N, Lek, Monkol, Mark, Manuel, Marschall, Susan, Mason, Jeremy, McElwee, Melissa L, Nutter, Susan Newbigging Lauryl MJ, Peterson, Kevin A, Ramirez-Solis, Ramiro, Rowland, Douglas J, Ryder, Edward, Samocha, Kaitlin E, Seavitt, John R, Selloum, Mohammed, Szoke-Kovacs, Zsombor, Tamura, Masaru, Trainor, Amanda G, Tudose, Ilinca, Wakana, Shigeharu, Warren, Jonathan, Wendling, Olivia, West, David B, Wong, Leeyean, Yoshiki, Atsushi, Wurst, Wolfgang, MacArthur, Daniel G, Tocchini-Valentini, Glauco P, Gao, Xiang, Flicek, Paul, Bradley, Allan, Skarnes, William C, Justice, Monica J, Parkinson, Helen E, Moore, Mark, Wells, Sara, Braun, Robert E, Svenson, Karen L, de Angelis, Martin Hrabe, Herault, Yann, Mohun, Tim, Mallon, Ann-Marie, Henkelman, R Mark, Brown, Steve DM, Adams, David J, Lloyd, KC Kent, McKerlie, Colin, Beaudet, Arthur L, and Murray, Maja Bućan Stephen A
- Subjects
International Mouse Phenotyping Consortium ,General Science & Technology - Abstract
This corrects the article DOI: 10.1038/nature19356.
- Published
- 2017
47. Disease model discovery from 3,328 gene knockouts by The International Mouse Phenotyping Consortium.
- Author
-
Meehan, Terrence F, Conte, Nathalie, West, David B, Jacobsen, Julius O, Mason, Jeremy, Warren, Jonathan, Chen, Chao-Kung, Tudose, Ilinca, Relac, Mike, Matthews, Peter, Karp, Natasha, Santos, Luis, Fiegel, Tanja, Ring, Natalie, Westerberg, Henrik, Greenaway, Simon, Sneddon, Duncan, Morgan, Hugh, Codner, Gemma F, Stewart, Michelle E, Brown, James, Horner, Neil, International Mouse Phenotyping Consortium, Haendel, Melissa, Washington, Nicole, Mungall, Christopher J, Reynolds, Corey L, Gallegos, Juan, Gailus-Durner, Valerie, Sorg, Tania, Pavlovic, Guillaume, Bower, Lynette R, Moore, Mark, Morse, Iva, Gao, Xiang, Tocchini-Valentini, Glauco P, Obata, Yuichi, Cho, Soo Young, Seong, Je Kyung, Seavitt, John, Beaudet, Arthur L, Dickinson, Mary E, Herault, Yann, Wurst, Wolfgang, de Angelis, Martin Hrabe, Lloyd, KC Kent, Flenniken, Ann M, Nutter, Lauryl MJ, Newbigging, Susan, McKerlie, Colin, Justice, Monica J, Murray, Stephen A, Svenson, Karen L, Braun, Robert E, White, Jacqueline K, Bradley, Allan, Flicek, Paul, Wells, Sara, Skarnes, William C, Adams, David J, Parkinson, Helen, Mallon, Ann-Marie, Brown, Steve DM, and Smedley, Damian
- Subjects
International Mouse Phenotyping Consortium ,Animals ,Mice ,Knockout ,Humans ,Mice ,Genetic Diseases ,Inborn ,Disease Models ,Animal ,Genetic Predisposition to Disease ,Phenotype ,Female ,Male ,Gene Knockout Techniques ,Knockout ,Genetic Diseases ,Inborn ,Disease Models ,Animal ,Developmental Biology ,Biological Sciences ,Medical and Health Sciences - Abstract
Although next-generation sequencing has revolutionized the ability to associate variants with human diseases, diagnostic rates and development of new therapies are still limited by a lack of knowledge of the functions and pathobiological mechanisms of most genes. To address this challenge, the International Mouse Phenotyping Consortium is creating a genome- and phenome-wide catalog of gene function by characterizing new knockout-mouse strains across diverse biological systems through a broad set of standardized phenotyping tests. All mice will be readily available to the biomedical community. Analyzing the first 3,328 genes identified models for 360 diseases, including the first models, to our knowledge, for type C Bernard-Soulier, Bardet-Biedl-5 and Gordon Holmes syndromes. 90% of our phenotype annotations were novel, providing functional evidence for 1,092 genes and candidates in genetically uncharacterized diseases including arrhythmogenic right ventricular dysplasia 3. Finally, we describe our role in variant functional validation with The 100,000 Genomes Project and others.
- Published
- 2017
48. Addressing Beacon re-identification attacks: quantification and mitigation of privacy risks
- Author
-
Raisaro, Jean Louis, Tramèr, Florian, Ji, Zhanglong, Bu, Diyue, Zhao, Yongan, Carey, Knox, Lloyd, David, Sofia, Heidi, Baker, Dixie, Flicek, Paul, Shringarpure, Suyash, Bustamante, Carlos, Wang, Shuang, Jiang, Xiaoqian, Ohno-Machado, Lucila, Tang, Haixu, Wang, XiaoFeng, and Hubaux, Jean-Pierre
- Subjects
Information and Computing Sciences ,Biomedical and Clinical Sciences ,Health Sciences ,Genetics ,Prevention ,Human Genome ,Generic health relevance ,Data Anonymization ,Genetic Privacy ,Genomics ,Humans ,Information Dissemination ,genomic privacy ,ga4gh ,beacon ,re-identification ,genomic data sharing ,Engineering ,Medical and Health Sciences ,Medical Informatics ,Biomedical and clinical sciences ,Health sciences ,Information and computing sciences - Abstract
The Global Alliance for Genomics and Health (GA4GH) created the Beacon Project as a means of testing the willingness of data holders to share genetic data in the simplest technical context-a query for the presence of a specified nucleotide at a given position within a chromosome. Each participating site (or "beacon") is responsible for assuring that genomic data are exposed through the Beacon service only with the permission of the individual to whom the data pertains and in accordance with the GA4GH policy and standards.While recognizing the inference risks associated with large-scale data aggregation, and the fact that some beacons contain sensitive phenotypic associations that increase privacy risk, the GA4GH adjudged the risk of re-identification based on the binary yes/no allele-presence query responses as acceptable. However, recent work demonstrated that, given a beacon with specific characteristics (including relatively small sample size and an adversary who possesses an individual's whole genome sequence), the individual's membership in a beacon can be inferred through repeated queries for variants present in the individual's genome.In this paper, we propose three practical strategies for reducing re-identification risks in beacons. The first two strategies manipulate the beacon such that the presence of rare alleles is obscured; the third strategy budgets the number of accesses per user for each individual genome. Using a beacon containing data from the 1000 Genomes Project, we demonstrate that the proposed strategies can effectively reduce re-identification risk in beacon-like datasets.
- Published
- 2017
49. A large scale hearing loss screen reveals an extensive unexplored genetic landscape for auditory dysfunction
- Author
-
Bowl, Michael R, Simon, Michelle M, Ingham, Neil J, Greenaway, Simon, Santos, Luis, Cater, Heather, Taylor, Sarah, Mason, Jeremy, Kurbatova, Natalja, Pearson, Selina, Bower, Lynette R, Clary, Dave A, Meziane, Hamid, Reilly, Patrick, Minowa, Osamu, Kelsey, Lois, The International Mouse Phenotyping Consortium, Tocchini-Valentini, Glauco P, Gao, Xiang, Bradley, Allan, Skarnes, William C, Moore, Mark, Beaudet, Arthur L, Justice, Monica J, Seavitt, John, Dickinson, Mary E, Wurst, Wolfgang, de Angelis, Martin Hrabe, Herault, Yann, Wakana, Shigeharu, Nutter, Lauryl MJ, Flenniken, Ann M, McKerlie, Colin, Murray, Stephen A, Svenson, Karen L, Braun, Robert E, West, David B, Lloyd, KC Kent, Adams, David J, White, Jacqui, Karp, Natasha, Flicek, Paul, Smedley, Damian, Meehan, Terrence F, Parkinson, Helen E, Teboul, Lydia M, Wells, Sara, Steel, Karen P, Mallon, Ann-Marie, and Brown, Steve DM
- Subjects
Biological Sciences ,Biomedical and Clinical Sciences ,Genetics ,Allied Health and Rehabilitation Science ,Clinical Sciences ,Health Sciences ,Clinical Research ,Hearing Loss ,Human Genome ,Neurosciences ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Neurological ,Ear ,Animals ,Datasets as Topic ,Genetic Testing ,Hearing Tests ,Mice ,Mice ,Knockout ,Phenotype ,Protein Interaction Maps ,International Mouse Phenotyping Consortium - Abstract
The developmental and physiological complexity of the auditory system is likely reflected in the underlying set of genes involved in auditory function. In humans, over 150 non-syndromic loci have been identified, and there are more than 400 human genetic syndromes with a hearing loss component. Over 100 non-syndromic hearing loss genes have been identified in mouse and human, but we remain ignorant of the full extent of the genetic landscape involved in auditory dysfunction. As part of the International Mouse Phenotyping Consortium, we undertook a hearing loss screen in a cohort of 3006 mouse knockout strains. In total, we identify 67 candidate hearing loss genes. We detect known hearing loss genes, but the vast majority, 52, of the candidate genes were novel. Our analysis reveals a large and unexplored genetic landscape involved with auditory function.The full extent of the genetic basis for hearing impairment is unknown. Here, as part of the International Mouse Phenotyping Consortium, the authors perform a hearing loss screen in 3006 mouse knockout strains and identify 52 new candidate genes for genetic hearing loss.
- Published
- 2017
50. Genetic perturbation of PU.1 binding and chromatin looping at neutrophil enhancers associates with autoimmune disease
- Author
-
Stephen Watt, Louella Vasquez, Klaudia Walter, Alice L. Mann, Kousik Kundu, Lu Chen, Ying Sims, Simone Ecker, Frances Burden, Samantha Farrow, Ben Farr, Valentina Iotchkova, Heather Elding, Daniel Mead, Manuel Tardaguila, Hannes Ponstingl, David Richardson, Avik Datta, Paul Flicek, Laura Clarke, Kate Downes, Tomi Pastinen, Peter Fraser, Mattia Frontini, Biola-Maria Javierre, Mikhail Spivakov, and Nicole Soranzo
- Subjects
Science - Abstract
PU.1 is a master regulator of myeloid development but its role in disease-relevant neutrophils is not well known. Here, the authors look at primary neutrophils from a human population and find that genetic variants affecting binding of PU.1 are associated with cell count and disease susceptibility.
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.