12 results on '"Joshua M. Wang"'
Search Results
2. The Genome of the Human Pathogen Candida albicans Is Shaped by Mutation and Cryptic Sexual Recombination
- Author
-
Joshua M. Wang, Richard J. Bennett, and Matthew Z. Anderson
- Subjects
Candida ,evolution ,loss of heterozygosity ,parasex ,recombination ,Microbiology ,QR1-502 - Abstract
ABSTRACT The opportunistic fungal pathogen Candida albicans lacks a conventional sexual program and is thought to evolve, at least primarily, through the clonal acquisition of genetic changes. Here, we performed an analysis of heterozygous diploid genomes from 21 clinical isolates to determine the natural evolutionary processes acting on the C. albicans genome. Mutation and recombination shaped the genomic landscape among the C. albicans isolates. Strain-specific single nucleotide polymorphisms (SNPs) and insertions/deletions (indels) clustered across the genome. Additionally, loss-of-heterozygosity (LOH) events contributed substantially to genotypic variation, with most long-tract LOH events extending to the ends of the chromosomes suggestive of repair via break-induced replication. Consistent with a model of inheritance by descent, most polymorphisms were shared between closely related strains. However, some isolates contained highly mosaic genomes consistent with strains having experienced interclade recombination during their evolutionary history. A detailed examination of mitochondrial genomes also revealed clear examples of interclade recombination among sequenced strains. These analyses therefore establish that both (para)sexual recombination and mitotic mutational processes drive evolution of this important pathogen. To further facilitate the study of C. albicans genomes, we also introduce an online platform, SNPMap, to examine SNP patterns in sequenced isolates. IMPORTANCE Mutations introduce variation into the genome upon which selection can act. Defining the nature of these changes is critical for determining species evolution, as well as for understanding the genetic changes driving important cellular processes. The heterozygous diploid fungus Candida albicans is both a frequent commensal organism and a prevalent opportunistic pathogen. A prevailing theory is that C. albicans evolves primarily through the gradual buildup of mitotic mutations, and a pressing issue is whether sexual or parasexual processes also operate within natural populations. Here, we establish that the C. albicans genome evolves by a combination of localized mutation and both short-tract and long-tract loss-of-heterozygosity (LOH) events within the sequenced isolates. Mutations are more prevalent within noncoding and heterozygous regions and LOH increases towards chromosome ends. Furthermore, we provide evidence for genetic exchange between isolates, establishing that sexual or parasexual processes have contributed to the diversity of both nuclear and mitochondrial genomes.
- Published
- 2018
- Full Text
- View/download PDF
3. Predictive Modeling of Morbidity and Mortality in Patients Hospitalized With COVID-19 and its Clinical Implications: Algorithm Development and Interpretation
- Author
-
Joshua M Wang, Wenke Liu, Xiaoshan Chen, Michael P McRae, John T McDevitt, and David Fenyö
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Public aspects of medicine ,RA1-1270 - Abstract
BackgroundThe COVID-19 pandemic began in early 2021 and placed significant strains on health care systems worldwide. There remains a compelling need to analyze factors that are predictive for patients at elevated risk of morbidity and mortality. ObjectiveThe goal of this retrospective study of patients who tested positive with COVID-19 and were treated at NYU (New York University) Langone Health was to identify clinical markers predictive of disease severity in order to assist in clinical decision triage and to provide additional biological insights into disease progression. MethodsThe clinical activity of 3740 patients at NYU Langone Hospital was obtained between January and August 2020; patient data were deidentified. Models were trained on clinical data during different parts of their hospital stay to predict three clinical outcomes: deceased, ventilated, or admitted to the intensive care unit (ICU). ResultsThe XGBoost (eXtreme Gradient Boosting) model that was trained on clinical data from the final 24 hours excelled at predicting mortality (area under the curve [AUC]=0.92; specificity=86%; and sensitivity=85%). Respiration rate was the most important feature, followed by SpO2 (peripheral oxygen saturation) and being aged 75 years and over. Performance of this model to predict the deceased outcome extended 5 days prior, with AUC=0.81, specificity=70%, and sensitivity=75%. When only using clinical data from the first 24 hours, AUCs of 0.79, 0.80, and 0.77 were obtained for deceased, ventilated, or ICU-admitted outcomes, respectively. Although respiration rate and SpO2 levels offered the highest feature importance, other canonical markers, including diabetic history, age, and temperature, offered minimal gain. When lab values were incorporated, prediction of mortality benefited the most from blood urea nitrogen and lactate dehydrogenase (LDH). Features that were predictive of morbidity included LDH, calcium, glucose, and C-reactive protein. ConclusionsTogether, this work summarizes efforts to systematically examine the importance of a wide range of features across different endpoint outcomes and at different hospitalization time points.
- Published
- 2021
- Full Text
- View/download PDF
4. Proteogenomic Characterization of Pancreatic Ductal Adenocarcinoma
- Author
-
Marcin J. Domagalski, Wen Jiang, Michael Smith, Li Ding, Michael Schnaubelt, Oxana Paklina, Gilbert S. Omenn, Magdalena Derejska, Karin D. Rodland, Johanna Gardner, Saravana M. Dhanasekaran, Pamela Grady, Pushpa Hariharan, David Mallery, Jesse Francis, Maciej Wiznerowicz, Eunkyung An, Nancy Roche, Ralph H. Hruban, Samuel H. Payne, Chen Huang, Olga Potapova, Gad Getz, Zhiao Shi, Shuai Guo, Oliver F. Bathe, Stacey Gabriel, Sandra Cottingham, Hui Zhang, Daniel Cui Zhou, Maureen Dyer, Houxiang Zhu, James Suh, Shuang Cai, Christopher R. Kinsinger, Felipe da Veiga Leprevost, Steven Chen, Chelsea J. Newton, Amanda G. Paulovich, Steven A. Carr, Melissa Borucki, Sandra Cerda, Troy Shelton, D. R. Mani, Tara Hiltke, Lijun Chen, Benjamin Haibe-Kains, Jiang Long, Ratna R. Thangudu, Arul M. Chinnaiyan, Mathangi Thiagarajan, Negin Vatanian, Peter Ronning, Thomas L. Bauer, Ki Sung Um, Christina Ayad, Seungyeul Yoo, Mitual Amin, Ruiyang Liu, Alicia Francis, Nikolay Gabrovski, Eric E. Schadt, Zhen Zhang, Alexey I. Nesvizhskii, Hariharan Easwaran, Huan Chen, Tao Liu, Elizabeth R. Duffy, Liwei Cao, Joshua M. Wang, Michael H.A. Roehrl, Antonio Colaprico, Ana I. Robles, Emily S. Boja, Rita Jui-Hsien Lu, Rodrigo Vargas Eguez, Yize Li, Jennifer M. Koziak, Wenke Liu, Weiming Yang, Arvind Singh Mer, Dana R. Valley, Sailaja Mareedu, Song Cao, Scott D. Jewell, William Bocik, Shilpi Singh, Yongchao Dou, Matthew A. Wyczalkowski, David Fenyö, Galen Hostetter, Liqun Qi, Wenyi Wang, Yvonne Shutack, Shirley Tsang, Karen A. Ketchum, Charles A. Goldthwaite, Katherine A. Hoadley, Richard D. Smith, Karsten Krug, Yuxing Liao, Nadezhda V. Terekhanova, Henry Rodriguez, Barbara Hindenach, Matthew J. Ellis, Yingwei Hu, Pei Wang, Daniel C. Rohrer, Sara R. Savage, Grace Zhao, Ludmila Danilova, Yige Wu, Parham Minoo, Jennifer M. Eschbacher, Nathan Edwards, T. Mamie Lih, Simina M. Boca, George D. Wilson, Alexey Shabunin, Bing Zhang, Michael A. Gillette, Brian J. Druker, David J. Clark, Jianbo Pan, Katarzyna Kusnierz, David Chesla, Ronald Matteotti, Corbin D. Jones, Michael J. Birrer, Lori J. Sokoll, Qing Kay Li, Mehdi Mesri, Peter B. McGarvey, Chet Birger, Barbara Pruetz, Daniel W. Chan, Bo Wen, Nicollette Maunganidze, and Jasmine Huang
- Subjects
Adult ,Male ,Pancreatic ductal adenocarcinoma ,Proteome ,Gene Dosage ,Biology ,Adenocarcinoma ,medicine.disease_cause ,General Biochemistry, Genetics and Molecular Biology ,Article ,Epigenesis, Genetic ,Substrate Specificity ,Cohort Studies ,medicine ,Humans ,Molecular Targeted Therapy ,Phosphorylation ,Aged ,Glycoproteins ,Proteogenomics ,Aged, 80 and over ,MicroRNA sequencing ,Genome, Human ,RNA ,Endothelial Cells ,Methylation ,Middle Aged ,Phosphoproteins ,Prognosis ,Pancreatic Neoplasms ,Phenotype ,Cancer research ,Female ,KRAS ,Signal transduction ,Carcinogenesis ,Transcriptome ,Glycolysis ,Protein Kinases ,Algorithms ,Carcinoma, Pancreatic Ductal - Abstract
Summary Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.
- Published
- 2021
5. Predictive Modeling of Morbidity and Mortality in Patients Hospitalized With COVID-19 and its Clinical Implications: Algorithm Development and Interpretation
- Author
-
David Fenyö, Michael P. McRae, John T. McDevitt, Wenke Liu, Joshua M. Wang, and Xiaoshan Chen
- Subjects
Male ,coronavirus ,030204 cardiovascular system & hematology ,clinical ,law.invention ,0302 clinical medicine ,law ,Health care ,Medicine ,030212 general & internal medicine ,Young adult ,Child ,Blood urea nitrogen ,Area under the curve ,Middle Aged ,Intensive care unit ,Hospitals ,Hospitalization ,Intensive Care Units ,Area Under Curve ,Child, Preschool ,Female ,predictive modeling ,Algorithms ,Adult ,medicine.medical_specialty ,Adolescent ,Health Informatics ,Article ,03 medical and health sciences ,Young Adult ,Diabetes mellitus ,Diabetes Mellitus ,Humans ,Pandemics ,Aged ,Retrospective Studies ,business.industry ,SARS-CoV-2 ,Infant, Newborn ,COVID-19 ,Infant ,Retrospective cohort study ,medicine.disease ,Triage ,Emergency medicine ,New York City ,Morbidity ,business - Abstract
Background The COVID-19 pandemic began in early 2021 and placed significant strains on health care systems worldwide. There remains a compelling need to analyze factors that are predictive for patients at elevated risk of morbidity and mortality. Objective The goal of this retrospective study of patients who tested positive with COVID-19 and were treated at NYU (New York University) Langone Health was to identify clinical markers predictive of disease severity in order to assist in clinical decision triage and to provide additional biological insights into disease progression. Methods The clinical activity of 3740 patients at NYU Langone Hospital was obtained between January and August 2020; patient data were deidentified. Models were trained on clinical data during different parts of their hospital stay to predict three clinical outcomes: deceased, ventilated, or admitted to the intensive care unit (ICU). Results The XGBoost (eXtreme Gradient Boosting) model that was trained on clinical data from the final 24 hours excelled at predicting mortality (area under the curve [AUC]=0.92; specificity=86%; and sensitivity=85%). Respiration rate was the most important feature, followed by SpO2 (peripheral oxygen saturation) and being aged 75 years and over. Performance of this model to predict the deceased outcome extended 5 days prior, with AUC=0.81, specificity=70%, and sensitivity=75%. When only using clinical data from the first 24 hours, AUCs of 0.79, 0.80, and 0.77 were obtained for deceased, ventilated, or ICU-admitted outcomes, respectively. Although respiration rate and SpO2 levels offered the highest feature importance, other canonical markers, including diabetic history, age, and temperature, offered minimal gain. When lab values were incorporated, prediction of mortality benefited the most from blood urea nitrogen and lactate dehydrogenase (LDH). Features that were predictive of morbidity included LDH, calcium, glucose, and C-reactive protein. Conclusions Together, this work summarizes efforts to systematically examine the importance of a wide range of features across different endpoint outcomes and at different hospitalization time points.
- Published
- 2021
6. Predictive Modeling of Morbidity and Mortality in Patients Hospitalized With COVID-19 and its Clinical Implications: Algorithm Development and Interpretation (Preprint)
- Author
-
Joshua M Wang, Wenke Liu, Xiaoshan Chen, Michael P McRae, John T McDevitt, and David Fenyö
- Abstract
BACKGROUND The COVID-19 pandemic began in early 2021 and placed significant strains on health care systems worldwide. There remains a compelling need to analyze factors that are predictive for patients at elevated risk of morbidity and mortality. OBJECTIVE The goal of this retrospective study of patients who tested positive with COVID-19 and were treated at NYU (New York University) Langone Health was to identify clinical markers predictive of disease severity in order to assist in clinical decision triage and to provide additional biological insights into disease progression. METHODS The clinical activity of 3740 patients at NYU Langone Hospital was obtained between January and August 2020; patient data were deidentified. Models were trained on clinical data during different parts of their hospital stay to predict three clinical outcomes: deceased, ventilated, or admitted to the intensive care unit (ICU). RESULTS The XGBoost (eXtreme Gradient Boosting) model that was trained on clinical data from the final 24 hours excelled at predicting mortality (area under the curve [AUC]=0.92; specificity=86%; and sensitivity=85%). Respiration rate was the most important feature, followed by SpO2 (peripheral oxygen saturation) and being aged 75 years and over. Performance of this model to predict the deceased outcome extended 5 days prior, with AUC=0.81, specificity=70%, and sensitivity=75%. When only using clinical data from the first 24 hours, AUCs of 0.79, 0.80, and 0.77 were obtained for deceased, ventilated, or ICU-admitted outcomes, respectively. Although respiration rate and SpO2 levels offered the highest feature importance, other canonical markers, including diabetic history, age, and temperature, offered minimal gain. When lab values were incorporated, prediction of mortality benefited the most from blood urea nitrogen and lactate dehydrogenase (LDH). Features that were predictive of morbidity included LDH, calcium, glucose, and C-reactive protein. CONCLUSIONS Together, this work summarizes efforts to systematically examine the importance of a wide range of features across different endpoint outcomes and at different hospitalization time points.
- Published
- 2021
7. Intraspecies Transcriptional Profiling Reveals Key Regulators of Candida albicans Pathogenic Traits
- Author
-
Matthew Z. Anderson, Robert J. Fillinger, Richard J. Bennett, Matthew J. Dunn, Andrew L. Woodruff, and Joshua M. Wang
- Subjects
Genotype ,Gene regulatory network ,Microbiology ,transcriptional networks ,Virology ,Gene Expression Regulation, Fungal ,Candida albicans ,Genetic variation ,Humans ,Epigenetics ,Gene ,Phylogeny ,Candida ,Genetics ,Virulence ,biology ,Sequence Analysis, RNA ,Genetic heterogeneity ,Gene Expression Profiling ,Candidiasis ,Genetic Variation ,Phenotypic trait ,Editor's Pick ,coexpression networks ,biology.organism_classification ,Corpus albicans ,QR1-502 ,Phenotype ,gene expression ,variation ,Genome, Fungal ,Research Article - Abstract
Infectious fungal species are often treated uniformly despite clear evidence of genotypic and phenotypic heterogeneity being widespread across strains. Identifying the genetic basis for this phenotypic diversity is extremely challenging because of the tens or hundreds of thousands of variants that may distinguish two strains., The human commensal and opportunistic fungal pathogen Candida albicans displays extensive genetic and phenotypic variation across clinical isolates. Here, we performed RNA sequencing on 21 well-characterized isolates to examine how genetic variation contributes to gene expression differences and to link these differences to phenotypic traits. C. albicans adapts primarily through clonal evolution, and yet hierarchical clustering of gene expression profiles in this set of isolates did not reproduce their phylogenetic relationship. Strikingly, strain-specific gene expression was prevalent in some strain backgrounds. Association of gene expression with phenotypic data by differential analysis, linear correlation, and assembly of gene networks connected both previously characterized and novel genes with 23 C. albicans traits. Construction of de novo gene modules produced a gene atlas incorporating 67% of C. albicans genes and revealed correlations between expression modules and important phenotypes such as systemic virulence. Furthermore, targeted investigation of two modules that have novel roles in growth and filamentation supported our bioinformatic predictions. Together, these studies reveal widespread transcriptional variation across C. albicans isolates and identify genetic and epigenetic links to phenotypic variation based on coexpression network analysis.
- Published
- 2021
8. Predictive modeling of morbidity and mortality in COVID-19 hospitalized patients and its clinical implications
- Author
-
Michael P. McRae, Joshua M. Wang, John T. McDevitt, Xiaoshan Chen, Wenke Liu, and David Fenyö
- Subjects
medicine.medical_specialty ,Vital signs ,MEDLINE ,coronavirus ,severity ,morbidity ,Disease ,Logistic regression ,Procalcitonin ,decision making ,Medicine ,hospital ,marker ,Original Paper ,model ,business.industry ,SARS-CoV-2 ,COVID-19 ,Retrospective cohort study ,prediction ,Triage ,mortality ,symptom ,Blood pressure ,machine learning ,Emergency medicine ,outcome ,New York City ,business ,predictive modeling - Abstract
Objective Retrospective study of COVID-19 positive patients treated at NYU Langone Health (NYULH) to identify clinical markers predictive of disease severity to assist in clinical decision triage and provide additional biological insights into disease progression. Materials and Methods Clinical activity of 3740 de-identified patients at NYULH between January and August 2020. Models were trained on clinical data during different parts of their hospital stay to predict three clinical outcomes: deceased, ventilated, or admitted to ICU. Results XGBoost model trained on clinical data from the final 24 hours excelled at predicting mortality (AUC=0.92, specificity=86% and sensitivity=85%). Respiration rate was the most important feature, followed by SpO2 and age 75+. Performance of this model to predict the deceased outcome extended 5 days prior with AUC=0.81, specificity=70%, sensitivity=75%. When only using clinical data from the first 24 hours, AUCs of 0.79, 0.80, and 0.77 were obtained for deceased, ventilated, or ICU admitted, respectively. Although respiration rate and SpO2 levels offered the highest feature importance, other canonical markers including diabetic history, age and temperature offered minimal gain. When lab values were incorporated, prediction of mortality benefited the most from blood urea nitrogen (BUN) and lactate dehydrogenase (LDH). Features predictive of morbidity included LDH, calcium, glucose, and C-reactive protein (CRP). Conclusion Together this work summarizes efforts to systematically examine the importance of a wide range of features across different endpoint outcomes and at different hospitalization time points. BACKGROUND AND SIGNIFICANCE The first cluster of SARS-CoV-2 was reported in Wuhan, Hubei Province on December 31, 2019. Inciting symptoms remarkably similar to pneumonia, the disease quickly traveled around the world, earning its pandemic status by the World Health Organization on March 11, 2020. Although the first wave has since passed for hardest-hit regions such as New York City (NYC) and most of Asia, a resurgence of cases has already been reported in Europe and record new cases tallied in the Midwest and rural United States (US). As of November 12th, the US alone logged its highest tally to date with a 317% growth over the preceding 30 days1. The coronavirus disease (COVID-19) is far from seeing the end of its days and there remains a compelling need to prioritize care and resources for patients at elevated risk of morbidity and mortality. Previous work building machine learning models used patient data from Tongji Hospital2,3 (Wuhan, China), Zhongnan Hospital4 (Wuhan China), Mount Sinai Hospital5 (NYC, US), and NYU Family Health Center6 (NYC, US). Surprisingly, clinical features selected varied widely across studies. For example, while McRae et al.’s 2-tiered model6 trained on 701 NYC patients to predict mortality was based on actual age, C-reactive protein (CRP), procalcitonin, and D-dimer, Yan et al.’s model2 trained on 485 patients from Wuhan selected lactate dehydrogenase (LDH), lymphocyte count, and CRP as the most predictive for mortality. Variations in selected features differed greatly even when trained to predict similar outcomes on data from patients of the same city. Yao et al.’s model3 was trained on 137 patients from Wuhan and relied on 28 biomarkers in their final model to predict morbidity. Given the differences among prior models, some of which were driven by domain-specific knowledge, we decided to systematically examine the importance of a wide range of features across different endpoint outcomes and at different hospitalization time points. This study analyzes retrospective PCR-confirmed COVID-19 inpatient data collected at NYU Langone Hospital spanning 1/1/2020 to 8/7/2020 to predict three sets of clinical outcomes: alive vs deceased, ventilated vs not ventilated, or ICU admitted vs not ICU admitted. The clinical information of 3740 patient encounters included demographic data (age, sex, insurance, past diagnosis of diabetes, presence of cardiovascular comorbidities), vital signs (SpO2, pulse, respiration rate, temperature, blood pressure), and the 50 most frequently ordered lab tests in our dataset. Models were developed using two methods: logistic regression with feature selection using Least Absolute Shrinkage and Selection Operator7 (LASSO) and gradient tree boosting with XGBoost8. An explainable algorithm, such as logistic regression, provides easy to interpret insights into the features of importance. Conversely, the larger model capacity of XGBoost better handles data complexities to explore the extent that predictive performance can be optimized. Together, these methods ensure a holistic survey that explores the clinical underpinnings of disease etiology and the prospects of building models that are sufficiently competent to be effective decision support tools.
- Published
- 2020
9. Integrated Proteogenomic Characterization across Major Histological Types of Pediatric Brain Cancer
- Author
-
Matthew E. Monroe, Saravana M. Dhanasekaran, Brian R. Rood, Zeynep H. Gümüş, Jena Lilly, Samuel G. Winebrake, Richard G. Ivey, William Bocik, Mahdi Sarmady, Alicia Francis, Lamiya Tauhid, Nathan Edwards, Lizabeth Katsnelson, Rui Zhao, Matilda Broberg, Jo Lynne Rokita, Mateusz Koptyra, Henry Rodriguez, Cassie Kline, Shrabanti Chowdhury, Nicole Tignor, Ying Wang, Christopher R. Kinsinger, Antonio Colaprico, Amanda G. Paulovich, Weiping Ma, Emily S. Boja, Tara Hiltke, Sabine Mueller, Liang-Bo Wang, Javad Nazarian, Marcin J. Domagalski, Karl K. Weitz, Jessica B. Foster, Robert Lober, Carina A. Leonard, Bo Zhang, Gerald A. Grant, Anna Calinawan, Gonzalo Lopez, Shuang Cai, Joanna J. Phillips, Guo Ci Teo, July E. Palma, Felipe da Veiga Leprevost, Yiran Guo, Angela Waanders, Xiaoyu Song, Li Ding, Allison Heath, Steven P. Gygi, Rosalie K. Chu, Vasileios Stathias, Bailey Farrow, Oren J. Becher, Dmitry Rykunov, Nithin D. Adappa, Ron Firestein, Adam C. Resnick, Marcin Cieślik, Jennifer Mason, D. R. Mani, Selim Kalayci, Boris Reva, Antonio Iavarone, MacIntosh Cornwell, Uliana J. Voytovich, Gabrielle S. Stone, Miguel A. Brown, Jacob J. Kennedy, Tao Liu, Ronald J. Moore, Emily Kawaler, Eric H. Raabe, Marina A. Gritsenko, Valerie Baubet, Francesca Petralia, Maciej Wiznerowicz, Olena Morozova Vaske, Eric E. Schadt, Ian F. Pollack, Arul M. Chinnaiyan, Meghan Connors, Jason E. Cain, Lei Zhao, Matthew A. Wyczalkowski, Nalin Gupta, Bing Zhang, Jiayi Ji, Marilyn M. Li, Samuel Rivero-Hinojosa, Mariarita Santi, Wenke Liu, John Szpyt, Brian Ennis, Alexey I. Nesvizhskii, Joshua M. Wang, Jeffrey P. Greenfield, Sanjukta Guha Thakurta, Hui Yin Chang, Peter B. McGarvey, Xi Chen, Karen A. Ketchum, Stephan C. Schürer, Sarah Leary, Lili Blumenberg, Matthew J. Ellis, Pei Wang, Anna Maria Buccoliero, Karsten Krug, Chiara Caporalini, Gad Getz, David E. Kram, Pichai Raman, Eric M. Jackson, James N. Palmer, Mehdi Mesri, Kelly V. Ruggles, Chunde Li, Jun Zhu, Sonia Partap, Jeffrey R. Whiteaker, Mirko Scagnet, Krutika S. Gaonkar, Azra Krek, Allison M. Morgan, Tatiana Omelchenko, Richard D. Smith, Elizabeth Appert, Karin D. Rodland, Derek Hanson, Phillip B. Storm, Jamie Moon, Vladislav A. Petyuk, Nathan Young, Travis D. Lorentzen, David Fenyö, Angela N. Viaene, Seungyeul Yoo, Yuankun Zhu, Nicholas A Vitanza, Toan Le, Tatiana Patton, and Ana I. Robles
- Subjects
DNA Copy Number Variations ,Computational biology ,Biology ,Proteomics ,Article ,General Biochemistry, Genetics and Molecular Biology ,Ganglioglioma ,03 medical and health sciences ,Lymphocytes, Tumor-Infiltrating ,0302 clinical medicine ,Glioma ,medicine ,Humans ,Gene Regulatory Networks ,RNA, Messenger ,Copy-number variation ,Phosphorylation ,Child ,Proteogenomics ,030304 developmental biology ,Medulloblastoma ,0303 health sciences ,Brain Neoplasms ,Genome, Human ,Phosphoproteomics ,Phosphoproteins ,medicine.disease ,Gene Expression Regulation, Neoplastic ,Mutation ,Atypical teratoid rhabdoid tumor ,Neoplasm Grading ,Neoplasm Recurrence, Local ,Transcriptome ,030217 neurology & neurosurgery - Abstract
We report a comprehensive proteogenomics analysis, including whole-genome sequencing, RNA sequencing, and proteomics and phosphoproteomics profiling, of 218 tumors across 7 histological types of childhood brain cancer: low-grade glioma (n = 93), ependymoma (32), high-grade glioma (25), medulloblastoma (22), ganglioglioma (18), craniopharyngioma (16), and atypical teratoid rhabdoid tumor (12). Proteomics data identify common biological themes that span histological boundaries, suggesting that treatments used for one histological type may be applied effectively to other tumors sharing similar proteomics features. Immune landscape characterization reveals diverse tumor microenvironments across and within diagnoses. Proteomics data further reveal functional effects of somatic mutations and copy number variations (CNVs) not evident in transcriptomics data. Kinase-substrate association and co-expression network analysis identify important biological mechanisms of tumorigenesis. This is the first large-scale proteogenomics analysis across traditional histological boundaries to uncover foundational pediatric brain tumor biology and inform rational treatment selection.
- Published
- 2020
10. The Genome of the Human Pathogen Candida albicans Is Shaped by Mutation and Cryptic Sexual Recombination
- Author
-
Richard J. Bennett, Matthew Z. Anderson, and Joshua M. Wang
- Subjects
0301 basic medicine ,030106 microbiology ,Genomics ,Single-nucleotide polymorphism ,medicine.disease_cause ,Genetic recombination ,Genome ,Microbiology ,Parasexual cycle ,Loss of heterozygosity ,03 medical and health sciences ,Virology ,evolution ,medicine ,parasex ,Indel ,Candida albicans ,030304 developmental biology ,Candida ,Genetics ,0303 health sciences ,Mutation ,biology ,030306 microbiology ,Chromosome ,biology.organism_classification ,Corpus albicans ,recombination ,QR1-502 ,030104 developmental biology ,loss of heterozygosity - Abstract
The opportunistic fungal pathogenCandida albicanslacks a conventional sexual program and is thought to evolve, at least primarily, through the clonal acquisition of genetic changes. Here, we performed an analysis of heterozygous diploid genomes from 21 clinical isolates to determine the natural evolutionary processes acting on theC. albicansgenome. Consistent with a model of inheritance by descent, most single nucleotide polymorphisms (SNPs) were shared between closely related strains. However, strain-specific SNPs and insertions/deletions (indels) were distributed non-randomly across the genome. For example, base substitution rates were higher in the immediate vicinity of indels, and heterozygous regions of the genome contained significantly more strain-specific polymorphisms than homozygous regions. Loss of heterozygosity (LOH) events also contributed substantially to genotypic variation, with most long-tract LOH events extending to the ends of the chromosomes suggestive of repair via break-induced replication. Importantly, some isolates contained highly mosaic genomes and failed to cluster closely with other isolates within their assigned clades. Mosaicism is consistent with strains having experienced inter-clade recombination during their evolutionary history and a detailed examination of nuclear and mitochondrial genomes revealed striking examples of recombination. Together, our analyses reveal that both (para)sexual recombination and mitotic mutational processes drive evolution of this important pathogen in nature. To further facilitate the study of genome differences we also introduce an online platform, SNPMap, to examine SNP patterns in sequencedC. albicansgenomes.AUTHOR SUMMARYMutations introduce variation into the genome upon which selection can act. Defining the nature of these changes is critical for determining species evolution, as well as for understanding the genetic changes driving important cellular processes such as carcinogenesis. The fungusCandida albicansis a heterozygous diploid species that is both a frequent commensal organism and a prevalent opportunistic pathogen. Prevailing theory is thatC. albicansevolves primarily through the gradual build-up of mutations, and a pressing question is whether sexual or parasexual processes also operate within natural populations. Here, we determine the evolutionary patterns of genetic change that have accompanied species evolution in nature by examining genomic differences between clinical isolates. We establish that theC. albicansgenome evolves by a combination of base-substitution mutations, insertions/deletion events, and both short-tract and long-tract loss of heterozygosity (LOH) events. These mutations are unevenly distributed across the genome, and reveal that non-coding regions and heterozygous regions are evolving more quickly than coding regions and homozygous regions, respectively. Furthermore, we provide evidence that genetic exchange has occurred between isolates, establishing that sexual or parasexual processes have transpired inC. albicanspopulations and contribute to the diversity of both nuclear and mitochondrial genomes.
- Published
- 2018
11. Genetic and phenotypic intra-species variation in Candida albicans
- Author
-
Aaron M. Berlin, Matthew Z. Anderson, Diego Martinez, Christina A. Cuomo, Judith Berman, Qiandong Zeng, Matthew P. Hirakawa, Joshua M. Greenberg, Richard J. Bennett, Joshua M. Wang, Ethan Zisson, Sharvari Gujja, and Sharadha Sakthikumar
- Subjects
DNA Copy Number Variations ,Genotype ,medicine.disease_cause ,Polymorphism, Single Nucleotide ,Evolution, Molecular ,Loss of heterozygosity ,Candida albicans ,Genetic variation ,Genetics ,medicine ,Humans ,Copy-number variation ,Selection, Genetic ,Phylogeny ,Genetics (clinical) ,Chromosome 7 (human) ,Mutation ,biology ,Research ,Candidiasis ,Genetic Variation ,Sequence Analysis, DNA ,Aneuploidy ,biology.organism_classification ,Corpus albicans ,Phenotype ,Chromosome 4 ,Chromosomes, Fungal ,Genome, Fungal - Abstract
Candida albicans is a commensal fungus of the human gastrointestinal tract and a prevalent opportunistic pathogen. To examine diversity within this species, extensive genomic and phenotypic analyses were performed on 21 clinical C. albicans isolates. Genomic variation was evident in the form of polymorphisms, copy number variations, chromosomal inversions, subtelomeric hypervariation, loss of heterozygosity (LOH), and whole or partial chromosome aneuploidies. All 21 strains were diploid, although karyotypic changes were present in eight of the 21 isolates, with multiple strains being trisomic for Chromosome 4 or Chromosome 7. Aneuploid strains exhibited a general fitness defect relative to euploid strains when grown under replete conditions. All strains were also heterozygous, yet multiple, distinct LOH tracts were present in each isolate. Higher overall levels of genome heterozygosity correlated with faster growth rates, consistent with increased overall fitness. Genes with the highest rates of amino acid substitutions included many cell wall proteins, implicating fast evolving changes in cell adhesion and host interactions. One clinical isolate, P94015, presented several striking properties including a novel cellular phenotype, an inability to filament, drug resistance, and decreased virulence. Several of these properties were shown to be due to a homozygous nonsense mutation in the EFG1 gene. Furthermore, loss of EFG1 function resulted in increased fitness of P94015 in a commensal model of infection. Our analysis therefore reveals intra-species genetic and phenotypic differences in C. albicans and delineates a natural mutation that alters the balance between commensalism and pathogenicity.
- Published
- 2014
12. Hemizygosity Enables a Mutational Transition Governing Fungal Virulence and Commensalism
- Author
-
Matthew Z. Anderson, Shen-Huan Liang, Richard J. Bennett, Matthew P. Hirakawa, Corey Frazer, Gregory J. Thomson, Leenah M. Alaalm, Iuliana V. Ene, and Joshua M. Wang
- Subjects
Cellular differentiation ,Gene Dosage ,medicine.disease_cause ,Microbiology ,Article ,Fungal Proteins ,03 medical and health sciences ,0302 clinical medicine ,Gene Expression Regulation, Fungal ,Virology ,Candida albicans ,Genetic model ,medicine ,Humans ,Gene conversion ,Allele ,Symbiosis ,Gene ,030304 developmental biology ,Genetics ,0303 health sciences ,Mutation ,Phenotypic plasticity ,Virulence ,biology ,Candidiasis ,biology.organism_classification ,DNA-Binding Proteins ,Gastrointestinal Tract ,Parasitology ,030217 neurology & neurosurgery ,Transcription Factors - Abstract
Summary Candida albicans is a commensal fungus of human gastrointestinal and reproductive tracts, but also causes life-threatening systemic infections. The balance between colonization and pathogenesis is associated with phenotypic plasticity, with alternative cell states producing different outcomes in a mammalian host. Here, we reveal that gene dosage of a master transcription factor regulates cell differentiation in diploid C. albicans cells, as EFG1 hemizygous cells undergo a phenotypic transition inaccessible to “wild-type” cells with two functional EFG1 alleles. Notably, clinical isolates are often EFG1 hemizygous and thus licensed to undergo this transition. Phenotypic change corresponds to high-frequency loss of the functional EFG1 allele via de novo mutation or gene conversion events. This phenomenon also occurs during passaging in the gastrointestinal tract with the resulting cell type being hypercompetitive for commensal and systemic infections. A “two-hit” genetic model therefore underlies a key phenotypic transition in C. albicans that enables adaptation to host niches.
- Published
- 2019
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.