826 results on '"Dudley, Joel T."'
Search Results
2. Validation of a Transcriptome-Based Assay for Classifying Cancers of Unknown Primary Origin
- Author
-
Michuda, Jackson, Breschi, Alessandra, Kapilivsky, Joshuah, Manghnani, Kabir, McCarter, Calvin, Hockenberry, Adam J., Mineo, Brittany, Igartua, Catherine, Dudley, Joel T., Stumpe, Martin C., Beaubier, Nike, Shirazi, Maryam, Jones, Ryan, Morency, Elizabeth, Blackwell, Kim, Guinney, Justin, Beauchamp, Kyle A., and Taxter, Timothy
- Published
- 2023
- Full Text
- View/download PDF
3. Multiplex qPCR Discriminates Variants of Concern to Enhance Global Surveillance of SARS-CoV-2
- Author
-
Vogels, Chantal B. F., primary, Breban, Mallery I., additional, Ott, Isabel M., additional, Alpert, Tara, additional, Petrone, Mary E., additional, Watkins, Anne E., additional, Kalinich, Chaney C., additional, Earnest, Rebecca, additional, Rothman, Jessica E., additional, de Jesus, Jaqueline Goes, additional, Claro, Ingra Morales, additional, Ferreira, Giulia Magalhães, additional, Crispim, Myuki A. E., additional, Singh, Lavanya, additional, Tegally, Houriiyah, additional, Anyaneji, Ugochukwu J., additional, Hodcroft, Emma B., additional, Mason, Christopher E., additional, Khullar, Gaurav, additional, Metti, Jessica, additional, Dudley, Joel T., additional, MacKay, Matthew J., additional, Nash, Megan, additional, Wang, Jianhui, additional, Liu, Chen, additional, Hui, Pei, additional, Murphy, Steven, additional, Neal, Caleb, additional, Laszlo, Eva, additional, Landry, Marie L., additional, Muyombwe, Anthony, additional, Downing, Randy, additional, Razeq, Jafar, additional, de Oliveira, Tulio, additional, Faria, Nuno R., additional, Sabino, Ester C., additional, Neher, Richard A., additional, Fauver, Joseph R., additional, and Grubaugh, Nathan D., additional
- Published
- 2023
- Full Text
- View/download PDF
4. Deep Representation Learning of Electronic Health Records to Unlock Patient Stratification at Scale
- Author
-
Landi, Isotta, Glicksberg, Benjamin S., Lee, Hao-Chih, Cherng, Sarah, Landi, Giulia, Danieletto, Matteo, Dudley, Joel T., Furlanello, Cesare, and Miotto, Riccardo
- Subjects
Quantitative Biology - Quantitative Methods ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Deriving disease subtypes from electronic health records (EHRs) can guide next-generation personalized medicine. However, challenges in summarizing and representing patient data prevent widespread practice of scalable EHR-based stratification analysis. Here we present an unsupervised framework based on deep learning to process heterogeneous EHRs and derive patient representations that can efficiently and effectively enable patient stratification at scale. We considered EHRs of 1,608,741 patients from a diverse hospital cohort comprising of a total of 57,464 clinical concepts. We introduce a representation learning model based on word embeddings, convolutional neural networks, and autoencoders (i.e., ConvAE) to transform patient trajectories into low-dimensional latent vectors. We evaluated these representations as broadly enabling patient stratification by applying hierarchical clustering to different multi-disease and disease-specific patient cohorts. ConvAE significantly outperformed several baselines in a clustering task to identify patients with different complex conditions, with 2.61 entropy and 0.31 purity average scores. When applied to stratify patients within a certain condition, ConvAE led to various clinically relevant subtypes for different disorders, including type 2 diabetes, Parkinson's disease and Alzheimer's disease, largely related to comorbidities, disease progression, and symptom severity. With these results, we demonstrate that ConvAE can generate patient representations that lead to clinically meaningful insights. This scalable framework can help better understand varying etiologies in heterogeneous sub-populations and unlock patterns for EHR-based research in the realm of personalized medicine., Comment: C.F. and R.M. share senior authorship
- Published
- 2020
- Full Text
- View/download PDF
5. Scaling structural learning with NO-BEARS to infer causal transcriptome networks
- Author
-
Lee, Hao-Chih, Danieletto, Matteo, Miotto, Riccardo, Cherng, Sarah T., and Dudley, Joel T.
- Subjects
Quantitative Biology - Genomics ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Constructing gene regulatory networks is a critical step in revealing disease mechanisms from transcriptomic data. In this work, we present NO-BEARS, a novel algorithm for estimating gene regulatory networks. The NO-BEARS algorithm is built on the basis of the NOTEARS algorithm with two improvements. First, we propose a new constraint and its fast approximation to reduce the computational cost of the NO-TEARS algorithm. Next, we introduce a polynomial regression loss to handle non-linearity in gene expressions. Our implementation utilizes modern GPU computation that can decrease the time of hours-long CPU computation to seconds. Using synthetic data, we demonstrate improved performance, both in processing time and accuracy, on inferring gene regulatory networks from gene expression data., Comment: Preprint of an article submitted for consideration in Pacific Symposium on Biocomputing copyright 2019 World Scientific Publishing Co., Singapore, http://psb.stanford.edu/http://psb.stanford.edu/
- Published
- 2019
6. Enhancing high-content imaging for studying microtubule networks at large-scale
- Author
-
Lee, Hao-Chih, Cherng, Sarah T, Miotto, Riccardo, and Dudley, Joel T
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Given the crucial role of microtubules for cell survival, many researchers have found success using microtubule-targeting agents in the search for effective cancer therapeutics. Understanding microtubule responses to targeted interventions requires that the microtubule network within cells can be consistently observed across a large sample of images. However, fluorescence noise sources captured simultaneously with biological signals while using wide-field microscopes can obfuscate fine microtubule structures. Such requirements are particularly challenging for high-throughput imaging, where researchers must make decisions related to the trade-off between imaging quality and speed. Here, we propose a computational framework to enhance the quality of high-throughput imaging data to achieve fast speed and high quality simultaneously. Using CycleGAN, we learn an image model from low-throughput, high-resolution images to enhance features, such as microtubule networks in high-throughput low-resolution images. We show that CycleGAN is effective in identifying microtubules with 0.93+ AUC-ROC and that these results are robust to different kinds of image noise. We further apply CycleGAN to quantify the changes in microtubule density as a result of the application of drug compounds, and show that the quantified responses correspond well with known drug effects, Comment: accepted and presented in Machine Learning for Healthcare 2019
- Published
- 2019
7. Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review
- Author
-
Sheikhalishahi, Seyedmostafa, Miotto, Riccardo, Dudley, Joel T, Lavelli, Alberto, Rinaldi, Fabio, and Osmani, Venet
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Information Retrieval - Abstract
Of the 2652 articles considered, 106 met the inclusion criteria. Review of the included papers resulted in identification of 43 chronic diseases, which were then further classified into 10 disease categories using ICD-10. The majority of studies focused on diseases of the circulatory system (n=38) while endocrine and metabolic diseases were fewest (n=14). This was due to the structure of clinical records related to metabolic diseases, which typically contain much more structured data, compared with medical records for diseases of the circulatory system, which focus more on unstructured data and consequently have seen a stronger focus of NLP. The review has shown that there is a significant increase in the use of machine learning methods compared to rule-based approaches; however, deep learning methods remain emergent (n=3). Consequently, the majority of works focus on classification of disease phenotype with only a handful of papers addressing extraction of comorbidities from the free text or integration of clinical notes with structured data. There is a notable use of relatively simple methods, such as shallow classifiers (or combination with rule-based methods), due to the interpretability of predictions, which still represents a significant issue for more complex methods. Finally, scarcity of publicly available data may also have contributed to insufficient development of more advanced methods, such as extraction of word embeddings from clinical notes. Further efforts are still required to improve (1) progression of clinical NLP methods from extraction toward understanding; (2) recognition of relations among entities rather than entities in isolation; (3) temporal extraction to understand past, current, and future clinical events; (4) exploitation of alternative sources of clinical knowledge; and (5) availability of large-scale, de-identified clinical corpora., Comment: Supplementary material detailing articles reviewed, classification of diseases and associated algorithms, can be found at: http://venetosmani.com/research/publications.html
- Published
- 2019
- Full Text
- View/download PDF
8. Deep Learning Predicts Hip Fracture using Confounding Patient and Healthcare Variables
- Author
-
Badgeley, Marcus A., Zech, John R., Oakden-Rayner, Luke, Glicksberg, Benjamin S., Liu, Manway, Gale, William, McConnell, Michael V., Percha, Beth, Snyder, Thomas M., and Dudley, Joel T.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs. Computer-Aided Diagnosis (CAD) algorithms have shown promise for helping radiologists detect fractures, but the image features underpinning their predictions are notoriously difficult to understand. In this study, we trained deep learning models on 17,587 radiographs to classify fracture, five patient traits, and 14 hospital process variables. All 20 variables could be predicted from a radiograph (p < 0.05), with the best performances on scanner model (AUC=1.00), scanner brand (AUC=0.98), and whether the order was marked "priority" (AUC=0.79). Fracture was predicted moderately well from the image (AUC=0.78) and better when combining image features with patient data (AUC=0.86, p=2e-9) or patient data plus hospital process features (AUC=0.91, p=1e-21). The model performance on a test set with matched patient variables was significantly lower than a random test set (AUC=0.67, p=0.003); and when the test set was matched on patient and image acquisition variables, the model performed randomly (AUC=0.52, 95% CI 0.46-0.58), indicating that these variables were the main source of the model's predictive ability overall. We also used Naive Bayes to combine evidence from image models with patient and hospital data and found their inclusion improved performance, but that this approach was nevertheless inferior to directly modeling all variables. If CAD algorithms are inexplicably leveraging patient and process variables in their predictions, it is unclear how radiologists should interpret their predictions in the context of other known patient data. Further research is needed to illuminate deep learning decision processes so that computers and clinicians can effectively cooperate.
- Published
- 2018
9. PatientExploreR: an extensible application for dynamic visualization of patient clinical history from electronic health records in the OMOP common data model
- Author
-
Glicksberg, Benjamin S, Oskotsky, Boris, Thangaraj, Phyllis M, Giangreco, Nicholas, Badgeley, Marcus A, Johnson, Kipp W, Datta, Debajyoti, Rudrapatna, Vivek A, Rappoport, Nadav, Shervey, Mark M, Miotto, Riccardo, Goldstein, Theodore C, Rutenberg, Eugenia, Frazier, Remi, Lee, Nelson, Israni, Sharat, Larsen, Rick, Percha, Bethany, Li, Li, Dudley, Joel T, Tatonetti, Nicholas P, and Butte, Atul J
- Subjects
Information and Computing Sciences ,Information Systems ,Networking and Information Technology R&D (NITRD) ,Patient Safety ,Clinical Research ,Bioengineering ,Generic health relevance ,Good Health and Well Being ,Computers ,Databases ,Factual ,Electronic Health Records ,Humans ,Observational Studies as Topic ,Software ,Mathematical Sciences ,Biological Sciences ,Bioinformatics ,Biological sciences ,Information and computing sciences ,Mathematical sciences - Abstract
MotivationElectronic health records (EHRs) are quickly becoming omnipresent in healthcare, but interoperability issues and technical demands limit their use for biomedical and clinical research. Interactive and flexible software that interfaces directly with EHR data structured around a common data model (CDM) could accelerate more EHR-based research by making the data more accessible to researchers who lack computational expertise and/or domain knowledge.ResultsWe present PatientExploreR, an extensible application built on the R/Shiny framework that interfaces with a relational database of EHR data in the Observational Medical Outcomes Partnership CDM format. PatientExploreR produces patient-level interactive and dynamic reports and facilitates visualization of clinical data without any programming required. It allows researchers to easily construct and export patient cohorts from the EHR for analysis with other software. This application could enable easier exploration of patient-level data for physicians and researchers. PatientExploreR can incorporate EHR data from any institution that employs the CDM for users with approved access. The software code is free and open source under the MIT license, enabling institutions to install and users to expand and modify the application for their own purposes.Availability and implementationPatientExploreR can be freely obtained from GitHub: https://github.com/BenGlicksberg/PatientExploreR. We provide instructions for how researchers with approved access to their institutional EHR can use this package. We also release an open sandbox server of synthesized patient data for users without EHR access to explore: http://patientexplorer.ucsf.edu.Supplementary informationSupplementary data are available at Bioinformatics online.
- Published
- 2019
10. MetaCyto: A Tool for Automated Meta-analysis of Mass and Flow Cytometry Data
- Author
-
Hu, Zicheng, Jujjavarapu, Chethan, Hughey, Jacob J, Andorf, Sandra, Lee, Hao-Chih, Gherardini, Pier Federico, Spitzer, Matthew H, Thomas, Cristel G, Campbell, John, Dunn, Patrick, Wiser, Jeff, Kidd, Brian A, Dudley, Joel T, Nolan, Garry P, Bhattacharya, Sanchita, and Butte, Atul J
- Subjects
Biological Sciences ,Genetics ,Bioengineering ,Generic health relevance ,Adult ,Datasets as Topic ,Flow Cytometry ,Humans ,Meta-Analysis as Topic ,Software ,CyTOF ,flow cytometry ,human immunology ,immune cells ,meta-analysis ,Biochemistry and Cell Biology ,Medical Physiology ,Biological sciences - Abstract
While meta-analysis has demonstrated increased statistical power and more robust estimations in studies, the application of this commonly accepted methodology to cytometry data has been challenging. Different cytometry studies often involve diverse sets of markers. Moreover, the detected values of the same marker are inconsistent between studies due to different experimental designs and cytometer configurations. As a result, the cell subsets identified by existing auto-gating methods cannot be directly compared across studies. We developed MetaCyto for automated meta-analysis of both flow and mass cytometry (CyTOF) data. By combining clustering methods with a silhouette scanning method, MetaCyto is able to identify commonly labeled cell subsets across studies, thus enabling meta-analysis. Applying MetaCyto across a set of ten heterogeneous cytometry studies totaling 2,926 samples enabled us to identify multiple cell populations exhibiting differences in abundance between demographic groups. Software is released to the public through Bioconductor (http://bioconductor.org/packages/release/bioc/html/MetaCyto.html).
- Published
- 2018
11. Phe2vec: Automated disease phenotyping based on unsupervised embeddings from electronic health records
- Author
-
De Freitas, Jessica K., Johnson, Kipp W., Golden, Eddye, Nadkarni, Girish N., Dudley, Joel T., Bottinger, Erwin P., Glicksberg, Benjamin S., and Miotto, Riccardo
- Published
- 2021
- Full Text
- View/download PDF
12. The Evolution of Mining Electronic Health Records in the Era of Deep Learning
- Author
-
Landi, Isotta, primary, De Freitas, Jessica, additional, Kidd, Brian A., additional, Dudley, Joel T., additional, Glicksberg, Benjamin S., additional, and Miotto, Riccardo, additional
- Published
- 2022
- Full Text
- View/download PDF
13. A Multiomics, Molecular Atlas of Breast Cancer Survivors.
- Author
-
Bauer, Brent A., Schmidt, Caleb M., Ruddy, Kathryn J., Olson, Janet E., Meydan, Cem, Schmidt, Julian C., Smith, Sheena Y., Couch, Fergus J., Earls, John C., Price, Nathan D., Dudley, Joel T., Mason, Christopher E., Zhang, Bodi, Phipps, Stephen M., and Schmidt, Michael A.
- Subjects
OMEGA-3 fatty acids ,BREAST cancer ,FATTY acids ,MULTIOMICS ,CANCER survivors - Abstract
Breast cancer imposes a significant burden globally. While the survival rate is steadily improving, much remains to be elucidated. This observational, single time point, multiomic study utilizing genomics, proteomics, targeted and untargeted metabolomics, and metagenomics in a breast cancer survivor (BCS) and age-matched healthy control cohort (N = 100) provides deep molecular phenotyping of breast cancer survivors. In this study, the BCS cohort had significantly higher polygenic risk scores for breast cancer than the control group. Carnitine and hexanoyl carnitine were significantly different. Several bile acid and fatty acid metabolites were significantly dissimilar, most notably the Omega-3 Index (O3I) (significantly lower in BCS). Proteomic and metagenomic analyses identified group and pathway differences, which warrant further investigation. The database built from this study contributes a wealth of data on breast cancer survivorship where there has been a paucity, affording the ability to identify patterns and novel insights that can drive new hypotheses and inform future research. Expansion of this database in the treatment-naïve, newly diagnosed, controlling for treatment confounders, and through the disease progression, can be leveraged to profile and contextualize breast cancer and breast cancer survivorship, potentially leading to the development of new strategies to combat this disease and improve the quality of life for its victims. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Toward clinical genomics in everyday medicine: perspectives and recommendations
- Author
-
Delaney, Susan K, Hultner, Michael L, Jacob, Howard J, Ledbetter, David H, McCarthy, Jeanette J, Ball, Michael, Beckman, Kenneth B, Belmont, John W, Bloss, Cinnamon S, Christman, Michael F, Cosgrove, Andy, Damiani, Stephen A, Danis, Timothy, Delledonne, Massimo, Dougherty, Michael J, Dudley, Joel T, Faucett, W Andrew, Friedman, Jennifer R, Haase, David H, Hays, Tom S, Heilsberg, Stu, Huber, Jeff, Kaminsky, Leah, Ledbetter, Nikki, Lee, Warren H, Levin, Elissa, Libiger, Ondrej, Linderman, Michael, Love, Richard L, Magnus, David C, Martland, AnneMarie, McClure, Susan L, Megill, Scott E, Messier, Helen, Nussbaum, Robert L, Palaniappan, Latha, Patay, Bradley A, Popovich, Bradley W, Quackenbush, John, Savant, Mark J, Su, Michael M, Terry, Sharon F, Tucker, Steven, Wong, William T, and Green, Robert C
- Subjects
Genetics ,Biotechnology ,Clinical Research ,Cancer ,Human Genome ,Generic health relevance ,Good Health and Well Being ,Delivery of Health Care ,Genetic Testing ,Genome ,Human ,Genomics ,High-Throughput Nucleotide Sequencing ,Humans ,Precision Medicine ,Reagent Kits ,Diagnostic ,Personalized medicine ,precision medicine ,clinical genomics ,practice standards ,genomic data ,exome ,genome ,sequencing ,genetic testing ,Clinical Sciences - Abstract
Precision or personalized medicine through clinical genome and exome sequencing has been described by some as a revolution that could transform healthcare delivery, yet it is currently used in only a small fraction of patients, principally for the diagnosis of suspected Mendelian conditions and for targeting cancer treatments. Given the burden of illness in our society, it is of interest to ask how clinical genome and exome sequencing can be constructively integrated more broadly into the routine practice of medicine for the betterment of public health. In November 2014, 46 experts from academia, industry, policy and patient advocacy gathered in a conference sponsored by Illumina, Inc. to discuss this question, share viewpoints and propose recommendations. This perspective summarizes that work and identifies some of the obstacles and opportunities that must be considered in translating advances in genomics more widely into the practice of medicine.
- Published
- 2016
15. Polygenic overlap between schizophrenia risk and antipsychotic response: a genomic medicine approach
- Author
-
Ruderfer, Douglas M, Charney, Alexander W, Readhead, Ben, Kidd, Brian A, Kähler, Anna K, Kenny, Paul J, Keiser, Michael J, Moran, Jennifer L, Hultman, Christina M, Scott, Stuart A, Sullivan, Patrick F, Purcell, Shaun M, Dudley, Joel T, and Sklar, Pamela
- Subjects
Biological Psychology ,Pharmacology and Pharmaceutical Sciences ,Biomedical and Clinical Sciences ,Psychology ,Schizophrenia ,Mental Health ,Genetics ,Prevention ,Biotechnology ,Serious Mental Illness ,Clinical Research ,Brain Disorders ,Human Genome ,5.1 Pharmaceuticals ,2.1 Biological and endogenous factors ,Evaluation of treatments and therapeutic interventions ,6.1 Pharmaceuticals ,Aetiology ,Development of treatments and therapeutic interventions ,Mental health ,Good Health and Well Being ,Antipsychotic Agents ,Case-Control Studies ,Clozapine ,Genetic Predisposition to Disease ,Genome-Wide Association Study ,Genomics ,Humans ,Multifactorial Inheritance ,Registries ,Sweden ,Treatment Outcome ,Clinical Sciences ,Public Health and Health Services ,Clinical sciences ,Applied and developmental psychology ,Clinical and health psychology - Abstract
BackgroundTherapeutic treatments for schizophrenia do not alleviate symptoms for all patients and efficacy is limited by common, often severe, side-effects. Genetic studies of disease can identify novel drug targets, and drugs for which the mechanism has direct genetic support have increased likelihood of clinical success. Large-scale genetic studies of schizophrenia have increased the number of genes and gene sets associated with risk. We aimed to examine the overlap between schizophrenia risk loci and gene targets of a comprehensive set of medications to potentially inform and improve treatment of schizophrenia.MethodsWe defined schizophrenia risk loci as genomic regions reaching genome-wide significance in the latest Psychiatric Genomics Consortium schizophrenia genome-wide association study (GWAS) of 36 989 cases and 113 075 controls and loss of function variants observed only once among 5079 individuals in an exome-sequencing study of 2536 schizophrenia cases and 2543 controls (Swedish Schizophrenia Study). Using two large and orthogonally created databases, we collated drug targets into 167 gene sets targeted by pharmacologically similar drugs and examined enrichment of schizophrenia risk loci in these sets. We further linked the exome-sequenced data with a national drug registry (the Swedish Prescribed Drug Register) to assess the contribution of rare variants to treatment response, using clozapine prescription as a proxy for treatment resistance.FindingsWe combined results from testing rare and common variation and, after correction for multiple testing, two gene sets were associated with schizophrenia risk: agents against amoebiasis and other protozoal diseases (106 genes, p=0·00046, pcorrected =0·024) and antipsychotics (347 genes, p=0·00078, pcorrected=0·046). Further analysis pointed to antipsychotics as having independent enrichment after removing genes that overlapped these two target sets. We noted significant enrichment both in known targets of antipsychotics (70 genes, p=0·0078) and novel predicted targets (277 genes, p=0·019). Patients with treatment-resistant schizophrenia had an excess of rare disruptive variants in gene targets of antipsychotics (347 genes, p=0·0067) and in genes with evidence for a role in antipsychotic efficacy (91 genes, p=0·0029).InterpretationOur results support genetic overlap between schizophrenia pathogenesis and antipsychotic mechanism of action. This finding is consistent with treatment efficacy being polygenic and suggests that single-target therapeutics might be insufficient. We provide evidence of a role for rare functional variants in antipsychotic treatment response, pointing to a subset of patients where their genetic information could inform treatment. Finally, we present a novel framework for identifying treatments from genetic data and improving our understanding of therapeutic mechanism.FundingUS National Institutes of Health.
- Published
- 2016
16. miR155 regulation of behavior, neuropathology, and cortical transcriptomics in Alzheimer's disease
- Author
-
Readhead, Ben, Haure-Mirande, Jean-Vianney, Mastroeni, Diego, Audrain, Mickael, Fanutza, Tomas, Kim, Soong H., Blitzer, Robert D., Gandy, Sam, Dudley, Joel T., and Ehrlich, Michelle E.
- Published
- 2020
- Full Text
- View/download PDF
17. Genomic and Network Patterns of Schizophrenia Genetic Variation in Human Evolutionary Accelerated Regions
- Author
-
Xu, Ke, Schadt, Eric E, Pollard, Katherine S, Roussos, Panos, and Dudley, Joel T
- Subjects
Biological Sciences ,Genetics ,Mental Health ,Schizophrenia ,Human Genome ,Serious Mental Illness ,Brain Disorders ,2.1 Biological and endogenous factors ,Aetiology ,Mental health ,Animals ,Evolution ,Molecular ,Gene Expression Regulation ,Gene Regulatory Networks ,Genome-Wide Association Study ,Genomics ,Humans ,Prefrontal Cortex ,RNA ,Untranslated ,schizophrenia ,human accelerated evolution ,networks ,GWAS ,Biochemistry and Cell Biology ,Evolutionary Biology ,Biochemistry and cell biology ,Evolutionary biology - Abstract
The population persistence of schizophrenia despite associated reductions in fitness and fecundity suggests that the genetic basis of schizophrenia has a complex evolutionary history. A recent meta-analysis of schizophrenia genome-wide association studies offers novel opportunities for assessment of the evolutionary trajectories of schizophrenia-associated loci. In this study, we hypothesize that components of the genetic architecture of schizophrenia are attributable to human lineage-specific evolution. Our results suggest that schizophrenia-associated loci enrich in genes near previously identified human accelerated regions (HARs). Specifically, we find that genes near HARs conserved in nonhuman primates (pHARs) are enriched for schizophrenia-associated loci, and that pHAR-associated schizophrenia genes are under stronger selective pressure than other schizophrenia genes and other pHAR-associated genes. We further evaluate pHAR-associated schizophrenia genes in regulatory network contexts to investigate associated molecular functions and mechanisms. We find that pHAR-associated schizophrenia genes significantly enrich in a GABA-related coexpression module that was previously found to be differentially regulated in schizophrenia affected individuals versus healthy controls. In another two independent networks constructed from gene expression profiles from prefrontal cortex samples, we find that pHAR-associated schizophrenia genes are located in more central positions and their average path lengths to the other nodes are significantly shorter than those of other schizophrenia genes. Together, our results suggest that HARs are associated with potentially important functional roles in the genetic architecture of schizophrenia.
- Published
- 2015
18. Repositioning of a novel GABA-B receptor agonist, AZD3355 (Lesogaberan), for the treatment of non-alcoholic steatohepatitis
- Author
-
Bhattacharya, Dipankar, Becker, Christine, Readhead, Benjamin, Goossens, Nicolas, Novik, Jacqueline, Fiel, Maria Isabel, Cousens, Leslie P., Magnusson, Björn, Backmark, Anna, Hicks, Ryan, Dudley, Joel T., and Friedman, Scott L.
- Published
- 2021
- Full Text
- View/download PDF
19. Transcriptomic Network Interactions in Human Skin Treated with Topical Glucocorticoid Clobetasol Propionate
- Author
-
Lili, Loukia N., Klopot, Anna, Readhead, Benjamin, Baida, Gleb, Dudley, Joel T., and Budunova, Irina
- Published
- 2019
- Full Text
- View/download PDF
20. A transcriptomic model to predict increase in fibrous cap thickness in response to high-dose statin treatment: Validation by serial intracoronary OCT imaging
- Author
-
Johnson, Kipp W., Glicksberg, Benjamin S., Shameer, Khader, Vengrenyuk, Yuliya, Krittanawong, Chayakrit, Russak, Adam J., Sharma, Samin K., Narula, Jagat N., Dudley, Joel T., and Kini, Annapoorna S.
- Published
- 2019
- Full Text
- View/download PDF
21. Critical period plasticity-related transcriptional aberrations in schizophrenia and bipolar disorder
- Author
-
Smith, Milo R., Readhead, Ben, Dudley, Joel T., and Morishita, Hirofumi
- Published
- 2019
- Full Text
- View/download PDF
22. PERSONALIZED MEDICINE: FROM GENOTYPES, MOLECULAR PHENOTYPES AND THE QUANTIFIED SELF, TOWARDS IMPROVED MEDICINE
- Author
-
Altman, Russ B, Dunker, A Keith, Hunter, Lawrence, Ritchie, Marylyn D, Murray, Tiffany A, Klein, Teri E, DUDLEY, JOEL T, LISTGARTEN, JENNIFER, STEGLE, OLIVER, BRENNER, STEVEN E, and PARTS, LEOPOLD
- Subjects
Clinical Research ,Bioengineering ,Genetics ,Networking and Information Technology R&D (NITRD) ,Generic health relevance ,Good Health and Well Being ,Computational Biology ,Genotype ,Humans ,Patient-Specific Modeling ,Phenotype ,Precision Medicine ,Systems Biology - Abstract
Advances in molecular profiling and sensor technologies are expanding the scope of personalized medicine beyond genotypes, providing new opportunities for developing richer and more dynamic multi-scale models of individual health. Recent studies demonstrate the value of scoring high-dimensional microbiome, immune, and metabolic traits from individuals to inform personalized medicine. Efforts to integrate multiple dimensions of clinical and molecular data towards predictive multi-scale models of individual health and wellness are already underway. Improved methods for mining and discovery of clinical phenotypes from electronic medical records and technological developments in wearable sensor technologies present new opportunities for mapping and exploring the critical yet poorly characterized "phenome" and "envirome" dimensions of personalized medicine. There are ambitious new projects underway to collect multi-scale molecular, sensor, clinical, behavioral, and environmental data streams from large population cohorts longitudinally to enable more comprehensive and dynamic models of individual biology and personalized health. Personalized medicine stands to benefit from inclusion of rich new sources and dimensions of data. However, realizing these improvements in care relies upon novel informatics methodologies, tools, and systems to make full use of these data to advance both the science and translational applications of personalized medicine.
- Published
- 2014
23. Disease Risk Factors Identified Through Shared Genetic Architecture and Electronic Medical Records
- Author
-
Li, Li, Ruau, David J, Patel, Chirag J, Weber, Susan C, Chen, Rong, Tatonetti, Nicholas P, Dudley, Joel T, and Butte, Atul J
- Subjects
Medical Biotechnology ,Engineering ,Biomedical and Clinical Sciences ,Biomedical Engineering ,Genetics ,Human Genome ,Hematology ,Cancer ,Biotechnology ,Pediatric Research Initiative ,Prevention ,Rare Diseases ,Clinical Research ,2.1 Biological and endogenous factors ,Aetiology ,Good Health and Well Being ,Electronic Health Records ,Genome-Wide Association Study ,Humans ,Models ,Biological ,Risk Factors ,Biological Sciences ,Medical and Health Sciences ,Medical biotechnology ,Biomedical engineering - Abstract
Genome-wide association studies have identified genetic variants for thousands of diseases and traits. We evaluated the relationships between specific risk factors (for example, blood cholesterol level) and diseases on the basis of their shared genetic architecture in a comprehensive human disease-single-nucleotide polymorphism association database (VARIMED), analyzing the findings from 8962 published association studies. Similarity between traits and diseases was statistically evaluated on the basis of their association with shared gene variants. We identified 120 disease-trait pairs that were statistically similar, and of these, we tested and validated five previously unknown disease-trait associations by searching electronic medical records (EMRs) from three independent medical centers for evidence of the trait appearing in patients within 1 year of first diagnosis of the disease. We validated that the mean corpuscular volume is elevated before diagnosis of acute lymphoblastic leukemia; both have associated variants in the gene IKZF1. Platelet count is decreased before diagnosis of alcohol dependence; both are associated with variants in the gene C12orf51. Alkaline phosphatase level is elevated in patients with venous thromboembolism; both share variants in ABO. Similarly, we found that prostate-specific antigen and serum magnesium levels were altered before the diagnosis of lung cancer and gastric cancer, respectively. Disease-trait associations identify traits that could serve as future prognostics, if validated through EMR and subsequent prospective trials.
- Published
- 2014
24. Automated detection of off-label drug use.
- Author
-
Jung, Kenneth, LePendu, Paea, Chen, William S, Iyer, Srinivasan V, Readhead, Ben, Dudley, Joel T, and Shah, Nigam H
- Subjects
Algorithms ,Models ,Theoretical ,Databases ,Factual ,Pattern Recognition ,Automated ,Off-Label Use ,Data Mining ,Databases ,Factual ,Models ,Theoretical ,Pattern Recognition ,Automated ,General Science & Technology - Abstract
Off-label drug use, defined as use of a drug in a manner that deviates from its approved use defined by the drug's FDA label, is problematic because such uses have not been evaluated for safety and efficacy. Studies estimate that 21% of prescriptions are off-label, and only 27% of those have evidence of safety and efficacy. We describe a data-mining approach for systematically identifying off-label usages using features derived from free text clinical notes and features extracted from two databases on known usage (Medi-Span and DrugBank). We trained a highly accurate predictive model that detects novel off-label uses among 1,602 unique drugs and 1,472 unique indications. We validated 403 predicted uses across independent data sources. Finally, we prioritize well-supported novel usages for further investigation on the basis of drug safety and cost.
- Published
- 2014
25. A public resource of single cell transcriptomes and multiscale networks from persons with and without Alzheimer’s disease
- Author
-
Wang, Qi, primary, Antone, Jerry, additional, Alsop, Eric, additional, Reiman, Rebecca, additional, Funk, Cory, additional, Bendl, Jaroslav, additional, Dudley, Joel T, additional, Liang, Winnie S, additional, Karr, Timothy L, additional, Roussos, Panos, additional, Bennett, David A, additional, De Jager, Philip L, additional, Serrano, Geidy E, additional, Beach, Thomas G, additional, Keuren-Jensen, Kendall Van, additional, Mastroeni, Diego, additional, Reiman, Eric M, additional, and Readhead, Benjamin P, additional
- Published
- 2023
- Full Text
- View/download PDF
26. Rapamycin Modulates Glucocorticoid Receptor Function, Blocks Atrophogene REDD1, and Protects Skin from Steroid Atrophy
- Author
-
Lesovaya, Ekaterina, Agarwal, Shivani, Readhead, Ben, Vinokour, Elena, Baida, Gleb, Bhalla, Pankaj, Kirsanov, Kirill, Yakubovskaya, Marianna, Platanias, Leonidas C., Dudley, Joel T., and Budunova, Irina
- Published
- 2018
- Full Text
- View/download PDF
27. Isolation and Identification of the Follicular Microbiome: Implications for Acne Research
- Author
-
Hall, Jacob B., Cong, Zhaoyuan, Imamura-Kawasawa, Yuka, Kidd, Brian A., Dudley, Joel T., Thiboutot, Diane M., and Nelson, Amanda M.
- Published
- 2018
- Full Text
- View/download PDF
28. Artificial Intelligence in Cardiology
- Author
-
Johnson, Kipp W., Torres Soto, Jessica, Glicksberg, Benjamin S., Shameer, Khader, Miotto, Riccardo, Ali, Mohsin, Ashley, Euan, and Dudley, Joel T.
- Published
- 2018
- Full Text
- View/download PDF
29. A Drug Repositioning Approach Identifies Tricyclic Antidepressants as Inhibitors of Small Cell Lung Cancer and Other Neuroendocrine Tumors
- Author
-
Jahchan, Nadine S, Dudley, Joel T, Mazur, Pawel K, Flores, Natasha, Yang, Dian, Palmerton, Alec, Zmoos, Anne-Flore, Vaka, Dedeepya, Tran, Kim QT, Zhou, Margaret, Krasinska, Karolina, Riess, Jonathan W, Neal, Joel W, Khatri, Purvesh, Park, Kwon S, Butte, Atul J, and Sage, Julien
- Subjects
Biomedical and Clinical Sciences ,Oncology and Carcinogenesis ,Cancer ,Rare Diseases ,Orphan Drug ,Neurosciences ,Lung Cancer ,Lung ,5.1 Pharmaceuticals ,Development of treatments and therapeutic interventions ,Animals ,Antidepressive Agents ,Tricyclic ,Antineoplastic Agents ,Apoptosis ,Cell Line ,Tumor ,Computational Biology ,Drug Repositioning ,Humans ,Lung Neoplasms ,Mice ,Neuroendocrine Tumors ,Receptors ,G-Protein-Coupled ,Small Cell Lung Carcinoma ,Xenograft Model Antitumor Assays ,Biochemistry and cell biology ,Oncology and carcinogenesis - Abstract
UnlabelledSmall cell lung cancer (SCLC) is an aggressive neuroendocrine subtype of lung cancer with high mortality. We used a systematic drug repositioning bioinformatics approach querying a large compendium of gene expression profiles to identify candidate U.S. Food and Drug Administration (FDA)-approved drugs to treat SCLC. We found that tricyclic antidepressants and related molecules potently induce apoptosis in both chemonaïve and chemoresistant SCLC cells in culture, in mouse and human SCLC tumors transplanted into immunocompromised mice, and in endogenous tumors from a mouse model for human SCLC. The candidate drugs activate stress pathways and induce cell death in SCLC cells, at least in part by disrupting autocrine survival signals involving neurotransmitters and their G protein-coupled receptors. The candidate drugs inhibit the growth of other neuroendocrine tumors, including pancreatic neuroendocrine tumors and Merkel cell carcinoma. These experiments identify novel targeted strategies that can be rapidly evaluated in patients with neuroendocrine tumors through the repurposing of approved drugs.SignificanceOur work shows the power of bioinformatics-based drug approaches to rapidly repurpose FDA-approved drugs and identifies a novel class of molecules to treat patients with SCLC, a cancer for which no effective novel systemic treatments have been identified in several decades. In addition, our experiments highlight the importance of novel autocrine mechanisms in promoting the growth of neuroendocrine tumor cells.
- Published
- 2013
30. Systematic functional regulatory assessment of disease-associated variants.
- Author
-
Karczewski, Konrad J, Dudley, Joel T, Kukurba, Kimberly R, Chen, Rong, Butte, Atul J, Montgomery, Stephen B, and Snyder, Michael
- Subjects
Humans ,Genetic Diseases ,Inborn ,NF-kappa B ,Transcription Factors ,Gene Expression Profiling ,Computational Biology ,Systems Biology ,Protein Binding ,Polymorphism ,Single Nucleotide ,Genetic Variation ,Genome-Wide Association Study ,regulatory genomics ,systems biology ,translational bioinformatics ,Arthritis ,Lung ,Human Genome ,Atherosclerosis ,Biotechnology ,Asthma ,Genetics ,2.1 Biological and endogenous factors ,Aetiology ,Inflammatory and immune system ,Generic health relevance ,Cardiovascular - Abstract
Genome-wide association studies have discovered many genetic loci associated with disease traits, but the functional molecular basis of these associations is often unresolved. Genome-wide regulatory and gene expression profiles measured across individuals and diseases reflect downstream effects of genetic variation and may allow for functional assessment of disease-associated loci. Here, we present a unique approach for systematic integration of genetic disease associations, transcription factor binding among individuals, and gene expression data to assess the functional consequences of variants associated with hundreds of human diseases. In an analysis of genome-wide binding profiles of NFκB, we find that disease-associated SNPs are enriched in NFκB binding regions overall, and specifically for inflammatory-mediated diseases, such as asthma, rheumatoid arthritis, and coronary artery disease. Using genome-wide variation in transcription factor-binding data, we find that NFκB binding is often correlated with disease-associated variants in a genotype-specific and allele-specific manner. Furthermore, we show that this binding variation is often related to expression of nearby genes, which are also found to have altered expression in independent profiling of the variant-associated disease condition. Thus, using this integrative approach, we provide a unique means to assign putative function to many disease-associated SNPs.
- Published
- 2013
31. Integrative approach to sporadic Alzheimer’s disease: deficiency of TYROBP in cerebral Aβ amyloidosis mouse normalizes clinical phenotype and complement subnetwork molecular pathology without reducing Aβ burden
- Author
-
Haure-Mirande, Jean-Vianney, Wang, Minghui, Audrain, Mickael, Fanutza, Tomas, Kim, Soong Ho, Heja, Szilvia, Readhead, Ben, Dudley, Joel T., Blitzer, Robert D., Schadt, Eric E., Zhang, Bin, Gandy, Sam, and Ehrlich, Michelle E.
- Published
- 2019
- Full Text
- View/download PDF
32. Human genomic disease variants: A neutral evolutionary explanation
- Author
-
Dudley, Joel T, Kim, Yuseob, Liu, Li, Markov, Glenn J, Gerold, Kristyn, Chen, Rong, Butte, Atul J, and Kumar, Sudhir
- Subjects
Biological Sciences ,Evolutionary Biology ,Genetics ,Human Genome ,Generic health relevance ,Good Health and Well Being ,Adaptation ,Biological ,Alleles ,Evolution ,Molecular ,Gene Frequency ,Genetic Diseases ,Inborn ,Genetic Predisposition to Disease ,Genetic Variation ,Genetics ,Population ,Genome ,Human ,Humans ,Molecular Medicine ,Selection ,Genetic ,Medical and Health Sciences ,Bioinformatics - Abstract
Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease.
- Published
- 2012
33. Identification of Cell Surface Targets through Meta-analysis of Microarray Data
- Author
-
Haeberle, Henry, Dudley, Joel T, Liu, Jonathan TC, Butte, Atul J, and Contag, Christopher H
- Subjects
Biomedical and Clinical Sciences ,Oncology and Carcinogenesis ,Rare Diseases ,Genetics ,Brain Disorders ,Orphan Drug ,Neurosciences ,Pediatric ,Pediatric Cancer ,Brain Cancer ,Cancer ,1.1 Normal biological development and functioning ,Underpinning research ,Biomarkers ,Tumor ,Cerebellar Neoplasms ,Computational Biology ,Gene Expression Profiling ,Gene Expression Regulation ,Neoplastic ,Humans ,Medulloblastoma ,Oligonucleotide Array Sequence Analysis ,Clinical Sciences ,Oncology & Carcinogenesis ,Clinical sciences ,Oncology and carcinogenesis - Abstract
High-resolution image guidance for resection of residual tumor cells would enable more precise and complete excision for more effective treatment of cancers, such as medulloblastoma, the most common pediatric brain cancer. Numerous studies have shown that brain tumor patient outcomes correlate with the precision of resection. To enable guided resection with molecular specificity and cellular resolution, molecular probes that effectively delineate brain tumor boundaries are essential. Therefore, we developed a bioinformatics approach to analyze micro-array datasets for the identification of transcripts that encode candidate cell surface biomarkers that are highly enriched in medulloblastoma. The results identified 380 genes with greater than a two-fold increase in the expression in the medulloblastoma compared with that in the normal cerebellum. To enrich for targets with accessibility for extracellular molecular probes, we further refined this list by filtering it with gene ontology to identify genes with protein localization on, or within, the plasma membrane. To validate this meta-analysis, the top 10 candidates were evaluated with immunohistochemistry. We identified two targets, fibrillin 2 and EphA3, which specifically stain medulloblastoma. These results demonstrate a novel bioinformatics approach that successfully identified cell surface and extracellular candidate markers enriched in medulloblastoma versus adjacent cerebellum. These two proteins are high-value targets for the development of tumor-specific probes in medulloblastoma. This bioinformatics method has broad utility for the identification of accessible molecular targets in a variety of cancers and will enable probe development for guided resection.
- Published
- 2012
34. Type 2 diabetes risk alleles demonstrate extreme directional differentiation among human populations, compared to other diseases.
- Author
-
Chen, Rong, Corona, Erik, Sikora, Martin, Dudley, Joel T, Morgan, Alex A, Moreno-Estrada, Andres, Nilsen, Geoffrey B, Ruau, David, Lincoln, Stephen E, Bustamante, Carlos D, and Butte, Atul J
- Subjects
Humans ,Diabetes Mellitus ,Type 2 ,Genetic Predisposition to Disease ,Risk Factors ,Genetics ,Population ,Gene Frequency ,Haplotypes ,Linkage Disequilibrium ,Polymorphism ,Single Nucleotide ,Genome ,Human ,African Continental Ancestry Group ,Asian Continental Ancestry Group ,European Continental Ancestry Group ,Genome-Wide Association Study ,HapMap Project ,Prevention ,Human Genome ,Diabetes ,Obesity ,Genetics ,Nutrition ,Clinical Research ,2.1 Biological and endogenous factors ,Metabolic and endocrine ,Developmental Biology - Abstract
Many disease-susceptible SNPs exhibit significant disparity in ancestral and derived allele frequencies across worldwide populations. While previous studies have examined population differentiation of alleles at specific SNPs, global ethnic patterns of ensembles of disease risk alleles across human diseases are unexamined. To examine these patterns, we manually curated ethnic disease association data from 5,065 papers on human genetic studies representing 1,495 diseases, recording the precise risk alleles and their measured population frequencies and estimated effect sizes. We systematically compared the population frequencies of cross-ethnic risk alleles for each disease across 1,397 individuals from 11 HapMap populations, 1,064 individuals from 53 HGDP populations, and 49 individuals with whole-genome sequences from 10 populations. Type 2 diabetes (T2D) demonstrated extreme directional differentiation of risk allele frequencies across human populations, compared with null distributions of European-frequency matched control genomic alleles and risk alleles for other diseases. Most T2D risk alleles share a consistent pattern of decreasing frequencies along human migration into East Asia. Furthermore, we show that these patterns contribute to disparities in predicted genetic risk across 1,397 HapMap individuals, T2D genetic risk being consistently higher for individuals in the African populations and lower in the Asian populations, irrespective of the ethnicity considered in the initial discovery of risk alleles. We observed a similar pattern in the distribution of T2D Genetic Risk Scores, which are associated with an increased risk of developing diabetes in the Diabetes Prevention Program cohort, for the same individuals. This disparity may be attributable to the promotion of energy storage and usage appropriate to environments and inconsistent energy intake. Our results indicate that the differential frequencies of T2D risk alleles may contribute to the observed disparity in T2D incidence rates across ethnic populations.
- Published
- 2012
35. Computational prediction and experimental validation associating FABP-1 and pancreatic adenocarcinoma with diabetes
- Author
-
Sharaf, Ravi N, Butte, Atul J, Montgomery, Kelli D, Pai, Reetesh, Dudley, Joel T, and Pasricha, Pankaj J
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Pancreatic Cancer ,Clinical Research ,Prevention ,Biotechnology ,Diabetes ,Rare Diseases ,Cancer ,Digestive Diseases ,Detection ,screening and diagnosis ,4.1 Discovery and preclinical testing of markers and technologies ,Metabolic and endocrine ,Adenocarcinoma ,Adult ,Aged ,Computational Biology ,Diabetes Complications ,Diabetes Mellitus ,Fatty Acid-Binding Proteins ,Female ,Genomics ,Humans ,Immunohistochemistry ,Logistic Models ,Male ,Middle Aged ,Pancreatic Neoplasms ,Pilot Projects ,Predictive Value of Tests ,Protein Array Analysis ,Public Health and Health Services ,Gastroenterology & Hepatology ,Clinical sciences - Abstract
BackgroundPancreatic cancer, composed principally of pancreatic adenocarcinoma (PaC), is the fourth leading cause of cancer death in the United States. PaC-associated diabetes may be a marker of early disease. We sought to identify molecules associated with PaC and PaC with diabetes (PaC-DM) using a novel translational bioinformatics approach. We identified fatty acid binding protein-1 (FABP-1) as one of several candidates. The primary aim of this pilot study was to experimentally validate the predicted association between FABP-1 with PaC and PaC with diabetes.MethodsWe searched public microarray measurements for genes that were specifically highly expressed in PaC. We then filtered for proteins with known involvement in diabetes. Validation of FABP-1 was performed via antibody immunohistochemistry on formalin-fixed paraffin embedded pancreatic tissue microarrays (FFPE TMA). FFPE TMA were constructed using 148 cores of pancreatic tissue from 134 patients collected between 1995 and 2002 from patients who underwent pancreatic surgery. Primary analysis was performed on 21 normal and 60 pancreatic adenocarcinoma samples, stratified for diabetes. Clinical data on samples was obtained via retrospective chart review. Serial sections were cut per standard protocol. Antibody staining was graded by an experienced pathologist on a scale of 0-3. Bivariate and multivariate analyses were conducted to assess FABP-1 staining and clinical characteristics.ResultsNormal samples were significantly more likely to come from younger patients. PaC samples were significantly more likely to stain for FABP-1, when FABP-1 staining was considered a binary variable. Compared to normals, there was significantly increased staining in diabetic PaC samples (p = 0.004) and there was a trend towards increased staining in the non-diabetic PaC group (p = 0.07). In logistic regression modeling, FABP-1 staining was significantly associated with diagnosis of PaC (OR 8.6 95% CI 1.1-68, p = 0.04), though age was a confounder.ConclusionsCompared to normal controls, there was a significant positive association between FABP-1 staining and PaC on FFPE-TMA, strengthened by the presence of diabetes. Further studies with closely phenotyped patient samples are required to understand the true relationship between FABP-1, PaC and PaC-associated diabetes. A translational bioinformatics approach has potential to identify novel disease associations and potential biomarkers in gastroenterology.
- Published
- 2011
36. Phased whole-genome genetic risk in a family quartet using a major allele reference sequence.
- Author
-
Dewey, Frederick E, Chen, Rong, Cordero, Sergio P, Ormond, Kelly E, Caleshu, Colleen, Karczewski, Konrad J, Whirl-Carrillo, Michelle, Wheeler, Matthew T, Dudley, Joel T, Byrnes, Jake K, Cornejo, Omar E, Knowles, Joshua W, Woon, Mark, Sangkuhl, Katrin, Gong, Li, Thorn, Caroline F, Hebert, Joan M, Capriotti, Emidio, David, Sean P, Pavlovic, Aleksandra, West, Anne, Thakuria, Joseph V, Ball, Madeleine P, Zaranek, Alexander W, Rehm, Heidi L, Church, George M, West, John S, Bustamante, Carlos D, Snyder, Michael, Altman, Russ B, Klein, Teri E, Butte, Atul J, and Ashley, Euan A
- Subjects
Humans ,Thrombophilia ,Genetic Predisposition to Disease ,Risk Assessment ,Pedigree ,Sequence Alignment ,Sequence Analysis ,DNA ,DNA Mutational Analysis ,Base Sequence ,Genotype ,Haplotypes ,Alleles ,Genes ,Synthetic ,Genome ,Human ,Reference Standards ,Female ,Male ,Genetic Variation ,Genome-Wide Association Study ,Biotechnology ,Genetics ,Human Genome ,2.1 Biological and endogenous factors ,Generic health relevance ,Developmental Biology - Abstract
Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (< 1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.
- Published
- 2011
37. Identifying Acute Low Back Pain Episodes in Primary Care Practice From Clinical Notes: Observational Study
- Author
-
Miotto, Riccardo, Percha, Bethany L, Glicksberg, Benjamin S, Lee, Hao-Chih, Cruz, Lisanne, Dudley, Joel T, and Nabeel, Ismail
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
BackgroundAcute and chronic low back pain (LBP) are different conditions with different treatments. However, they are coded in electronic health records with the same International Classification of Diseases, 10th revision (ICD-10) code (M54.5) and can be differentiated only by retrospective chart reviews. This prevents an efficient definition of data-driven guidelines for billing and therapy recommendations, such as return-to-work options. ObjectiveThe objective of this study was to evaluate the feasibility of automatically distinguishing acute LBP episodes by analyzing free-text clinical notes. MethodsWe used a dataset of 17,409 clinical notes from different primary care practices; of these, 891 documents were manually annotated as acute LBP and 2973 were generally associated with LBP via the recorded ICD-10 code. We compared different supervised and unsupervised strategies for automated identification: keyword search, topic modeling, logistic regression with bag of n-grams and manual features, and deep learning (a convolutional neural network-based architecture [ConvNet]). We trained the supervised models using either manual annotations or ICD-10 codes as positive labels. ResultsConvNet trained using manual annotations obtained the best results with an area under the receiver operating characteristic curve of 0.98 and an F score of 0.70. ConvNet’s results were also robust to reduction of the number of manually annotated documents. In the absence of manual annotations, topic models performed better than methods trained using ICD-10 codes, which were unsatisfactory for identifying LBP acuity. ConclusionsThis study uses clinical notes to delineate a potential path toward systematic learning of therapeutic strategies, billing guidelines, and management options for acute LBP at the point of care.
- Published
- 2020
- Full Text
- View/download PDF
38. Content-based microarray search using differential expression profiles.
- Author
-
Engreitz, Jesse M, Morgan, Alexander A, Dudley, Joel T, Chen, Rong, Thathoo, Rahul, Altman, Russ B, and Butte, Atul J
- Subjects
Animals ,Humans ,Mice ,Lung Neoplasms ,Oligonucleotide Array Sequence Analysis ,Gene Expression Profiling ,Computational Biology ,Databases ,Genetic ,Embryonic Stem Cells ,Neural Stem Cells ,Databases ,Genetic ,Mathematical Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundWith the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations.ResultsWe develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3.ConclusionsContent-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.
- Published
- 2010
39. An integrative method for scoring candidate genes from association studies: application to warfarin dosing.
- Author
-
Tatonetti, Nicholas P, Dudley, Joel T, Sagreiya, Hersh, Butte, Atul J, and Altman, Russ B
- Subjects
Warfarin ,Cytochrome P-450 Enzyme System ,Mixed Function Oxygenases ,Anticoagulants ,Logistic Models ,Pharmacogenetics ,Dose-Response Relationship ,Drug ,Gene Frequency ,Genotype ,Polymorphism ,Single Nucleotide ,Genome-Wide Association Study ,Vitamin K Epoxide Reductases ,Dose-Response Relationship ,Drug ,Polymorphism ,Single Nucleotide ,Mathematical Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundA key challenge in pharmacogenomics is the identification of genes whose variants contribute to drug response phenotypes, which can include severe adverse effects. Pharmacogenomics GWAS attempt to elucidate genotypes predictive of drug response. However, the size of these studies has severely limited their power and potential application. We propose a novel knowledge integration and SNP aggregation approach for identifying genes impacting drug response. Our SNP aggregation method characterizes the degree to which uncommon alleles of a gene are associated with drug response. We first use pre-existing knowledge sources to rank pharmacogenes by their likelihood to affect drug response. We then define a summary score for each gene based on allele frequencies and train linear and logistic regression classifiers to predict drug response phenotypes.ResultsWe applied our method to a published warfarin GWAS data set comprising 181 individuals. We find that our method can increase the power of the GWAS to identify both VKORC1 and CYP2C9 as warfarin pharmacogenes, where the original analysis had only identified VKORC1. Additionally, we find that our method can be used to discriminate between low-dose (AUROC=0.886) and high-dose (AUROC=0.764) responders.ConclusionsOur method offers a new route for candidate pharmacogene discovery from pharmacogenomics GWAS, and serves as a foundation for future work in methods for predictive pharmacogenomics.
- Published
- 2010
40. Latent physiological factors of complex human diseases revealed by independent component analysis of clinarrays.
- Author
-
Chen, David P, Dudley, Joel T, and Butte, Atul J
- Subjects
Humans ,Cystic Fibrosis ,Down Syndrome ,Diabetes Mellitus ,Type 2 ,Clinical Laboratory Techniques ,Biomarkers ,Diabetes Mellitus ,Type 2 ,Mathematical Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundDiagnosis and treatment of patients in the clinical setting is often driven by known symptomatic factors that distinguish one particular condition from another. Treatment based on noticeable symptoms, however, is limited to the types of clinical biomarkers collected, and is prone to overlooking dysfunctions in physiological factors not easily evident to medical practitioners. We used a vector-based representation of patient clinical biomarkers, or clinarrays, to search for latent physiological factors that underlie human diseases directly from clinical laboratory data. Knowledge of these factors could be used to improve assessment of disease severity and help to refine strategies for diagnosis and monitoring disease progression.ResultsApplying Independent Component Analysis on clinarrays built from patient laboratory measurements revealed both known and novel concomitant physiological factors for asthma, types 1 and 2 diabetes, cystic fibrosis, and Duchenne muscular dystrophy. Serum sodium was found to be the most significant factor for both type 1 and type 2 diabetes, and was also significant in asthma. TSH3, a measure of thyroid function, and blood urea nitrogen, indicative of kidney function, were factors unique to type 1 diabetes respective to type 2 diabetes. Platelet count was significant across all the diseases analyzed.ConclusionsThe results demonstrate that large-scale analyses of clinical biomarkers using unsupervised methods can offer novel insights into the pathophysiological basis of human disease, and suggest novel clinical utility of established laboratory measurements.
- Published
- 2010
41. Differentially expressed RNA from public microarray data identifies serum protein biomarkers for cross-organ transplant rejection and other conditions.
- Author
-
Chen, Rong, Sigdel, Tara K, Li, Li, Kambham, Neeraja, Dudley, Joel T, Hsieh, Szu-Chuan, Klassen, R Bryan, Chen, Amery, Caohuu, Tuyen, Morgan, Alexander A, Valantine, Hannah A, Khush, Kiran K, Sarwal, Minnie M, and Butte, Atul J
- Subjects
Kidney Glomerulus ,Humans ,Proteinuria ,Blood Proteins ,RNA ,Enzyme-Linked Immunosorbent Assay ,Kidney Transplantation ,Heart Transplantation ,Oligonucleotide Array Sequence Analysis ,Histocytochemistry ,Reproducibility of Results ,ROC Curve ,Gene Expression Profiling ,Computational Biology ,Graft Rejection ,Databases ,Genetic ,Data Mining ,Biomarkers ,Databases ,Genetic ,Mathematical Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Serum proteins are routinely used to diagnose diseases, but are hard to find due to low sensitivity in screening the serum proteome. Public repositories of microarray data, such as the Gene Expression Omnibus (GEO), contain RNA expression profiles for more than 16,000 biological conditions, covering more than 30% of United States mortality. We hypothesized that genes coding for serum- and urine-detectable proteins, and showing differential expression of RNA in disease-damaged tissues would make ideal diagnostic protein biomarkers for those diseases. We showed that predicted protein biomarkers are significantly enriched for known diagnostic protein biomarkers in 22 diseases, with enrichment significantly higher in diseases for which at least three datasets are available. We then used this strategy to search for new biomarkers indicating acute rejection (AR) across different types of transplanted solid organs. We integrated three biopsy-based microarray studies of AR from pediatric renal, adult renal and adult cardiac transplantation and identified 45 genes upregulated in all three. From this set, we chose 10 proteins for serum ELISA assays in 39 renal transplant patients, and discovered three that were significantly higher in AR. Interestingly, all three proteins were also significantly higher during AR in the 63 cardiac transplant recipients studied. Our best marker, serum PECAM1, identified renal AR with 89% sensitivity and 75% specificity, and also showed increased expression in AR by immunohistochemistry in renal, hepatic and cardiac transplant biopsies. Our results demonstrate that integrating gene expression microarray measurements from disease samples and even publicly-available data sets can be a powerful, fast, and cost-effective strategy for the discovery of new diagnostic serum protein biomarkers.
- Published
- 2010
42. Translational bioinformatics in the cloud: an affordable alternative.
- Author
-
Dudley, Joel T, Pouliot, Yannick, Chen, Rong, Morgan, Alexander A, and Butte, Atul J
- Subjects
Genetics ,Clinical Sciences - Abstract
With the continued exponential expansion of publicly available genomic data and access to low-cost, high-throughput molecular technologies for profiling patient populations, computational technologies and informatics are becoming vital considerations in genomic medicine. Although cloud computing technology is being heralded as a key enabling technology for the future of genomic research, available case studies are limited to applications in the domain of high-throughput sequence data analysis. The goal of this study was to evaluate the computational and economic characteristics of cloud computing in performing a large-scale data integration and analysis representative of research problems in genomic medicine. We find that the cloud-based analysis compares favorably in both performance and cost in comparison to a local computational cluster, suggesting that cloud computing technologies might be a viable resource for facilitating large-scale translational research in genomic medicine.
- Published
- 2010
43. Beneficial effects of physical exercise and an orally active mGluR2/3 antagonist pro-drug on neurogenesis and behavior in an Alzheimer's amyloidosis model
- Author
-
Perez Garcia, Georgina, primary, Bicak, Mesude, additional, Buros, Jacqueline, additional, Haure-Mirande, Jean-Vianney, additional, Perez, Gissel M., additional, Otero-Pagan, Alena, additional, Gama Sosa, Miguel A., additional, De Gasperi, Rita, additional, Sano, Mary, additional, Gage, Fred H., additional, Barlow, Carrolee, additional, Dudley, Joel T., additional, Glicksberg, Benjamin S., additional, Wang, Yanzhuang, additional, Readhead, Benjamin, additional, Ehrlich, Michelle E., additional, Elder, Gregory A., additional, and Gandy, Sam, additional
- Published
- 2023
- Full Text
- View/download PDF
44. Disease signatures are robust across tissues and experiments
- Author
-
Dudley, Joel T, Tibshirani, Robert, Deshpande, Tarangini, and Butte, Atul J
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Generic health relevance ,Analysis of Variance ,Biomarkers ,Databases ,Factual ,Disease ,Histocytochemistry ,Humans ,Oligonucleotide Array Sequence Analysis ,Pathology ,Systems Biology ,computational biology ,meta-analysis ,microarrays ,Biochemistry and Cell Biology ,Other Biological Sciences ,Bioinformatics ,Biochemistry and cell biology - Abstract
Meta-analyses combining gene expression microarray experiments offer new insights into the molecular pathophysiology of disease not evident from individual experiments. Although the established technical reproducibility of microarrays serves as a basis for meta-analysis, pathophysiological reproducibility across experiments is not well established. In this study, we carried out a large-scale analysis of disease-associated experiments obtained from NCBI GEO, and evaluated their concordance across a broad range of diseases and tissue types. On evaluating 429 experiments, representing 238 diseases and 122 tissues from 8435 microarrays, we find evidence for a general, pathophysiological concordance between experiments measuring the same disease condition. Furthermore, we find that the molecular signature of disease across tissues is overall more prominent than the signature of tissue expression across diseases. The results offer new insight into the quality of public microarray data using pathophysiological metrics, and support new directions in meta-analysis that include characterization of the commonalities of disease irrespective of tissue, as well as the creation of multi-tissue systems models of disease pathology using public data.
- Published
- 2009
45. Deep Learning to Predict Patient Future Diseases from the Electronic Health Records
- Author
-
Miotto, Riccardo, Li, Li, Dudley, Joel T., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Ferro, Nicola, editor, Crestani, Fabio, editor, Moens, Marie-Francine, editor, Mothe, Josiane, editor, Silvestri, Fabrizio, editor, Di Nunzio, Giorgio Maria, editor, Hauff, Claudia, editor, and Silvello, Gianmaria, editor
- Published
- 2016
- Full Text
- View/download PDF
46. Predicting age by mining electronic medical records with deep learning characterizes differences between chronological and physiological age
- Author
-
Wang, Zichen, Li, Li, Glicksberg, Benjamin S., Israel, Ariel, Dudley, Joel T., and Ma'ayan, Avi
- Published
- 2017
- Full Text
- View/download PDF
47. Enabling Precision Cardiology Through Multiscale Biology and Systems Medicine
- Author
-
Johnson, Kipp W., Shameer, Khader, Glicksberg, Benjamin S., Readhead, Ben, Sengupta, Partho P., Björkegren, Johan L.M., Kovacic, Jason C., and Dudley, Joel T.
- Published
- 2017
- Full Text
- View/download PDF
48. Integrative genomic meta-analysis reveals novel molecular insights into cystic fibrosis and ΔF508-CFTR rescue
- Author
-
Hodos, Rachel A., Strub, Matthew D., Ramachandran, Shyam, Li, Li, McCray, Jr., Paul B., and Dudley, Joel T.
- Published
- 2020
- Full Text
- View/download PDF
49. Longitudinal data in peripheral blood confirm that PM20D1 is a quantitative trait locus (QTL) for Alzheimer’s disease and implicate its dynamic role in disease progression
- Author
-
Wang, Qi, Chen, Yinghua, Readhead, Benjamin, Chen, Kewei, Su, Yi, Reiman, Eric M., and Dudley, Joel T.
- Published
- 2020
- Full Text
- View/download PDF
50. MicroRNA‐195 controls MICU1 expression and tumor growth in ovarian cancer
- Author
-
Rao, Geeta, Dwivedi, Shailendra Kumar Dhar, Zhang, Yushan, Dey, Anindya, Shameer, Khader, Karthik, Ramachandran, Srikantan, Subramanya, Hossen, Md Nazir, Wren, Jonathan D, Madesh, Muniswamy, Dudley, Joel T, Bhattacharya, Resham, and Mukherjee, Priyabrata
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.