87 results on '"Naumann, Tristan"'
Search Results
52. An Investigation into the Effects of Pre-training Data Distributions for Pathology Report Classification
- Author
-
Hsu, Aliyah R., Cherapanamjeri, Yeshwanth, Park, Briton, Naumann, Tristan, Odisho, Anobel Y., and Yu, Bin
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Computation and Language ,Artificial Intelligence (cs.AI) ,Computer Science - Artificial Intelligence ,Computation and Language (cs.CL) ,Machine Learning (cs.LG) - Abstract
Pre-trained transformer models have demonstrated success across many natural language processing (NLP) tasks. In applying these models to the clinical domain, a prevailing assumption is that pre-training language models from scratch on large-scale biomedical data results in substantial improvements. We test this assumption with 4 pathology classification tasks on a corpus of 2907 prostate cancer pathology reports. We evaluate 5 transformer pre-trained models that are the same size but differ in pre-training corpora. Specifically, we analyze 3 categories of models: 1)General-domain: BERT and Turing Natural Language Representation (TNLR) models, which use general corpora for pre-training, 2)Mixed-domain: BioBERT which is obtained from BERT by including PubMed abstracts in pre-training and Clinical BioBERT which additionally includes MIMIC-III clinical notes and 3)Domain-specific: PubMedBERT which is pre-trained from scratch on PubMed abstracts. We find the mixed-domain and domain-specific models exhibit faster feature disambiguation during fine-tuning. However, the domain-specific model, PubMedBERT, can overfit to minority classes when presented with class imbalance, a common scenario in pathology report data. At the same time, the mixed-domain models are more resistant to overfitting. Our findings indicate that the use of general natural language and domain-specific corpora in pre-training serve complementary purposes for pathology report classification. The first enables resistance to overfitting when fine-tuning on an imbalanced dataset while the second allows for more accurate modelling of the fine-tuning domain. An expert evaluation is also conducted to reveal common outlier modes of each model. Our results could inform better fine-tuning practices in the clinical domain, to possibly leverage the benefits of mixed-domain models for imbalanced downstream datasets.
- Published
- 2023
- Full Text
- View/download PDF
53. MIMIC-Extract: A Data Extraction, Preprocessing, and Representation Pipeline for MIMIC-III
- Author
-
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Wang, Shirly, McDermott, Matthew B. A., Chauhan, Geeticka, Ghassemi, Marzyeh, Hughes, Michael C., Naumann, Tristan, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Wang, Shirly, McDermott, Matthew B. A., Chauhan, Geeticka, Ghassemi, Marzyeh, Hughes, Michael C., and Naumann, Tristan
- Published
- 2022
54. Knowledge-Rich Self-Supervision for Biomedical Entity Linking
- Author
-
Zhang, Sheng, primary, Cheng, Hao, additional, Vashishth, Shikhar, additional, Wong, Cliff, additional, Xiao, Jinfeng, additional, Liu, Xiaodong, additional, Naumann, Tristan, additional, Gao, Jianfeng, additional, and Poon, Hoifung, additional
- Published
- 2022
- Full Text
- View/download PDF
55. Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
- Author
-
Gu, Yu, primary, Tinn, Robert, additional, Cheng, Hao, additional, Lucas, Michael, additional, Usuyama, Naoto, additional, Liu, Xiaodong, additional, Naumann, Tristan, additional, Gao, Jianfeng, additional, and Poon, Hoifung, additional
- Published
- 2021
- Full Text
- View/download PDF
56. Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature
- Author
-
Wang, Yu, primary, Li, Jinchao, additional, Naumann, Tristan, additional, Xiong, Chenyan, additional, Cheng, Hao, additional, Tinn, Robert, additional, Wong, Cliff, additional, Usuyama, Naoto, additional, Rogahn, Richard, additional, Shen, Zhihong, additional, Qin, Yang, additional, Horvitz, Eric, additional, Bennett, Paul N., additional, Gao, Jianfeng, additional, and Poon, Hoifung, additional
- Published
- 2021
- Full Text
- View/download PDF
57. The role of machine learning in clinical research: transforming the future of evidence generation
- Author
-
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Weissler, E. H., Naumann, Tristan, Andersson, Tomas, Ranganath, Rajesh, Elemento, Olivier, Luo, Yuan, Freitag, Daniel F., Benoit, James, Hughes, Michael C., Khan, Faisal, Slater, Paul, Shameer, Khader, Roe, Matthew, Hutchison, Emmette, Kollins, Scott H., Broedl, Uli, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Weissler, E. H., Naumann, Tristan, Andersson, Tomas, Ranganath, Rajesh, Elemento, Olivier, Luo, Yuan, Freitag, Daniel F., Benoit, James, Hughes, Michael C., Khan, Faisal, Slater, Paul, Shameer, Khader, Roe, Matthew, Hutchison, Emmette, Kollins, Scott H., and Broedl, Uli
- Abstract
Background Interest in the application of machine learning (ML) to the design, conduct, and analysis of clinical trials has grown, but the evidence base for such applications has not been surveyed. This manuscript reviews the proceedings of a multi-stakeholder conference to discuss the current and future state of ML for clinical research. Key areas of clinical trial methodology in which ML holds particular promise and priority areas for further investigation are presented alongside a narrative review of evidence supporting the use of ML across the clinical trial spectrum. Results Conference attendees included stakeholders, such as biomedical and ML researchers, representatives from the US Food and Drug Administration (FDA), artificial intelligence technology and data analytics companies, non-profit organizations, patient advocacy groups, and pharmaceutical companies. ML contributions to clinical research were highlighted in the pre-trial phase, cohort selection and participant management, and data collection and analysis. A particular focus was paid to the operational and philosophical barriers to ML in clinical research. Peer-reviewed evidence was noted to be lacking in several areas. Conclusions ML holds great promise for improving the efficiency and quality of clinical research, but substantial barriers remain, the surmounting of which will require addressing significant gaps in evidence.
- Published
- 2021
58. Semi-supervised biomedical translation with cycle Wasserstein regression GaNs
- Author
-
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, McDermott, Matthew, Yan, Tom, Naumann, Tristan, Hunt, Nathan, Suresh, Harini S., Szolovits, Peter, Ghassemi, Marzyeh, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, McDermott, Matthew, Yan, Tom, Naumann, Tristan, Hunt, Nathan, Suresh, Harini S., Szolovits, Peter, and Ghassemi, Marzyeh
- Abstract
The biomedical field offers many learning tasks that share unique challenges: large amounts of unpaired data, and a high cost to generate labels. In this work, we develop a method to address these issues with semi-supervised learning in regression tasks (e.g., translation from source to target). Our model uses adversarial signals to learn from unpaired datapoints, and imposes a cycle-loss reconstruction error penalty to regularize mappings in either direction against one another. We first evaluate our method on synthetic experiments, demonstrating two primary advantages of the system: 1) distribution matching via the adversarial loss and 2) regularization towards invertible mappings via the cycle loss. We then show a regularization effect and improved performance when paired data is supplemented by additional unpaired data on two real biomedical regression tasks: estimating the physiological effect of medical treatments, and extrapolating gene expression (transcriptomics) signals. Our proposed technique is a promising initial step towards more robust use of adversarial signals in semi-supervised regression, and could be useful for other tasks (e.g., causal inference or modality translation) in the biomedical field.
- Published
- 2020
59. Modular Self-Supervision for Document-Level Relation Extraction
- Author
-
Zhang, Sheng, primary, Wong, Cliff, additional, Usuyama, Naoto, additional, Jain, Sarthak, additional, Naumann, Tristan, additional, and Poon, Hoifung, additional
- Published
- 2021
- Full Text
- View/download PDF
60. Practical guidance on artificial intelligence for health-care data
- Author
-
Ghassemi, Marzyeh, Naumann, Tristan, Schulam, Peter, Beam, Andrew L, Chen, Irene Y, and Ranganath, Rajesh
- Published
- 2019
- Full Text
- View/download PDF
61. MIMIC-Extract
- Author
-
Wang, Shirly, primary, McDermott, Matthew B. A., additional, Chauhan, Geeticka, additional, Ghassemi, Marzyeh, additional, Hughes, Michael C., additional, and Naumann, Tristan, additional
- Published
- 2020
- Full Text
- View/download PDF
62. A multivariate timeseries modeling approach to severity of illness assessment and forecasting in ICU with sparse, heterogeneous clinical data
- Author
-
Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Harvard University--MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brennan, Thomas P, Szolovits, Peter, Feng, Mengling, Pimentel, Marco A. F., Clifton, David A., Naumann, Tristan Josef, Brennan, Thomas Patrick, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Harvard University--MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brennan, Thomas P, Szolovits, Peter, Feng, Mengling, Pimentel, Marco A. F., Clifton, David A., Naumann, Tristan Josef, and Brennan, Thomas Patrick
- Abstract
The ability to determine patient acuity (or severity of illness) has immediate practical use for clinicians. We evaluate the use of multivariate timeseries modeling with the multi-task Gaussian process (GP) models using noisy, incomplete, sparse, heterogeneous and unevenly-sampled clinical data, including both physiological signals and clinical notes. The learned multi-task GP (MTGP) hyperparameters are then used to assess and forecast patient acuity. Experiments were conducted with two real clinical data sets acquired from ICU patients: firstly, estimating cerebrovascular pressure reactivity, an important indicator of secondary damage for traumatic brain injury patients, by learning the interactions between intracranial pressure and mean arterial blood pressure signals, and secondly, mortality prediction using clinical progress notes. In both cases, MTGPs provided improved results: an MTGP model provided better results than single-task GP models for signal interpolation and forecasting (0.91 vs 0.69 RMSE), and the use of MTGP hyperparameters obtained improved results when used as additional classification features (0.812 vs 0.788 AUC)., Intel Science and Technology Center for Big Data, National Institutes of Health. (U.S.). National Library of Medicine (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22), National Institute of Biomedical Imaging and Bioengineering (U.S.) (R01 Grant EB001659), Singapore. Agency for Science, Technology and Research (Graduate Scholarship)
- Published
- 2017
63. Datathons and Software to Promote Reproducible Research
- Author
-
Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, MIT Critical Data (Laboratory), Celi, Leo Anthony G., Lokhandwala, Sharukh, Montgomery, Robert, Moses, Christopher A, Naumann, Tristan, Pollard, Tom Joseph, Stretch, Robert, Spitz, Daniel, Naumann, Tristan Josef, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, MIT Critical Data (Laboratory), Celi, Leo Anthony G., Lokhandwala, Sharukh, Montgomery, Robert, Moses, Christopher A, Naumann, Tristan, Pollard, Tom Joseph, Stretch, Robert, Spitz, Daniel, and Naumann, Tristan Josef
- Abstract
Background: Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. Objective: We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. Methods: Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. Results: Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes—a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. Conclusions: This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.
- Published
- 2017
64. Semi-Supervised Biomedical Translation With Cycle Wasserstein Regression GANs
- Author
-
McDermott, Matthew, Yan, Tom, Naumann, Tristan, Hunt, Nathan, Suresh, Harini S., Szolovits, Peter, Ghassemi, Marzyeh, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, and Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
- Subjects
General Medicine - Abstract
The biomedical field offers many learning tasks that share unique challenges: large amounts of unpaired data, and a high cost to generate labels. In this work, we develop a method to address these issues with semi-supervised learning in regression tasks (e.g., translation from source to target). Our model uses adversarial signals to learn from unpaired datapoints, and imposes a cycle-loss reconstruction error penalty to regularize mappings in either direction against one another. We first evaluate our method on synthetic experiments, demonstrating two primary advantages of the system: 1) distribution matching via the adversarial loss and 2) regularization towards invertible mappings via the cycle loss. We then show a regularization effect and improved performance when paired data is supplemented by additional unpaired data on two real biomedical regression tasks: estimating the physiological effect of medical treatments, and extrapolating gene expression (transcriptomics) signals. Our proposed technique is a promising initial step towards more robust use of adversarial signals in semi-supervised regression, and could be useful for other tasks (e.g., causal inference or modality translation) in the biomedical field.
- Published
- 2018
65. Trends and Focus of Machine Learning Applications for Health Research
- Author
-
Beaulieu-Jones, Brett, primary, Finlayson, Samuel G., additional, Chivers, Corey, additional, Chen, Irene, additional, McDermott, Matthew, additional, Kandola, Jaz, additional, Dalca, Adrian V., additional, Beam, Andrew, additional, Fiterau, Madalina, additional, and Naumann, Tristan, additional
- Published
- 2019
- Full Text
- View/download PDF
66. Natural Language Processing for EHR-Based Computational Phenotyping
- Author
-
Zeng, Zexian, primary, Deng, Yu, additional, Li, Xiaoyu, additional, Naumann, Tristan, additional, and Luo, Yuan, additional
- Published
- 2019
- Full Text
- View/download PDF
67. Publicly Available Clinical
- Author
-
Alsentzer, Emily, primary, Murphy, John, additional, Boag, William, additional, Weng, Wei-Hung, additional, Jindi, Di, additional, Naumann, Tristan, additional, and McDermott, Matthew, additional
- Published
- 2019
- Full Text
- View/download PDF
68. Making Big Data Useful for Health Care: A Summary of the Inaugural MIT Critical Data Conference
- Author
-
Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Brennan, Thomas Patrick, Celi, Leo Anthony G., Feng, Mengling, Ghassemi, Marzyeh, Mark, Roger Greenwood, Naumann, Tristan, Badawi, Omar, Ippolito, Andrea, Johnson, Alistair, Mayaud, Louis, Moody, George B., Moses, Christopher, Naumann, Tristan Josef, Nikore, Vipan, Pimentel, Marco, Pollard, Tom J., Santos, Mauro, Stone, David J., Zimolzak, Andrew, Mark, Roger G, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Brennan, Thomas Patrick, Celi, Leo Anthony G., Feng, Mengling, Ghassemi, Marzyeh, Mark, Roger Greenwood, Naumann, Tristan, Badawi, Omar, Ippolito, Andrea, Johnson, Alistair, Mayaud, Louis, Moody, George B., Moses, Christopher, Naumann, Tristan Josef, Nikore, Vipan, Pimentel, Marco, Pollard, Tom J., Santos, Mauro, Stone, David J., Zimolzak, Andrew, and Mark, Roger G
- Abstract
With growing concerns that big data will only augment the problem of unreliable research, the Laboratory of Computational Physiology at the Massachusetts Institute of Technology organized the Critical Data Conference in January 2014. Thought leaders from academia, government, and industry across disciplines--including clinical medicine, computer science, public health, informatics, biomedical research, health technology, statistics, and epidemiology--gathered and discussed the pitfalls and challenges of big data in health care. The key message from the conference is that the value of large amounts of data hinges on the ability of researchers to share data, methodologies, and findings in an open setting. If empirical value is to be from the analysis of retrospective data, groups must continuously work together on similar problems to create more effective peer review. This will lead to improvement in methodology and quality, with each iteration of analysis resulting in more reliability.
- Published
- 2015
69. A multivariate timeseries modeling approach to severity of illness assessment and forecasting in ICU with sparse, heterogeneous clinical data
- Author
-
Pimentel, Marco A. F., Clifton, David A., Ghassemi, Marzyeh, Naumann, Tristan, Brennan, Thomas P, Szolovits, Peter, Feng, Mengling, Massachusetts Institute of Technology. Institute for Medical Engineering & Science, Harvard University--MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brennan, Thomas P, Szolovits, Peter, and Feng, Mengling
- Abstract
The ability to determine patient acuity (or severity of illness) has immediate practical use for clinicians. We evaluate the use of multivariate timeseries modeling with the multi-task Gaussian process (GP) models using noisy, incomplete, sparse, heterogeneous and unevenly-sampled clinical data, including both physiological signals and clinical notes. The learned multi-task GP (MTGP) hyperparameters are then used to assess and forecast patient acuity. Experiments were conducted with two real clinical data sets acquired from ICU patients: firstly, estimating cerebrovascular pressure reactivity, an important indicator of secondary damage for traumatic brain injury patients, by learning the interactions between intracranial pressure and mean arterial blood pressure signals, and secondly, mortality prediction using clinical progress notes. In both cases, MTGPs provided improved results: an MTGP model provided better results than single-task GP models for signal interpolation and forecasting (0.91 vs 0.69 RMSE), and the use of MTGP hyperparameters obtained improved results when used as additional classification features (0.812 vs 0.788 AUC)., Intel Science and Technology Center for Big Data, National Institutes of Health. (U.S.). National Library of Medicine (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22), National Institute of Biomedical Imaging and Bioengineering (U.S.) (R01 Grant EB001659), Singapore. Agency for Science, Technology and Research (Graduate Scholarship)
- Published
- 2015
70. Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
- Author
-
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Naumann, Tristan, Szolovits, Peter, Rumshisky, Anna A., Ghassemi, Marzyeh, Castro, V M, McCoy, T H, Perlis, R H, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Naumann, Tristan, Szolovits, Peter, Rumshisky, Anna A., Ghassemi, Marzyeh, Castro, V M, McCoy, T H, and Perlis, R H
- Abstract
The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established
- Published
- 2017
71. Predicting Clinical Outcomes Across Changing Electronic Health Record Systems
- Author
-
Gong, Jen J., primary, Naumann, Tristan, additional, Szolovits, Peter, additional, and Guttag, John V., additional
- Published
- 2017
- Full Text
- View/download PDF
72. Unfolding physiological state: mortality modelling in intensive care units
- Author
-
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brimmer, Nicole J., Joshi, Rohit, Szolovits, Peter, Doshi-Velez, Finale, Rumshisky, Anna, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brimmer, Nicole J., Joshi, Rohit, Szolovits, Peter, Doshi-Velez, Finale, and Rumshisky, Anna
- Abstract
Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: in-hospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work, Intel Corporation. Science and Technology Center for Big Data, National Library of Medicine (U.S.) (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22)
- Published
- 2016
73. Datathons and Software to Promote Reproducible Research
- Author
-
Celi, Leo Anthony, primary, Lokhandwala, Sharukh, additional, Montgomery, Robert, additional, Moses, Christopher, additional, Naumann, Tristan, additional, Pollard, Tom, additional, Spitz, Daniel, additional, and Stretch, Robert, additional
- Published
- 2016
- Full Text
- View/download PDF
74. A “datathon” model to support cross-disciplinary collaboration
- Author
-
Aboab, Jerôme, primary, Celi, Leo Anthony, additional, Charlton, Peter, additional, Feng, Mengling, additional, Ghassemi, Mohammad, additional, Marshall, Dominic C., additional, Mayaud, Louis, additional, Naumann, Tristan, additional, McCague, Ned, additional, Paik, Kenneth E., additional, Pollard, Tom J., additional, Resche-Rigon, Matthieu, additional, Salciccioli, Justin D., additional, and Stone, David J., additional
- Published
- 2016
- Full Text
- View/download PDF
75. A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data
- Author
-
Ghassemi, Marzyeh, primary, Pimentel, Marco, additional, Naumann, Tristan, additional, Brennan, Thomas, additional, Clifton, David, additional, Szolovits, Peter, additional, and Feng, Mengling, additional
- Published
- 2015
- Full Text
- View/download PDF
76. Unfolding physiological state
- Author
-
Ghassemi, Marzyeh, primary, Naumann, Tristan, additional, Doshi-Velez, Finale, additional, Brimmer, Nicole, additional, Joshi, Rohit, additional, Rumshisky, Anna, additional, and Szolovits, Peter, additional
- Published
- 2014
- Full Text
- View/download PDF
77. Scaling the PhysioNet WFDB Toolbox for MATLAB and Octave.
- Author
-
Naumann, Tristan and Silva, Ikaro
- Published
- 2014
78. The 4th Workshop on Clinical Natural Language Processing (ClinicalNLP) : Proceedings of the Workshop ; July 14, 2022
- Author
-
Naumann, Tristan (Herausgeber*in), Bethard, Steven (Herausgeber*in), Roberts, Kirk (Herausgeber*in), and Rumshisky, Anna (Herausgeber*in)
- Subjects
Medizin - Published
- 2022
79. Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
- Author
-
Roy H. Perlis, Victor M. Castro, Anna Rumshisky, Thomas H. McCoy, Tristan Naumann, Marzyeh Ghassemi, Peter Szolovits, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Naumann, Tristan, Szolovits, Peter, Rumshisky, Anna A., and Ghassemi, Marzyeh
- Subjects
Adult ,Male ,Topic model ,medicine.medical_specialty ,Time Factors ,Patient Discharge Summaries ,Kaplan-Meier Estimate ,computer.software_genre ,Patient Readmission ,Risk Assessment ,Cohort Studies ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,0302 clinical medicine ,Electronic Health Records ,Humans ,Medicine ,Generalizability theory ,030212 general & internal medicine ,Psychiatry ,Biological Psychiatry ,Aged ,Natural Language Processing ,Depressive Disorder, Major ,Models, Statistical ,Narration ,business.industry ,Middle Aged ,medicine.disease ,Psychiatry and Mental health ,Massachusetts ,Schizophrenia ,Cohort ,Anxiety ,Major depressive disorder ,Female ,Original Article ,Artificial intelligence ,medicine.symptom ,business ,computer ,030217 neurology & neurosurgery ,Natural language processing ,Cohort study ,Clinical psychology - Abstract
The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts.
- Published
- 2015
80. Unfolding physiological state: mortality modelling in intensive care units
- Author
-
Finale Doshi-Velez, Anna Rumshisky, Marzyeh Ghassemi, Peter Szolovits, Rohit Joshi, Tristan Naumann, Nicole J. Brimmer, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ghassemi, Marzyeh, Naumann, Tristan, Brimmer, Nicole J., Joshi, Rohit, and Szolovits, Peter
- Subjects
medicine.medical_specialty ,business.industry ,MEDLINE ,Latent variable ,computer.software_genre ,Latent Dirichlet allocation ,Article ,Task (project management) ,symbols.namesake ,Intensive care ,Health care ,medicine ,symbols ,Predictive power ,Data mining ,Intensive care medicine ,business ,Baseline (configuration management) ,computer - Abstract
Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: in-hospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies., Intel Corporation. Science and Technology Center for Big Data, National Library of Medicine (U.S.) (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22)
- Published
- 2014
81. Toward a responsible future: recommendations for AI-enabled clinical decision support.
- Author
-
Labkoff S, Oladimeji B, Kannry J, Solomonides A, Leftwich R, Koski E, Joseph AL, Lopez-Gonzalez M, Fleisher LA, Nolen K, Dutta S, Levy DR, Price A, Barr PJ, Hron JD, Lin B, Srivastava G, Pastor N, Luque US, Bui TTT, Singh R, Williams T, Weiner MG, Naumann T, Sittig DF, Jackson GP, and Quintana Y
- Subjects
- Humans, Decision Support Systems, Clinical, Artificial Intelligence
- Abstract
Background: Integrating artificial intelligence (AI) in healthcare settings has the potential to benefit clinical decision-making. Addressing challenges such as ensuring trustworthiness, mitigating bias, and maintaining safety is paramount. The lack of established methodologies for pre- and post-deployment evaluation of AI tools regarding crucial attributes such as transparency, performance monitoring, and adverse event reporting makes this situation challenging., Objectives: This paper aims to make practical suggestions for creating methods, rules, and guidelines to ensure that the development, testing, supervision, and use of AI in clinical decision support (CDS) systems are done well and safely for patients., Materials and Methods: In May 2023, the Division of Clinical Informatics at Beth Israel Deaconess Medical Center and the American Medical Informatics Association co-sponsored a working group on AI in healthcare. In August 2023, there were 4 webinars on AI topics and a 2-day workshop in September 2023 for consensus-building. The event included over 200 industry stakeholders, including clinicians, software developers, academics, ethicists, attorneys, government policy experts, scientists, and patients. The goal was to identify challenges associated with the trusted use of AI-enabled CDS in medical practice. Key issues were identified, and solutions were proposed through qualitative analysis and a 4-month iterative consensus process., Results: Our work culminated in several key recommendations: (1) building safe and trustworthy systems; (2) developing validation, verification, and certification processes for AI-CDS systems; (3) providing a means of safety monitoring and reporting at the national level; and (4) ensuring that appropriate documentation and end-user training are provided., Discussion: AI-enabled Clinical Decision Support (AI-CDS) systems promise to revolutionize healthcare decision-making, necessitating a comprehensive framework for their development, implementation, and regulation that emphasizes trustworthiness, transparency, and safety. This framework encompasses various aspects including model training, explainability, validation, certification, monitoring, and continuous evaluation, while also addressing challenges such as data privacy, fairness, and the need for regulatory oversight to ensure responsible integration of AI into clinical workflow., Conclusions: Achieving responsible AI-CDS systems requires a collective effort from many healthcare stakeholders. This involves implementing robust safety, monitoring, and transparency measures while fostering innovation. Future steps include testing and piloting proposed trust mechanisms, such as safety reporting protocols, and establishing best practice guidelines., (© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association.)
- Published
- 2024
- Full Text
- View/download PDF
82. What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization.
- Author
-
Adams G, Nguyen BH, Smith J, Xia Y, Xie S, Ostropolets A, Deb B, Chen YJ, Naumann T, and Elhadad N
- Abstract
Summarization models often generate text that is poorly calibrated to quality metrics because they are trained to maximize the likelihood of a single reference (MLE). To address this, recent work has added a calibration step, which exposes a model to its own ranked outputs to improve relevance or, in a separate line of work, contrasts positive and negative sets to improve faithfulness. While effective, much of this work has focused on how to generate and optimize these sets. Less is known about why one setup is more effective than another. In this work, we uncover the underlying characteristics of effective sets. For each training instance, we form a large, diverse pool of candidates and systematically vary the subsets used for calibration fine-tuning. Each selection strategy targets distinct aspects of the sets, such as lexical diversity or the size of the gap between positive and negatives. On three diverse scientific long-form summarization datasets (spanning biomedical, clinical, and chemical domains), we find, among others, that faithfulness calibration is optimal when the negative sets are extractive and more likely to be generated, whereas for relevance calibration, the metric margin between candidates should be maximized and surprise-the disagreement between model and metric defined candidate rankings-minimized. Code to create, select, and optimize calibration sets is available at https://github.com/griff4692/calibrating-summaries.
- Published
- 2023
- Full Text
- View/download PDF
83. A Review of Challenges and Opportunities in Machine Learning for Health.
- Author
-
Ghassemi M, Naumann T, Schulam P, Beam AL, Chen IY, and Ranganath R
- Abstract
Modern electronic health records (EHRs) provide data to answer clinically meaningful questions. The growing data in EHRs makes healthcare ripe for the use of machine learning. However, learning in a clinical setting presents unique challenges that complicate the use of common machine learning methodologies. For example, diseases in EHRs are poorly labeled, conditions can encompass multiple underlying endotypes, and healthy individuals are underrepresented. This article serves as a primer to illuminate these challenges and highlights opportunities for members of the machine learning community to contribute to healthcare., (©2020 AMIA - All rights reserved.)
- Published
- 2020
84. What's in a Note? Unpacking Predictive Value in Clinical Note Representations.
- Author
-
Boag W, Doss D, Naumann T, and Szolovits P
- Abstract
Electronic Health Records (EHRs) have seen a rapid increase in adoption during the last decade. The narrative prose contained in clinical notes is unstructured and unlocking its full potential has proved challenging. Many studies incorporating clinical notes have applied simple information extraction models to build representations that enhance a downstream clinical prediction task, such as mortality or readmission. Improved predictive performance suggests a "good" representation. However, these extrinsic evaluations are blind to most of the insight contained in the notes. In order to better understand the power of expressive clinical prose, we investigate both intrinsic and extrinsic methods for understanding several common note representations. To ensure replicability and to support the clinical modeling community, we run all experiments on publicly-available data and provide our code.
- Published
- 2018
85. Data Analysis
- Author
-
Raffa JD, Ghassemi M, Naumann T, Feng M, and Hsu D
- Abstract
This chapter presents an overview of data analysis for health data. We give a brief introduction to some of the most common methods for data analysis of health care data, focusing on choosing appropriate methodology for different types of study objectives, and on presentation and the interpretation of data analysis generated from health data. We will provide an overview of three very powerful analysis methods: linear regression, logistic regression and Cox proportional hazards models, which provide the foundation for most data analysis conducted in clinical studies., (Copyright 2016, The Author(s).)
- Published
- 2016
- Full Text
- View/download PDF
86. A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data.
- Author
-
Ghassemi M, Pimentel MA, Naumann T, Brennan T, Clifton DA, Szolovits P, and Feng M
- Abstract
The ability to determine patient acuity (or severity of illness) has immediate practical use for clinicians. We evaluate the use of multivariate timeseries modeling with the multi-task Gaussian process (GP) models using noisy, incomplete, sparse, heterogeneous and unevenly-sampled clinical data, including both physiological signals and clinical notes. The learned multi-task GP (MTGP) hyperparameters are then used to assess and forecast patient acuity. Experiments were conducted with two real clinical data sets acquired from ICU patients: firstly, estimating cerebrovascular pressure reactivity, an important indicator of secondary damage for traumatic brain injury patients, by learning the interactions between intracranial pressure and mean arterial blood pressure signals, and secondly, mortality prediction using clinical progress notes. In both cases, MTGPs provided improved results: an MTGP model provided better results than single-task GP models for signal interpolation and forecasting (0.91 vs 0.69 RMSE), and the use of MTGP hyperparameters obtained improved results when used as additional classification features (0.812 vs 0.788 AUC).
- Published
- 2015
87. Unfolding Physiological State: Mortality Modelling in Intensive Care Units.
- Author
-
Ghassemi M, Naumann T, Doshi-Velez F, Brimmer N, Joshi R, Rumshisky A, and Szolovits P
- Abstract
Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: inhospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies.
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.