Author: "Chen, Irene Y." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chen, Irene Y."' showing total 159 results

Start Over Author "Chen, Irene Y."

159 results on '"Chen, Irene Y."'

1. The Data Addition Dilemma

Author: Shen, Judy Hanwen, Raji, Inioluwa Deborah, and Chen, Irene Y.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: In many machine learning for healthcare tasks, standard datasets are constructed by amassing data across many, often fundamentally dissimilar, sources. But when does adding more data help, and when does it hinder progress on desired model outcomes in real-world settings? We identify this situation as the \textit{Data Addition Dilemma}, demonstrating that adding training data in this multi-source scaling context can at times result in reduced overall accuracy, uncertain fairness outcomes, and reduced worst-subgroup performance. We find that this possibly arises from an empirically observed trade-off between model performance improvements due to data scaling and model deterioration from distribution shift. We thus establish baseline strategies for navigating this dilemma, introducing distribution shift heuristics to guide decision-making on which data sources to add in data scaling, in order to yield the expected model performance improvements. We conclude with a discussion of the required considerations for data collection and suggestions for studying data composition and scale in the age of increasingly larger models., Comment: Machine Learning For Health Care 2024 (MLHC)
Published: 2024

2. The Minimum Information about CLinical Artificial Intelligence Checklist for Generative Modeling Research (MI-CLAIM-GEN)

Author: Miao, Brenda Y., Chen, Irene Y., Williams, Christopher YK, Davidson, Jaysón, Garcia-Agundez, Augusto, Sun, Shenghuan, Zack, Travis, Saria, Suchi, Arnaout, Rima, Quer, Giorgio, Sadaei, Hossein J., Torkamani, Ali, Beaulieu-Jones, Brett, Yu, Bin, Gianfrancesco, Milena, Butte, Atul J., Norgeot, Beau, and Sushil, Madhumita
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advances in generative models, including large language models (LLMs), vision language models (VLMs), and diffusion models, have accelerated the field of natural language and image processing in medicine and marked a significant paradigm shift in how biomedical models can be developed and deployed. While these models are highly adaptable to new tasks, scaling and evaluating their usage presents new challenges not addressed in previous frameworks. In particular, the ability of these models to produce useful outputs with little to no specialized training data ("zero-" or "few-shot" approaches), as well as the open-ended nature of their outputs, necessitate the development of new guidelines for robust reporting of clinical generative model research. In response to gaps in standards and best practices for the development of clinical AI tools identified by US Executive Order 141103 and several emerging national networks for clinical AI evaluation, we begin to formalize some of these guidelines by building on the original MI-CLAIM checklist. The new checklist, MI-CLAIM-GEN (Table 1), aims to address differences in training, evaluation, interpretability, and reproducibility of new generative models compared to non-generative ("predictive") AI models. This MI-CLAIM-GEN checklist also seeks to clarify cohort selection reporting with unstructured clinical data and adds additional items on alignment with ethical standards for clinical AI research.
Published: 2024

3. Identifying Reasons for Contraceptive Switching from Real-World Data Using Large Language Models

Author: Miao, Brenda Y., Williams, Christopher YK, Chinedu-Eneh, Ebenezer, Zack, Travis, Alsentzer, Emily, Butte, Atul J., and Chen, Irene Y.
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Prescription contraceptives play a critical role in supporting women's reproductive health. With nearly 50 million women in the United States using contraceptives, understanding the factors that drive contraceptives selection and switching is of significant interest. However, many factors related to medication switching are often only captured in unstructured clinical notes and can be difficult to extract. Here, we evaluate the zero-shot abilities of a recently developed large language model, GPT-4 (via HIPAA-compliant Microsoft Azure API), to identify reasons for switching between classes of contraceptives from the UCSF Information Commons clinical notes dataset. We demonstrate that GPT-4 can accurately extract reasons for contraceptive switching, outperforming baseline BERT-based models with microF1 scores of 0.849 and 0.881 for contraceptive start and stop extraction, respectively. Human evaluation of GPT-4-extracted reasons for switching showed 91.4% accuracy, with minimal hallucinations. Using extracted reasons, we identified patient preference, adverse events, and insurance as key reasons for switching using unsupervised topic modeling approaches. Notably, we also showed using our approach that "weight gain/mood change" and "insurance coverage" are disproportionately found as reasons for contraceptive switching in specific demographic populations. Our code and supplemental data are available at https://github.com/BMiao10/contraceptive-switching.
Published: 2024

4. NLP for Maternal Healthcare: Perspectives and Guiding Principles in the Age of LLMs

Author: Antoniak, Maria, Naik, Aakanksha, Alvarado, Carla S., Wang, Lucy Lu, and Chen, Irene Y.
Subjects: Computer Science - Computation and Language
Abstract: Ethical frameworks for the use of natural language processing (NLP) are urgently needed to shape how large language models (LLMs) and similar tools are used for healthcare applications. Healthcare faces existing challenges including the balance of power in clinician-patient relationships, systemic health disparities, historical injustices, and economic constraints. Drawing directly from the voices of those most affected, and focusing on a case study of a specific healthcare setting, we propose a set of guiding principles for the use of NLP in maternal healthcare. We led an interactive session centered on an LLM-based chatbot demonstration during a full-day workshop with 39 participants, and additionally surveyed 30 healthcare workers and 30 birthing people about their values, needs, and perceptions of NLP tools in the context of maternal health. We conducted quantitative and qualitative analyses of the survey results and interactive discussions to consolidate our findings into a set of guiding principles. We propose nine principles for ethical use of NLP for maternal healthcare, grouped into three themes: (i) recognizing contextual significance (ii) holistic measurements, and (iii) who/what is valued. For each principle, we describe its underlying rationale and provide practical advice. This set of principles can provide a methodological pattern for other researchers and serve as a resource to practitioners working on maternal health and other healthcare fields to emphasize the importance of technical nuance, historical context, and inclusive design when developing NLP technologies for clinical use.
Published: 2023

5. Generating Drug Repurposing Hypotheses through the Combination of Disease-Specific Hypergraphs

Author: Jain, Ayush, Laure-Charpignon, Marie, Chen, Irene Y., Philippakis, Anthony, and Alaa, Ahmed
Subjects: Quantitative Biology - Biomolecules, Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: The drug development pipeline for a new compound can last 10-20 years and cost over 10 billion. Drug repurposing offers a more time- and cost-effective alternative. Computational approaches based on biomedical knowledge graph representations have recently yielded new drug repurposing hypotheses. In this study, we present a novel, disease-specific hypergraph representation learning technique to derive contextual embeddings of biological pathways of various lengths but that all start at any given drug and all end at the disease of interest. Further, we extend this method to multi-disease hypergraphs. To determine the repurposing potential of each of the 1,522 drugs, we derive drug-specific distributions of cosine similarity values and ultimately consider the median for ranking. Cosine similarity values are computed between (1) all biological pathways starting at the considered drug and ending at the disease of interest and (2) all biological pathways starting at drugs currently prescribed against that disease and ending at the disease of interest. We illustrate our approach with Alzheimer's disease (AD) and two of its risk factors: hypertension (HTN) and type 2 diabetes (T2D). We compare each drug's rank across four hypergraph settings (single- or multi-disease): AD only, AD + HTN, AD + T2D, and AD + HTN + T2D. Notably, our framework led to the identification of two promising drugs whose repurposing potential was significantly higher in hypergraphs combining two diseases: dapagliflozin (antidiabetic; moved up, from top 32$\%$ to top 7$\%$, across all considered drugs) and debrisoquine (antihypertensive; moved up, from top 76$\%$ to top 23$\%$). Our approach serves as a hypothesis generation tool, to be paired with a validation pipeline relying on laboratory experiments and semi-automated parsing of the biomedical literature., Comment: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 9 pages
Published: 2023

6. Prognostic implications of tumor histology and microenvironment in surgically resected intrahepatic cholangiocarcinoma: a single institutional experience

Author: Chen, Irene Y., Dunne, Richard F., and Liao, Xiaoyan
Published: 2024
Full Text: View/download PDF

7. Closing the Gap in High-Risk Pregnancy Care Using Machine Learning and Human-AI Collaboration

Author: Mozannar, Hussein, Utsumi, Yuria, Chen, Irene Y., Gervasi, Stephanie S., Ewing, Michele, Smith-McLallen, Aaron, and Sontag, David
Subjects: Computer Science - Machine Learning, Computer Science - Human-Computer Interaction
Abstract: A high-risk pregnancy is a pregnancy complicated by factors that can adversely affect the outcomes of the mother or the infant. Health insurers use algorithms to identify members who would benefit from additional clinical support. This work presents the implementation of a real-world ML-based system to assist care managers in identifying pregnant patients at risk of complications. In this retrospective evaluation study, we developed a novel hybrid-ML classifier to predict whether patients are pregnant and trained a standard classifier using claims data from a health insurance company in the US to predict whether a patient will develop pregnancy complications. These models were developed in cooperation with the care management team and integrated into a user interface with explanations for the nurses. The proposed models outperformed commonly used claim codes for the identification of pregnant patients at the expense of a manageable false positive rate. Our risk complication classifier shows that we can accurately triage patients by risk of complication. Our approach and evaluation are guided by human-centric design. In user studies with the nurses, they preferred the proposed models over existing approaches.
Published: 2023

8. Machine Learning for Health symposium 2022 -- Extended Abstract track

Author: Parziale, Antonio, Agrawal, Monica, Joshi, Shalmali, Chen, Irene Y., Tang, Shengpu, Oala, Luis, and Subbaswamy, Adarsh
Subjects: Computer Science - Machine Learning, 68Txx, I.2, J.3, I.6, I.4
Abstract: A collection of the extended abstracts that were presented at the 2nd Machine Learning for Health symposium (ML4H 2022), which was held both virtually and in person on November 28, 2022, in New Orleans, Louisiana, USA. Machine Learning for Health (ML4H) is a longstanding venue for research into machine learning for health, including both theoretical works and applied works. ML4H 2022 featured two submission tracks: a proceedings track, which encompassed full-length submissions of technically mature and rigorous work, and an extended abstract track, which would accept less mature, but innovative research for discussion. All the manuscripts submitted to ML4H Symposium underwent a double-blind peer-review process. Extended abstracts included in this collection describe innovative machine learning research focused on relevant problems in health and biomedicine.
Published: 2022

9. High risk features in colorectal adenomatous polyps: A multi-institutional study

Author: Lee, Michael, Ko, Huaibin Mabel, Kudose, Satoru, Remotti, Helen, Choi, Won-Tak, Salomao, Marcela A., Zhao, Lei, Isidro, Raymond A., Liao, Xiaoyan, Ettel, Mark G., Chen, Irene Y., Liu, Xiaoqin, Pai, Reetesh, Alpert, Lindsay, Setia, Namrata, Wu, Elizabeth, Henn, Patrick, Westbrook, Lindsey, and Lagana, Stephen M.
Published: 2024
Full Text: View/download PDF

10. Clustering Interval-Censored Time-Series for Disease Phenotyping

Author: Chen, Irene Y., Krishnan, Rahul G., and Sontag, David
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Unsupervised learning is often used to uncover clusters in data. However, different kinds of noise may impede the discovery of useful patterns from real-world time-series data. In this work, we focus on mitigating the interference of interval censoring in the task of clustering for disease phenotyping. We develop a deep generative, continuous-time model of time-series data that clusters time-series while correcting for censorship time. We provide conditions under which clusters and the amount of delayed entry may be identified from data under a noiseless model. On synthetic data, we demonstrate accurate, stable, and interpretable results that outperform several benchmarks. On real-world clinical datasets of heart failure and Parkinson's disease patients, we study how interval censoring can adversely affect the task of disease phenotyping. Our model corrects for this source of error and recovers known clinical subtypes., Comment: AAAI 2022
Published: 2021

11. SWI/SNF chromatin remodeling complex in pancreatic ductal adenocarcinoma: Clinicopathologic and immunohistochemical study

Author: Chen, Irene Y., Ettel, Mark G., Bell, Phoenix D., Huber, Aaron R., Findeis-Hosey, Jennifer J., Wang, Wenjia, Hezel, Aram F., Dunne, Richard F., Drage, Michael G., and Agostini-Vulaj, Diana
Published: 2024
Full Text: View/download PDF

12. Probabilistic Machine Learning for Healthcare

Author: Chen, Irene Y., Joshi, Shalmali, Ghassemi, Marzyeh, and Ranganath, Rajesh
Subjects: Statistics - Machine Learning, Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: Machine learning can be used to make sense of healthcare data. Probabilistic machine learning models help provide a complete picture of observed data in healthcare. In this review, we examine how probabilistic machine learning can advance healthcare. We consider challenges in the predictive model building pipeline where probabilistic models can be beneficial including calibration and missing data. Beyond predictive models, we also investigate the utility of probabilistic machine learning models in phenotyping, in generative models for clinical use cases, and in reinforcement learning., Comment: Annual Reviews of Biomedical Data Science 2021
Published: 2020

13. Ethical Machine Learning in Health Care

Author: Chen, Irene Y., Pierson, Emma, Rose, Sherri, Joshi, Shalmali, Ferryman, Kadija, and Ghassemi, Marzyeh
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The use of machine learning (ML) in health care raises numerous ethical concerns, especially as models can amplify existing health inequities. Here, we outline ethical considerations for equitable ML in the advancement of health care. Specifically, we frame ethics of ML in health care through the lens of social justice. We describe ongoing efforts and outline challenges in a proposed pipeline of ethical ML in health, ranging from problem selection to post-deployment considerations. We close by summarizing recommendations to address these challenges., Comment: Annual Reviews in Biomedical Data Science 2021
Published: 2020
Full Text: View/download PDF

14. Intimate Partner Violence and Injury Prediction From Radiology Reports

Author: Chen, Irene Y., Alsentzer, Emily, Park, Hyesun, Thomas, Richard, Gosangi, Babina, Gujrathi, Rahul, and Khurana, Bharti
Subjects: Computer Science - Computers and Society, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Intimate partner violence (IPV) is an urgent, prevalent, and under-detected public health issue. We present machine learning models to assess patients for IPV and injury. We train the predictive algorithms on radiology reports with 1) IPV labels based on entry to a violence prevention program and 2) injury labels provided by emergency radiology fellowship-trained physicians. Our dataset includes 34,642 radiology reports and 1479 patients of IPV victims and control patients. Our best model predicts IPV a median of 3.08 years before violence prevention program entry with a sensitivity of 64% and a specificity of 95%. We conduct error analysis to determine for which patients our model has especially high or low performance and discuss next steps for a deployed clinical risk model.
Published: 2020

15. CheXclusion: Fairness gaps in deep chest X-ray classifiers

Author: Seyyed-Kalantari, Laleh, Liu, Guanxiong, McDermott, Matthew, Chen, Irene Y., and Ghassemi, Marzyeh
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing, Statistics - Machine Learning
Abstract: Machine learning systems have received much attention recently for their ability to achieve expert-level performance on clinical tasks, particularly in medical imaging. Here, we examine the extent to which state-of-the-art deep learning classifiers trained to yield diagnostic labels from X-ray images are biased with respect to protected attributes. We train convolution neural networks to predict 14 diagnostic labels in 3 prominent public chest X-ray datasets: MIMIC-CXR, Chest-Xray8, CheXpert, as well as a multi-site aggregation of all those datasets. We evaluate the TPR disparity -- the difference in true positive rates (TPR) -- among different protected attributes such as patient sex, age, race, and insurance type as a proxy for socioeconomic status. We demonstrate that TPR disparities exist in the state-of-the-art classifiers in all datasets, for all clinical tasks, and all subgroups. A multi-source dataset corresponds to the smallest disparities, suggesting one way to reduce bias. We find that TPR disparities are not significantly correlated with a subgroup's proportional disease burden. As clinical models move from papers to products, we encourage clinical decision makers to carefully audit for algorithmic disparities prior to deployment. Our code can be found at, https://github.com/LalehSeyyed/CheXclusion, Comment: Paper is accepted in Pacific Symposium on Biocomputing 2021 (PSB2021). Code can be found at, https://github.com/LalehSeyyed/CheXclusion
Published: 2020

16. Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph

Author: Chen, Irene Y., Agrawal, Monica, Horng, Steven, and Sontag, David
Subjects: Statistics - Applications, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Increasingly large electronic health records (EHRs) provide an opportunity to algorithmically learn medical knowledge. In one prominent example, a causal health knowledge graph could learn relationships between diseases and symptoms and then serve as a diagnostic tool to be refined with additional clinical input. Prior research has demonstrated the ability to construct such a graph from over 270,000 emergency department patient visits. In this work, we describe methods to evaluate a health knowledge graph for robustness. Moving beyond precision and recall, we analyze for which diseases and for which patients the graph is most accurate. We identify sample size and unmeasured confounders as major sources of error in the health knowledge graph. We introduce a method to leverage non-linear functions in building the causal graph to better understand existing model assumptions. Finally, to assess model generalizability, we extend to a larger set of complete patient visits within a hospital system. We conclude with a discussion on how to robustly extract medical knowledge from EHRs., Comment: 12 pages, presented at PSB 2020
Published: 2019

17. Automatic Generation of Cloze Items for Repeated Testing to Improve Reading Comprehension

Author: Yang, Albert C. M., Chen, Irene Y. L., Flanagan, Brendan, and Ogata, Hiroaki
Abstract: Reviewing learned knowledge is critical in the learning process. Testing the learning content instead of restudying, which is known as the testing effect, has been demonstrated to be an effective review strategy. However, education research recommends that instructors generate practice tests, but this burdens teachers and may also hinder teaching quality. To resolve this issue, the current study applied a modern artificial intelligence technique (BERT) to automate the generation of tests and evaluate the testing effect through e-books in a university lecture (N = 74). The last 5 minutes of each course session were utilized to review the taught content by having students either answer cloze item questions or restudy the summary of the core concepts covered in the lecture. A reading comprehension pretest was conducted before the experiment to ensure that the differences in prior knowledge were nonsignificant between groups, and a posttest was performed to examine the effectiveness of testing. In addition, we evaluated students' reading skills and reading engagement through their ability to identify key concepts and their interaction with e-books, respectively. A positive effect was observed for students who engaged in cloze item practice before the end of each class. The results indicated that the repeated testing group exhibited significantly better reading skills and engaged more with e-books than the restudying group did. More importantly, compared with only restudying the key concepts, answering the cloze items questions significantly improved students' reading comprehension. Our results suggest that machine-generated cloze testing may benefit learning in higher education.
Published: 2021

18. NLP for Maternal Healthcare: Perspectives and Guiding Principles in the Age of LLMs

Author: Antoniak, Maria, primary, Naik, Aakanksha, additional, Alvarado, Carla S., additional, Wang, Lucy Lu, additional, and Chen, Irene Y., additional
Published: 2024
Full Text: View/download PDF

19. A Review of Challenges and Opportunities in Machine Learning for Health

Author: Ghassemi, Marzyeh, Naumann, Tristan, Schulam, Peter, Beam, Andrew L., Chen, Irene Y., and Ranganath, Rajesh
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society, Statistics - Machine Learning
Abstract: Modern electronic health records (EHRs) provide data to answer clinically meaningful questions. The growing data in EHRs makes healthcare ripe for the use of machine learning. However, learning in a clinical setting presents unique challenges that complicate the use of common machine learning methodologies. For example, diseases in EHRs are poorly labeled, conditions can encompass multiple underlying endotypes, and healthy individuals are underrepresented. This article serves as a primer to illuminate these challenges and highlights opportunities for members of the machine learning community to contribute to healthcare., Comment: Updated version
Published: 2018

20. Using a Summarized Lecture Material Recommendation System to Enhance Students' Preclass Preparation in a Flipped Classroom

Author: Yang, Christopher C. Y., Chen, Irene Y. L., Akçapinar, Gökhan, Flanagan, Brendan, and Ogata, Hiroaki
Abstract: Research has revealed the positive effects of flipped classroom approaches on students' learning engagement and performance compared with conventional lecture-based classrooms. However, because of a lack of out-of-class learning support, many students fail to comprehensively prepare the provided lecture materials before class. One promising solution to this problem is recommendation systems in the educational area, which have been instrumental in helping learners identify useful and relevant lecture materials that satisfy their learning needs. Thus, in this study, we propose a summarized lecture material recommendation system, which is integrated into an e-book reading system as an enhancement of the flipped classroom approach. This system helps students identify pages that contain essential knowledge that must be thoroughly studied before class. The proposed system was constructed on the basis of our previous work. In this study, a quasi-experiment was conducted in a graduate course that implemented the flipped classroom model: experimental group students learned with the proposed system, whereas the control group students had no access to the additional features. The findings of this study suggest that students who learn with the proposed recommendation system significantly outperform those who learn without the system in a flipped classroom in terms of their learning outcomes and engagement in preclass preparation.
Published: 2021

21. Toward Precision Education: Educational Data Mining and Learning Analytics for Identifying Students' Learning Patterns with Ebook Systems

Author: Yang, Christopher C. Y., Chen, Irene Y. L., and Ogata, Hiroaki
Abstract: Precision education is now recognized as a new challenge of applying artificial intelligence, machine learning, and learning analytics to improve both learning performance and teaching quality. To promote precision education, digital learning platforms have been widely used to collect educational records of students' behavior, performance, and other types of interaction. On the other hand, the increasing volume of students' learning behavioral data in virtual learning environments provides opportunities for mining data on these students' learning patterns. Accordingly, identifying students' online learning patterns on various digital learning platforms has drawn the interest of the learning analytics and educational data mining research communities. In this study, the authors applied data analytics methods to examine the learning patterns of students using an ebook system for one semester in an undergraduate course. The authors used a clustering approach to identify subgroups of students with different learning patterns. Several subgroups were identified, and the students' learning patterns in each subgroup were determined accordingly. In addition, the association between these students' learning patterns and their learning outcomes from the course was investigated. The findings of this study provide educators opportunities to predict students' learning outcomes by analyzing their online learning behaviors and providing timely intervention for improving their learning experience, which achieves one of the goals of learning analytics as part of precision education.
Published: 2021

22. From Human Grading to Machine Grading: Automatic Diagnosis of e-Book Text Marking Skills in Precision Education

Author: Yang, Albert C. M., Chen, Irene Y. L., Flanagan, Brendan, and Ogata, Hiroaki
Abstract: Precision education is a new challenge in leveraging artificial intelligence, machine learning, and learning analytics to enhance teaching quality and learning performance. To facilitate precision education, text marking skills can be used to determine students' learning process. Text marking is an essential learning skill in reading. In this study, we proposed a model that leverages the state-of-the-art text summarization technique, Bidirectional Encoder Representations from Transformers (BERT), to calculate the marking score for 130 graduate students enrolled in an accounting course. Then, we applied learning analytics to analyze the correlation between their marking scores and learning performance. We measured students' self-regulated learning (SRL) and clustered them into four groups based on their marking scores and marking frequencies to examine whether differences in reading skills and text marking influence students' learning performance and awareness of self-regulation. Consistent with past research, our results did not indicate a strong relationship between marking scores and learning performance. However, high-skill readers who use more marking strategies perform better in learning performance, task strategies, and time management than high-skill readers who use fewer marking strategies. Furthermore, high-skill readers who actively employ marking strategies also achieve superior scores of environment structure, and task strategies in SRL than low-skill readers who are inactive in marking. The findings of this research provide evidence supporting the importance of monitoring and training students' text marking skill and facilitating precision education.
Published: 2021

23. Prompt Engineering for Generative Artificial Intelligence in Gastroenterology and Hepatology.

Author: Jin Ge, Chen, Irene Y., Pletcher, Mark J., and Lai, Jennifer C.
Subjects: *GENERATIVE artificial intelligence, *ARTIFICIAL intelligence, *LANGUAGE models, *INFORMATION technology, *GEMINI (Chatbot), *HEPATORENAL syndrome
Abstract: This article explores the application of generative artificial intelligence (GAI) and large language models (LLMs) in the field of gastroenterology and hepatology. It introduces the concept of prompt engineering, which involves optimizing user inputs to guide LLMs in generating desired outputs. The article discusses the goals of prompt engineering, such as task completion and in-context learning, and explains the core elements of a prompt. It also examines different approaches to prompting and the roles of system, user, and assistant in the interaction with LLMs. The article concludes by emphasizing the importance of prompt engineering in improving the performance of LLMs in extracting data from research articles, while acknowledging the limitations of this approach. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

24. Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations

Author: Seyyed-Kalantari, Laleh, Zhang, Haoran, McDermott, Matthew B. A., Chen, Irene Y., and Ghassemi, Marzyeh
Subjects: Artificial intelligence -- Usage, Medical imaging equipment -- Usage, Artificial intelligence, Biological sciences, Health
Abstract: Artificial intelligence (AI) systems have increasingly achieved expert-level performance in medical imaging applications. However, there is growing concern that such AI systems may reflect and amplify human bias, and reduce the quality of their performance in historically under-served populations such as female patients, Black patients, or patients of low socioeconomic status. Such biases are especially troubling in the context of underdiagnosis, whereby the AI algorithm would inaccurately label an individual with a disease as healthy, potentially delaying access to care. Here, we examine algorithmic underdiagnosis in chest X-ray pathology classification across three large chest X-ray datasets, as well as one multi-source dataset. We find that classifiers produced using state-of-the-art computer vision techniques consistently and selectively underdiagnosed under-served patient populations and that the underdiagnosis rate was higher for intersectional under-served subpopulations, for example, Hispanic female patients. Deployment of AI systems using medical imaging for disease diagnosis with such biases risks exacerbation of existing care biases and can potentially lead to unequal access to medical treatment, thereby raising ethical concerns for the use of these models in the clinic. Artificial intelligence algorithms trained using chest X-rays consistently underdiagnose pulmonary abnormalities or diseases in historically under-served patient populations, raising ethical concerns about the clinical use of such algorithms., Author(s): Laleh Seyyed-Kalantari [sup.1] [sup.2] , Haoran Zhang [sup.3] , Matthew B. A. McDermott [sup.3] , Irene Y. Chen [sup.3] , Marzyeh Ghassemi [sup.2] [sup.3] Author Affiliations: (1) University of [...]
Published: 2021
Full Text: View/download PDF

25. Systemic lupus in the era of machine learning medicine

Author: Zhan, Kevin, primary, Buhler, Katherine A, additional, Chen, Irene Y, additional, Fritzler, Marvin J, additional, and Choi, May Y, additional
Published: 2024
Full Text: View/download PDF

26. Updating the Minimum Information about CLinical Artificial Intelligence (MI-CLAIM) checklist for generative modeling research

Author: Miao, Brenda Y., Chen, Irene Y., Williams, Christopher YK, Davidson, Jaysón, Garcia-Agundez, Augusto, Sun, Harry, Zack, Travis, Butte, Atul J., Sushil, Madhumita, Miao, Brenda Y., Chen, Irene Y., Williams, Christopher YK, Davidson, Jaysón, Garcia-Agundez, Augusto, Sun, Harry, Zack, Travis, Butte, Atul J., and Sushil, Madhumita
Abstract: Recent advances in generative models, including large language models (LLMs), vision language models (VLMs), and diffusion models, have accelerated the field of natural language and image processing in medicine and marked a significant paradigm shift in how biomedical models can be developed and deployed. While these models are highly adaptable to new tasks, scaling and evaluating their usage presents new challenges not addressed in previous frameworks. In particular, the ability of these models to produce useful outputs with little to no specialized training data ("zero-" or "few-shot" approaches), as well as the open-ended nature of their outputs, necessitate the development of updated guidelines in using and evaluating these models. In response to gaps in standards and best practices for the development of clinical AI tools identified by US Executive Order 141103 and several emerging national networks for clinical AI evaluation, we begin to formalize some of these guidelines by building on the "Minimum information about clinical artificial intelligence modeling" (MI-CLAIM) checklist. The MI-CLAIM checklist, originally developed in 2020, provided a set of six steps with guidelines on the minimum information necessary to encourage transparent, reproducible research for artificial intelligence (AI) in medicine. Here, we propose modifications to the original checklist that highlight differences in training, evaluation, interpretability, and reproducibility of generative models compared to traditional AI models for clinical research. This updated checklist also seeks to clarify cohort selection reporting and adds additional items on alignment with ethical standards.
Published: 2024

27. How I Approach It: Prompt Engineering for Generative Artificial Intelligence (GAI) in Gastroenterology and Hepatology

Author: Ge, Jin, primary, Chen, Irene Y., additional, Pletcher, Mark J., additional, and Lai, Jennifer C., additional
Published: 2024
Full Text: View/download PDF

28. Reply to: ‘Potential sources of dataset bias complicate investigation of underdiagnosis by machine learning algorithms’ and ‘Confounding factors need to be accounted for in assessing bias by machine learning algorithms’

Author: Seyyed-Kalantari, Laleh, Zhang, Haoran, McDermott, Matthew B. A., Chen, Irene Y., and Ghassemi, Marzyeh
Published: 2022
Full Text: View/download PDF

29. Evidence for inhibition of a fungal biocontrol agent by a plant microbiome

Author: Garbelotto, Matteo, Lowell, Natalie, Chen, Irene Y., and Osmundson, Todd W.
Published: 2019

30. Congenital hepatic fibrosis and its mimics: a clinicopathologic study of 19 cases at a single institution

Author: Chen, Irene Y., Whitney-Miller, Christa L., and Liao, Xiaoyan
Published: 2021
Full Text: View/download PDF

31. COMBATING THE PURPLE BOTANICAL PLAGUE: EVALUATION OF COLLETOTRICHUM GLOEOSPORIOIDES F. SP. MICONIAE FOR BIOLOGICAL CONTROL OF MICONIA CALVESCENS IN MO’OREA, FRENCH POLYNESIA

Author: Chen, Irene Y
Subjects: invasive plants, Miconia calvescens, biological control, Colletotrichum gloeosporioides, elevation, endophyte community, Mo’orea, French Polynesia
Abstract: Classical biological control aims to actively manage threats that cause immense losses in biological diversity. The introduction of a biological control agent Colletotrichum gloeosporioides f. sp. miconiae (Cgm) in April 2000 to Tahiti, French Polynesia was intended to control the massive spread of an invasive weed Miconia calvescens. However, while Cgm has subsequently spread to Mo’orea, its impact on reducing M. calvescens remains uncertain. The main objectives of this study are: (1) to quantify the amount of the fungal pathogen infecting the M. calvescens plants at three elevation ranges on Mo’orea, (2) to understand the impact of moisture on the proliferation of disease development, and (3) to test the influence of endophytic fungal communities on the competitive ability of Cgm. Results from quantifying leaf damage showed that at higher elevations, Cgm disease development is more rampant. In laboratory experiments, varying moisture did not significantly affect the health of the seedling. While Cgm growth rate correlates with competitive ability, endophytic fungal growth rate does not, leading to speculation that other modes such as chemical interactions allow for endophytic competitive ability. Lastly, data supports the hypothesis that Cgm becomes a better competitor against the endophytic fungal species at higher elevations. Results of this study suggest that other microclimatic factors such as temperature and humidity may play a role in disease development. While Cgm may decelerate the growth of M. calvescens, Cgm alone is not likely to obliterate the massive damage M. calvescens has done on the native flora of Mo’orea and its surrounding islands. Conservation biologists must urgently attack this pest, or the fragile ecosystem of the islands will lead to massive losses in biological diversity.
Published: 2009

32. Metastatic Pleomorphic Dermal Sarcoma Involving Tonsil and Thyroid: An Unusual Presentation.

Author: Solomianko, Marek, Chen, Irene Y., Kaza, Srinivas, and Dongwei Zhang
Subjects: *METASTASIS, *SARCOMA, *THYROID diseases, *SPINDLE apparatus, *NODULAR disease
Abstract: Pleomorphic dermal sarcoma (PDS) is a rare dermal-based mesenchymal tumor that shares histologic characteristics with atypical fibroxanthoma (AFX), but demonstrates aggressive features increasing the risk of local recurrence or metastases. We report a unique case of a 62-year-old man with a recent history of left-groin PDS, now with a 2-cm pedunculated left tonsillar mass and two hypermetabolic nodules in the left thyroid. Microscopically, the tonsillar biopsy demonstrated infiltrating spindle cell neoplasm composed of highly pleomorphic tumor cells with abundant mitoses. The left hemithyroidectomy specimen also demonstrated similar pleomorphic spindled tumor cells infiltrating thyroid parenchyma with perineural invasion and focal osseous and chondroid components. Immunohistochemically, both tumors were diffusely positive for smooth muscle actin. Focal nonspecific positive staining with p53, SATB2, and S100 was observed in the thyroid tumor. Additional markers, including p40, CK5, p63, pancytokeratin, CAM5.2, S100, SOX10, Melan-A, HMB45, CD45, CD68, CD163, caldesmon, p16, TTF1, PAX8, and calcitonin, were all negative. INI-1 expression was retained. Molecular analysis showed that the tumor harbored PDGFRA deletion of exon 9 and amplification, PTCH1 A300fs*24, MDM2 amplification, CDKN2A/B loss, and FRS2 amplification. Compared with the patient's groin tumor, the current tumors showed similar morphology. Overall, the histologic and immunohistochemical findings are consistent with metastatic PDS to the tonsil and thyroid. Although PDS is commonly found on the sun-exposed skin in the elderly, this case highlights an unusual primary site as well as peculiar locations for the metastasis of PDS. Treatment and management of the patient are also considered. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Recent Advances, Applications and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2022 Symposium

Author: Hegselmann, Stefan, Zhou, Helen, Yuyin Zhou, Chien, Jennifer, Sujay Nagaraj, Hulkund, Neha, Shreyas Bhave, Oberst, Michael, Amruta Pai, Ellington, Caleb, Wisdom Ikezogwo, Dou, Jason Xiaotian, Agrawal, Monica, Changye Li, Peniel Argaw, Biswas, Arpita, Mehak Gupta, Xinhui Li, Lemanczyk, Marta, Yuhui Zhang, Garbin, Christian, Healey, Elizabeth, Heejong Kim, Boone, Claire, Daneshjou, Roxana, Siyu Shi, Pezzotti, Nicola, Pfohl, Stephen R., Fong, Edwin, Aakanksha Naik, Lengerich, Ben, Xu, Ying, Bidwell, Jonathan, Sendak, Mark, Byung-Hak Kim, Hendrix, Nathaniel, Spathis, Dimitris, Seita, Jun, Quast, Bastiaan, Coffee, Megan, Stultz, Collin, Chen, Irene Y., Shalmali Joshi, and Girmaw Abebe Tadesse
Subjects: machine learning, medicine, healthcare
Abstract: The second Machine Learning for Health (ML4H) symposium was held both virtually and in-person on November 28, 2022, in New Orleans, Louisiana, USA (Parziale et al.,2022). The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the ML4H community. Encouraged by the successful virtual roundtables in the previous year (Roy et al.,2021), we organized nine in-person and four virtual roundtables at ML4H 2022 (Parziale et al.,2022). A roundtable session included invited senior chairs (with substantial experience in the field), junior chairs (responsible for facilitating the discussion), and attendees from di- verse backgrounds with interest in the session’s topic. This document explains the organization process we used and compiles the takeaways from the roundtable discussions, including recent advances, applications, and open challenges for each topic. We conclude with a summary and lessons learned across all roundtables.
Published: 2023
Full Text: View/download PDF

34. Treating health disparities with artificial intelligence

Author: Chen, Irene Y., Joshi, Shalmali, and Ghassemi, Marzyeh
Published: 2020
Full Text: View/download PDF

35. Ameloblastic Carcinoma of the Maxilla: A Rare Case Report and Review of Literature from 1948 to 2021.

Author: Chen, Irene Y., Giampoli, Ellen J., and Zhang, Dongwei
Subjects: *MAXILLA, *LITERATURE reviews, *POSITRON emission tomography, *MAXILLARY sinus, *CARCINOMA, *CANCER relapse
Abstract: Ameloblastic carcinoma is a rare malignant neoplasm arising from the odontogenic epithelium. Ameloblastic carcinoma commonly occurs de novo affecting the posterior segments of the mandible. Presently, only less than 100 cases have been reported arising from the maxilla. We report a unique case of maxillary ameloblastic carcinoma in a 68-year-old male with a 5.6 cm positron emission tomography (PET) avid left maxillary sinus mass. The patient underwent a left maxillectomy which revealed hyperchromatic and pleomorphic tumor cells arranged in a nested and trabecular architecture. The tumor cells showed distinct peripheral palisading with reverse polarization. Areas of bone destruction, necrosis, lymphovascular and perineural invasions, as well as atypical mitoses, were identified. Immunohistochemically, the tumor cells were positive for keratin cocktail (AE1/AE3 and CAM 5.2), keratin 19, p40, and weakly positive for MDM2, while negative for calretinin. Molecular analysis revealed wild-type BRAF ; however, alterations in CDKN2A / B, MTAP, RB1, SMARCA4, STK11, FGF12, SETD2, and TP53 were present. This histopathologic and molecular profile supported the diagnosis of ameloblastic carcinoma. There has been no evidence of disease recurrence or metastasis eleven months after the initial diagnosis. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

36. Clustering Interval-Censored Time-Series for Disease Phenotyping

Author: Chen, Irene Y., primary, Krishnan, Rahul G., additional, and Sontag, David, additional
Published: 2022
Full Text: View/download PDF

37. Ameloblastic Carcinoma of the Maxilla: A Rare Case Report and Review of Literature from 1948 to 2021

Author: Chen, Irene Y., primary, Giampoli, Ellen J., additional, and Zhang, Dongwei, additional
Published: 2022
Full Text: View/download PDF

38. Examining the Factors Influencing Participants' Knowledge Sharing Behavior in Virtual Learning Communities

Author: Chen, Irene Y. L., Chen, Nian-Shing, and Kinshuk
Abstract: Increasing organizations and educational institutions have implemented virtual learning communities to encourage knowledge sharing. However, this task can not be accomplished simply by grouping people together and telling them "sharing your knowledge will make you learn better". This research attempts to examine the factors influencing knowledge sharing from the perspective of human behavior. Theory of Planned Behavior is integrated with social network ties and empirical findings from virtual learning community literature to develop the research model. The current research model comprises eight hypotheses to explore questions of whether social network ties, learners' attitude toward knowledge sharing, learners' beliefs of their capabilities in performing online knowledge sharing, and subjective norms relate to knowledge sharing intention, which leads to actual behavior in a virtual learning environment. This study empirically validates the hypothesized relationships using a field survey of college students and MBA students enrolling courses conducted in a virtual learning community. Attitude, subjective norm, Web-specific self-efficacy and social network ties are shown to be good predictors of knowledge sharing intention which, in turn, is significantly associated with knowledge sharing behavior. Knowledge creation self-efficacy does not significantly impact knowledge sharing intention. (Contains 3 tables and 2 figures.)
Published: 2009

39. Enhancing the Quality of E-Learning in Virtual Learning Communities by Finding Quality Learning Content and Trustworthy Collaborators

Author: Yang, Stephen J. H., Chen, Irene Y. L., and Kinshuk
Abstract: Virtual learning communities encourage members to learn and contribute knowledge. However, knowledge sharing requires mutual-trust collaboration between learners and the contribution of quality knowledge. This task cannot be accomplished by simply storing learning content in repositories. It requires a mechanism to help learners find relevant learning content as well as knowledgeable collaborators to work with. In this paper, we present a peer-to-peer based social network to enhance the quality of e-learning regarding knowledge sharing in virtual learning communities. From a technical viewpoint, we will present advanced semantic search mechanisms for finding quality content and trustworthy collaborators. From the social viewpoint, we will address how to support a trustworthy social network that encourages learners to share. Results of this research demonstrate that applying such mechanisms to knowledge sharing can improve the quality of e-learning in virtual learning communities. (Contains 5 figures and 2 tables.)
Published: 2007

40. The Potential For Bias In Machine Learning And Opportunities For Health Insurers To Address It: Article examines the potential for bias in machine learning and opportunities for health insurers to address it.

Author: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Gervasi, Stephanie S, Chen, Irene Y, Smith-McLallen, Aaron, Sontag, David, Obermeyer, Ziad, Vennera, Michael, Chawla, Ravi, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Gervasi, Stephanie S, Chen, Irene Y, Smith-McLallen, Aaron, Sontag, David, Obermeyer, Ziad, Vennera, Michael, and Chawla, Ravi
Published: 2022

41. Imaging and Clinical Findings in a Series of Six Cases of Rare Primary Mediastinal Liposarcoma

Author: Wong, Gordon S., primary, Bass, David, additional, Chen, Irene Y., additional, Thomas, Raymond, additional, Velez, Moises J., additional, and Hobbs, Susan K., additional
Published: 2022
Full Text: View/download PDF

42. The Potential For Bias In Machine Learning And Opportunities For Health Insurers To Address It

Author: Gervasi, Stephanie S., primary, Chen, Irene Y., additional, Smith-McLallen, Aaron, additional, Sontag, David, additional, Obermeyer, Ziad, additional, Vennera, Michael, additional, and Chawla, Ravi, additional
Published: 2022
Full Text: View/download PDF

43. Diffuse Pagetoid Squamous Cell Carcinoma in Situ of the Esophagus: A Rare Case Report and Review of Literature

Author: Chen, Irene Y., primary, Bartell, Nicholas, additional, and Ettel, Mark G., additional
Published: 2021
Full Text: View/download PDF

44. S764 Applying Machine Learning to Large Databases to Predict Nonresponse to Conventional Treatment in Patients with Crohn’s Disease

Author: Chen, Irene Y., primary, Berlin, Heather M., additional, Sontag, David, additional, Szolovits, Peter, additional, Kamble, Pravin, additional, Wang, Song, additional, Elomaa, Kaisa, additional, and Luo, Michelle, additional
Published: 2021
Full Text: View/download PDF

45. Probabilistic Machine Learning for Healthcare

Author: Chen, Irene Y., primary, Joshi, Shalmali, additional, Ghassemi, Marzyeh, additional, and Ranganath, Rajesh, additional
Published: 2021
Full Text: View/download PDF

46. Ethical Machine Learning in Healthcare

Author: Chen, Irene Y., primary, Pierson, Emma, additional, Rose, Sherri, additional, Joshi, Shalmali, additional, Ferryman, Kadija, additional, and Ghassemi, Marzyeh, additional
Published: 2021
Full Text: View/download PDF

47. Primary Colonic Medullary Carcinoma With Exclusive Squamous Differentiation

Author: CHEN, IRENE Y., primary, FINDEIS-HOSEY, JENNIFER J., additional, CANNON, LISA, additional, and LIAO, XIAOYAN, additional
Published: 2021
Full Text: View/download PDF

48. Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph

Author: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Chen, Irene Y, Agrawal, Monica, Horng, Steven, Sontag, David, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Chen, Irene Y, Agrawal, Monica, Horng, Steven, and Sontag, David
Abstract: © 2019 The Authors. Increasingly large electronic health records (EHRs) provide an opportunity to algorithmi-cally learn medical knowledge. In one prominent example, a causal health knowledge graph could learn relationships between diseases and symptoms and then serve as a diagnostic tool to be refined with additional clinical input. Prior research has demonstrated the ability to construct such a graph from over 270,000 emergency department patient visits. In this work, we describe methods to evaluate a health knowledge graph for robustness. Moving beyond precision and recall, we analyze for which diseases and for which patients the graph is most accurate. We identify sample size and unmeasured confounders as major sources of error in the health knowledge graph. We introduce a method to leverage non-linear functions in building the causal graph to better understand existing model assumptions. Finally, to assess model generalizability, we extend to a larger set of complete patient visits within a hospital system. We conclude with a discussion on how to robustly extract medical knowl-edge from EHRs. Supplementary material: http://clinicalml.org/papers/ChenEtAl PSB20 suppl.pdf.
Published: 2021

49. Investigating the Relative Performance of Bricks-and-Mortar, Clicks-and-Mortar, and Pure-Click Firms in Taiwan

Author: Chen, Irene Y. L., primary, Wang, Yi-Shun, additional, and Li, Bo-Ruei, additional
Published: 2021
Full Text: View/download PDF

50. Diffuse Pagetoid Squamous Cell Carcinoma in Situ of the Esophagus: A Rare Case Report and Review of Literature.

Author: Chen, Irene Y., Bartell, Nicholas, and Ettel, Mark G.
Subjects: *SQUAMOUS cell carcinoma, *ESOPHAGUS, *LITERATURE reviews
Abstract: Squamous cell carcinoma in situ (SCCIS) with diffuse pagetoid features has been well-described in skin and external genitalia. Diffuse pagetoid SCCIS of the esophagus is extremely rare with only two cases published in the English literature. In this article, we report a rare case of diffuse pagetoid SCCIS of the esophagus in an 89-year-old female with no significant past medical history who presented with dysphagia. Endoscopic examination of the esophagus was remarkable for multiple clean base ulcers spanning 4 cm in the proximal esophagus. Biopsy showed enlarged and hyperchromatic dysplastic cells in the basal half of the epithelium with scattered large individual pagetoid cells as well as several apoptotic dyskeratotic cells in the superficial half of the epithelium. Immunohistochemically, the dysplastic cells were positive for CK7 and p40 with overexpression of p53, and were negative for cytokeratin 20, SOX10, GATA3, CDX2, TTF1. Kreyberg stain was negative for mucin. The histologic features and immunohistochemical profile supported the diagnosis of esophageal diffuse pagetoid SCCIS. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

159 results on '"Chen, Irene Y."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources