56 results on '"Mowery DL"'
Search Results
2. Moonstone: a novel natural language processing system for inferring social risk from clinical narratives
- Author
-
Conway, M, Keyhani, S, Christensen, L, South, BR, Vali, M, Walter, LC, Mowery, DL, Abdelrahman, S, Chapman, WW, Conway, M, Keyhani, S, Christensen, L, South, BR, Vali, M, Walter, LC, Mowery, DL, Abdelrahman, S, and Chapman, WW
- Abstract
BACKGROUND: Social risk factors are important dimensions of health and are linked to access to care, quality of life, health outcomes and life expectancy. However, in the Electronic Health Record, data related to many social risk factors are primarily recorded in free-text clinical notes, rather than as more readily computable structured data, and hence cannot currently be easily incorporated into automated assessments of health. In this paper, we present Moonstone, a new, highly configurable rule-based clinical natural language processing system designed to automatically extract information that requires inferencing from clinical notes. Our initial use case for the tool is focused on the automatic extraction of social risk factor information - in this case, housing situation, living alone, and social support - from clinical notes. Nursing notes, social work notes, emergency room physician notes, primary care notes, hospital admission notes, and discharge summaries, all derived from the Veterans Health Administration, were used for algorithm development and evaluation. RESULTS: An evaluation of Moonstone demonstrated that the system is highly accurate in extracting and classifying the three variables of interest (housing situation, living alone, and social support). The system achieved positive predictive value (i.e. precision) scores ranging from 0.66 (homeless/marginally housed) to 0.98 (lives at home/not homeless), accuracy scores ranging from 0.63 (lives in facility) to 0.95 (lives alone), and sensitivity (i.e. recall) scores ranging from 0.75 (lives in facility) to 0.97 (lives alone). CONCLUSIONS: The Moonstone system is - to the best of our knowledge - the first freely available, open source natural language processing system designed to extract social risk factors from clinical text with good (lives in facility) to excellent (lives alone) performance. Although developed with the social risk factor identification task in mind, Moonstone provides a powerful too
- Published
- 2019
3. Extracting a stroke phenotype risk factor from Veteran Health Administration clinical reports: an information content analysis
- Author
-
Mowery, DL, Chapman, BE, Conway, M, South, BR, Madden, E, Keyhani, S, Chapman, WW, Mowery, DL, Chapman, BE, Conway, M, South, BR, Madden, E, Keyhani, S, and Chapman, WW
- Abstract
BACKGROUND: In the United States, 795,000 people suffer strokes each year; 10-15 % of these strokes can be attributed to stenosis caused by plaque in the carotid artery, a major stroke phenotype risk factor. Studies comparing treatments for the management of asymptomatic carotid stenosis are challenging for at least two reasons: 1) administrative billing codes (i.e., Current Procedural Terminology (CPT) codes) that identify carotid images do not denote which neurovascular arteries are affected and 2) the majority of the image reports are negative for carotid stenosis. Studies that rely on manual chart abstraction can be labor-intensive, expensive, and time-consuming. Natural Language Processing (NLP) can expedite the process of manual chart abstraction by automatically filtering reports with no/insignificant carotid stenosis findings and flagging reports with significant carotid stenosis findings; thus, potentially reducing effort, costs, and time. METHODS: In this pilot study, we conducted an information content analysis of carotid stenosis mentions in terms of their report location (Sections), report formats (structures) and linguistic descriptions (expressions) from Veteran Health Administration free-text reports. We assessed an NLP algorithm, pyConText's, ability to discern reports with significant carotid stenosis findings from reports with no/insignificant carotid stenosis findings given these three document composition factors for two report types: radiology (RAD) and text integration utility (TIU) notes. RESULTS: We observed that most carotid mentions are recorded in prose using categorical expressions, within the Findings and Impression sections for RAD reports and within neither of these designated sections for TIU notes. For RAD reports, pyConText performed with high sensitivity (88 %), specificity (84 %), and negative predictive value (95 %) and reasonable positive predictive value (70 %). For TIU notes, pyConText performed with high specificity (87 %) a
- Published
- 2016
4. Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
- Author
-
Mowery, DL, South, BR, Christensen, L, Leng, J, Peltonen, L-M, Salantera, S, Suominen, H, Martinez, D, Velupillai, S, Elhadad, N, Savova, G, Pradhan, S, Chapman, WW, Mowery, DL, South, BR, Christensen, L, Leng, J, Peltonen, L-M, Salantera, S, Suominen, H, Martinez, D, Velupillai, S, Elhadad, N, Savova, G, Pradhan, S, and Chapman, WW
- Abstract
BACKGROUND: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term. METHODS: In this study, we evaluate 1) accuracy of participating systems' normalizing short forms compared to a majority sense baseline approach, 2) performance of participants' systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems' normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms. RESULTS: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy. CONCLUSION: Shor
- Published
- 2016
5. Realizing the potential of social determinants data in EHR systems: A scoping review of approaches for screening, linkage, extraction, analysis, and interventions.
- Author
-
Li C, Mowery DL, Ma X, Yang R, Vurgun U, Hwang S, Donnelly HK, Bandhey H, Senathirajah Y, Visweswaran S, Sadhu EM, Akhtar Z, Getzen E, Freda PJ, Long Q, and Becich MJ
- Abstract
Background: Social determinants of health (SDoH), such as socioeconomics and neighborhoods, strongly influence health outcomes. However, the current state of standardized SDoH data in electronic health records (EHRs) is lacking, a significant barrier to research and care quality., Methods: We conducted a PubMed search using "SDOH" and "EHR" Medical Subject Headings terms, analyzing included articles across five domains: 1) SDoH screening and assessment approaches, 2) SDoH data collection and documentation, 3) Use of natural language processing (NLP) for extracting SDoH, 4) SDoH data and health outcomes, and 5) SDoH-driven interventions., Results: Of 685 articles identified, 324 underwent full review. Key findings include implementation of tailored screening instruments, census and claims data linkage for contextual SDoH profiles, NLP systems extracting SDoH from notes, associations between SDoH and healthcare utilization and chronic disease control, and integrated care management programs. However, variability across data sources, tools, and outcomes underscores the need for standardization., Discussion: Despite progress in identifying patient social needs, further development of standards, predictive models, and coordinated interventions is critical for SDoH-EHR integration. Additional database searches could strengthen this scoping review. Ultimately, widespread capture, analysis, and translation of multidimensional SDoH data into clinical care is essential for promoting health equity., Competing Interests: There are no conflicts of interest to declare., (© The Author(s) 2024.)
- Published
- 2024
- Full Text
- View/download PDF
6. Safety planning intervention and follow-up: A telehealth service model for suicidal individuals in emergency department settings: Study design and protocol.
- Author
-
Brown GK, Wolk CB, Green KL, Nezir F, Mowery DL, Gallop R, Reilly ME, Stanley B, Mandell DS, Oquendo MA, and Jager-Hyman S
- Subjects
- Adult, Female, Humans, Male, Research Design, Randomized Controlled Trials as Topic, Emergency Service, Hospital organization & administration, Suicide Prevention, Telemedicine organization & administration
- Abstract
Background: The Safety Planning Intervention with follow-up services (SPI+) is a promising suicide prevention intervention, yet many Emergency Departments (EDs) lack the resources for adequate implementation. Comprehensive strategies addressing structural and organizational barriers are needed to optimize SPI+ implementation and scale-up. This protocol describes a test of one strategy in which ED staff connect at-risk patients to expert clinicians from a Suicide Prevention Consultation Center (SPCC) via telehealth., Method: This stepped wedge, cluster-randomized trial compares the effectiveness, implementation, cost, and cost offsets of SPI+ delivered by SPCC clinicians versus ED-based clinicians (enhanced usual care; EUC). Eight EDs will start with EUC and cross over to the SPCC phase. Blocks of two EDs will be randomly assigned to start dates 3 months apart. Approximately 13,320 adults discharged following a suicide-related ED visit will be included; EUC and SPCC samples will comprise patients from before and after SPCC crossover, respectively. Effectiveness data sources are electronic health records, administrative claims, and the National Death Index. Primary effectiveness outcomes are presence of suicidal behavior and number/type of mental healthcare visits and secondary outcomes include number/type of suicide-related acute services 6-months post-discharge. We will use the same data sources to assess cost offsets to gauge SPCC scalability and sustainability. We will examine preliminary implementation outcomes (reach, adoption, fidelity, acceptability, and feasibility) through patient, clinician, and health-system leader interviews and surveys., Conclusion: If the SPCC demonstrates clinical effectiveness and health system cost reduction, it may be a scalable model for evidence-based suicide prevention in the ED., Competing Interests: Declaration of competing interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Drs. Brown and Oquendo receive royalties from the Research Foundation for Mental Hygiene, Inc. for the commercial use of the Columbia-Suicide Severity Rating Scale. Dr. Brown has equity in Oui Therapeutics. Dr. Oquendo consults for Alkermes, MindMed, and Sage Therapeutics. Her family owns stock in Bristol Myers Squibb. The other authors declare that they have no other known competing financial interests or personal relationships that could have appeared to influence the work reported in this manuscript., (Copyright © 2024. Published by Elsevier Inc.)
- Published
- 2024
- Full Text
- View/download PDF
7. Neurological diagnoses in hospitalized COVID-19 patients associated with adverse outcomes: A multinational cohort study.
- Author
-
Hutch MR, Son J, Le TT, Hong C, Wang X, Shakeri Hossein Abad Z, Morris M, Gutiérrez-Sacristán A, Klann JG, Spiridou A, Batugo A, Bellazzi R, Benoit V, Bonzel CL, Bryant WA, Chiudinelli L, Cho K, Das P, González González T, Hanauer DA, Henderson DW, Ho YL, Loh NHW, Makoudjou A, Makwana S, Malovini A, Moal B, Mowery DL, Neuraz A, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Talbert J, Tan ALM, Tan BWL, Tan BWQ, Tibollo V, Tippman P, Verdy G, Yuan W, Avillach P, Gehlenborg N, Omenn GS, Visweswaran S, Cai T, Luo Y, and Xia Z
- Abstract
Few studies examining the patient outcomes of concurrent neurological manifestations during acute COVID-19 leveraged multinational cohorts of adults and children or distinguished between central and peripheral nervous system (CNS vs. PNS) involvement. Using a federated multinational network in which local clinicians and informatics experts curated the electronic health records data, we evaluated the risk of prolonged hospitalization and mortality in hospitalized COVID-19 patients from 21 healthcare systems across 7 countries. For adults, we used a federated learning approach whereby we ran Cox proportional hazard models locally at each healthcare system and performed a meta-analysis on the aggregated results to estimate the overall risk of adverse outcomes across our geographically diverse populations. For children, we reported descriptive statistics separately due to their low frequency of neurological involvement and poor outcomes. Among the 106,229 hospitalized COVID-19 patients (104,031 patients ≥18 years; 2,198 patients <18 years, January 2020-October 2021), 15,101 (14%) had at least one CNS diagnosis, while 2,788 (3%) had at least one PNS diagnosis. After controlling for demographics and pre-existing conditions, adults with CNS involvement had longer hospital stay (11 versus 6 days) and greater risk of (Hazard Ratio = 1.78) and faster time to death (12 versus 24 days) than patients with no neurological condition (NNC) during acute COVID-19 hospitalization. Adults with PNS involvement also had longer hospital stay but lower risk of mortality than the NNC group. Although children had a low frequency of neurological involvement during COVID-19 hospitalization, a substantially higher proportion of children with CNS involvement died compared to those with NNC (6% vs 1%). Overall, patients with concurrent CNS manifestation during acute COVID-19 hospitalization faced greater risks for adverse clinical outcomes than patients without any neurological diagnosis. Our global informatics framework using a federated approach (versus a centralized data collection approach) has utility for clinical discovery beyond COVID-19., Competing Interests: All authors report no competing interests or conflicts of interest. JGK reports a consulting relationship with the i2b2-tranSMART Foundation through Invocate, Inc. RB reports being a shareholder of Biomeris s.r.l. and Engenome s.r.l. DAH reports entitled to royalties from the University of Michigan for licensing of the EMERSE "synonyms". AM’s work is being funded by the Federal Ministry of Education and Research (BMBF) in Germany in the framework of the MIRACUM Consortium. AM reports being a shareholder of Biomeris s.r.l. BM reports being co-founder and equity owner from DESKI. DLM has received research support from the National Institutes of Health, Department of Veteran Affairs, and the University of Pittsburgh/Pittsburgh Health Data Alliance outside of this work. PA reports consulting for CCHMC and BCH. NG is a co-founder and equity owner of Datavisyn. ZX has served as a Consultant for Genentech/Roche. The institution of ZX has received research support from the National Institute of Health, the National Multiple Sclerosis Society, Food and Drug Administration, the Pittsburgh Foundation, the PNC Charitable Trust, the Ethel Vincent Trust, and Genentech / Roche., (Copyright: This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.)
- Published
- 2024
- Full Text
- View/download PDF
8. Realizing the Potential of Social Determinants Data: A Scoping Review of Approaches for Screening, Linkage, Extraction, Analysis and Interventions.
- Author
-
Li C, Mowery DL, Ma X, Yang R, Vurgun U, Hwang S, Donnelly HK, Bandhey H, Akhtar Z, Senathirajah Y, Sadhu EM, Getzen E, Freda PJ, Long Q, and Becich MJ
- Abstract
Background: Social determinants of health (SDoH) like socioeconomics and neighborhoods strongly influence outcomes, yet standardized SDoH data is lacking in electronic health records (EHR), limiting research and care quality., Methods: We searched PubMed using keywords "SDOH" and "EHR", underwent title/abstract and full-text screening. Included records were analyzed under five domains: 1) SDoH screening and assessment approaches, 2) SDoH data collection and documentation, 3) Use of natural language processing (NLP) for extracting SDoH, 4) SDoH data and health outcomes, and 5) SDoH-driven interventions., Results: We identified 685 articles, of which 324 underwent full review. Key findings include tailored screening instruments implemented across settings, census and claims data linkage providing contextual SDoH profiles, rule-based and neural network systems extracting SDoH from notes using NLP, connections found between SDoH data and healthcare utilization/chronic disease control, and integrated care management programs executed. However, considerable variability persists across data sources, tools, and outcomes., Discussion: Despite progress identifying patient social needs, further development of standards, predictive models, and coordinated interventions is critical to fulfill the potential of SDoH-EHR integration. Additional database searches could strengthen this scoping review. Ultimately widespread capture, analysis, and translation of multidimensional SDoH data into clinical care is essential for promoting health equity., Competing Interests: There are no conflicts of interest to declare.
- Published
- 2024
- Full Text
- View/download PDF
9. Developing a Framework to Infer Opioid Use Disorder Severity From Clinical Notes to Inform Natural Language Processing Methods: Characterization Study.
- Author
-
Poulsen MN, Freda PJ, Troiani V, and Mowery DL
- Subjects
- Adult, Humans, Natural Language Processing, Outpatients, Control Groups, Chronic Pain, Opioid-Related Disorders diagnosis
- Abstract
Background: Information regarding opioid use disorder (OUD) status and severity is important for patient care. Clinical notes provide valuable information for detecting and characterizing problematic opioid use, necessitating development of natural language processing (NLP) tools, which in turn requires reliably labeled OUD-relevant text and understanding of documentation patterns., Objective: To inform automated NLP methods, we aimed to develop and evaluate an annotation schema for characterizing OUD and its severity, and to document patterns of OUD-relevant information within clinical notes of heterogeneous patient cohorts., Methods: We developed an annotation schema to characterize OUD severity based on criteria from the Diagnostic and Statistical Manual of Mental Disorders, 5th edition. In total, 2 annotators reviewed clinical notes from key encounters of 100 adult patients with varied evidence of OUD, including patients with and those without chronic pain, with and without medication treatment for OUD, and a control group. We completed annotations at the sentence level. We calculated severity scores based on annotation of note text with 18 classes aligned with criteria for OUD severity and determined positive predictive values for OUD severity., Results: The annotation schema contained 27 classes. We annotated 1436 sentences from 82 patients; notes of 18 patients (11 of whom were controls) contained no relevant information. Interannotator agreement was above 70% for 11 of 15 batches of reviewed notes. Severity scores for control group patients were all 0. Among noncontrol patients, the mean severity score was 5.1 (SD 3.2), indicating moderate OUD, and the positive predictive value for detecting moderate or severe OUD was 0.71. Progress notes and notes from emergency department and outpatient settings contained the most and greatest diversity of information. Substance misuse and psychiatric classes were most prevalent and highly correlated across note types with high co-occurrence across patients., Conclusions: Implementation of the annotation schema demonstrated strong potential for inferring OUD severity based on key information in a small set of clinical notes and highlighting where such information is documented. These advancements will facilitate NLP tool development to improve OUD prevention, diagnosis, and treatment., (©Melissa N Poulsen, Philip J Freda, Vanessa Troiani, Danielle L Mowery. Originally published in JMIR Mental Health (https://mental.jmir.org), 15.01.2024.)
- Published
- 2024
- Full Text
- View/download PDF
10. Patient Phenotyping for Atopic Dermatitis with Transformers and Machine Learning.
- Author
-
Wang A, Fulton R, Hwang S, Margolis DJ, and Mowery DL
- Abstract
Background: Atopic dermatitis (AD) is a chronic skin condition that millions of people around the world live with each day. Performing research studies into identifying the causes and treatment for this disease has great potential to provide benefit for these individuals. However, AD clinical trial recruitment is a non-trivial task due to variance in diagnostic precision and phenotypic definitions leveraged by different clinicians as well as time spent finding, recruiting, and enrolling patients by clinicians to become study subjects. Thus, there is a need for automatic and effective patient phenotyping for cohort recruitment., Objective: Our study aims to present an approach for identifying patients whose electronic health records suggest that they may have AD., Methods: We created a vectorized representation of each patient and trained various supervised machine learning methods to classify when a patient has AD. Each patient is represented by a vector of either probabilities or binary values where each value indicates whether they meet a different criteria for AD diagnosis., Results: The most accurate AD classifier performed with a class-balanced accuracy of 0.8036, a precision of 0.8400, and a recall of 0.7500 when using XGBoost (Extreme Gradient Boosting)., Conclusions: Creating an automated approach for identifying patient cohorts has the potential to accelerate, standardize, and automate the process of patient recruitment for AD studies; therefore, reducing clinician burden and informing knowledge discovery of better treatment options for AD., Competing Interests: Conflicts of Interest DJM is or recently has been a consultant for Pfizer, Leo, and Sanofi with respect to studies of atopic dermatitis and served on an advisory board for the National Eczema Association.
- Published
- 2023
- Full Text
- View/download PDF
11. A voice-based digital assistant for intelligent prompting of evidence-based practices during ICU rounds.
- Author
-
King AJ, Angus DC, Cooper GF, Mowery DL, Seaman JB, Potter KM, Bukowski LA, Al-Khafaji A, Gunn SR, and Kahn JM
- Subjects
- Humans, Natural Language Processing, Teaching Rounds methods, Evidence-Based Practice, Voice, Intensive Care Units
- Abstract
Objective: To evaluate the technical feasibility and potential value of a digital assistant that prompts intensive care unit (ICU) rounding teams to use evidence-based practices based on analysis of their real-time discussions., Methods: We evaluated a novel voice-based digital assistant which audio records and processes the ICU care team's rounding discussions to determine which evidence-based practices are applicable to the patient but have yet to be addressed by the team. The system would then prompt the team to consider indicated but not yet delivered practices, thereby reducing cognitive burden compared to traditional rigid rounding checklists. In a retrospective analysis, we applied automatic transcription, natural language processing, and a rule-based expert system to generate personalized prompts for each patient in 106 audio-recorded ICU rounding discussions. To assess technical feasibility, we compared the system's prompts to those created by experienced critical care nurses who directly observed rounds. To assess potential value, we also compared the system's prompts to a hypothetical paper checklist containing all evidence-based practices., Results: The positive predictive value, negative predictive value, true positive rate, and true negative rate of the system's prompts were 0.45 ± 0.06, 0.83 ± 0.04, 0.68 ± 0.07, and 0.66 ± 0.04, respectively. If implemented in lieu of a paper checklist, the system would generate 56% fewer prompts per patient, with 50%±17% greater precision., Conclusion: A voice-based digital assistant can reduce prompts per patient compared to traditional approaches for improving evidence uptake on ICU rounds. Additional work is needed to evaluate field performance and team acceptance., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
12. Identifying Barriers to Post-Acute Care Referral and Characterizing Negative Patient Preferences Among Hospitalized Older Adults Using Natural Language Processing.
- Author
-
Kennedy EE, Davoudi A, Hwang S, Freda PJ, Urbanowicz R, Bowles KH, and Mowery DL
- Subjects
- Humans, Aged, Subacute Care, Machine Learning, Referral and Consultation, Electronic Health Records, Natural Language Processing, Patient Preference
- Abstract
Our objective was to detect common barriers to post-acute care (B2PAC) among hospitalized older adults using natural language processing (NLP) of clinical notes from patients discharged home when a clinical decision support system recommended post-acute care. We annotated B2PAC sentences from discharge planning notes and developed an NLP classifier to identify the highest-value B2PAC class (negative patient preferences). Thirteen machine learning models were compared with Amazon's AutoGluon deep learning model. The study included 594 acute care notes from 100 patient encounters (1156 sentences contained 11 B2PAC) in a large academic health system. The most frequent and modifiable B2PAC class was negative patient preferences (18.3%). The best supervised model was Extreme Gradient Boosting (F1: 0.859), but the deep learning model performed better (F1: 0.916). Alerting clinicians of negative patient preferences early in the hospitalization can prompt interventions such as patient education to ensure patients receive the right level of care and avoid negative outcomes., (©2022 AMIA - All rights reserved.)
- Published
- 2023
13. Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?
- Author
-
Tan ALM, Getzen EJ, Hutch MR, Strasser ZH, Gutiérrez-Sacristán A, Le TT, Dagliati A, Morris M, Hanauer DA, Moal B, Bonzel CL, Yuan W, Chiudinelli L, Das P, Zhang HG, Aronow BJ, Avillach P, Brat GA, Cai T, Hong C, La Cava WG, Hooi Will Loh H, Luo Y, Murphy SN, Yuan Hgiam K, Omenn GS, Patel LP, Jebathilagam Samayamuthu M, Shriver ER, Shakeri Hossein Abad Z, Tan BWL, Visweswaran S, Wang X, Weber GM, Xia Z, Verdy B, Long Q, Mowery DL, and Holmes JH
- Subjects
- Humans, Data Collection, Records, Cluster Analysis, Electronic Health Records, COVID-19
- Abstract
Background: In electronic health records, patterns of missing laboratory test results could capture patients' course of disease as well as reflect clinician's concerns or worries for possible conditions. These patterns are often understudied and overlooked. This study aims to identify informative patterns of missingness among laboratory data collected across 15 healthcare system sites in three countries for COVID-19 inpatients., Methods: We collected and analyzed demographic, diagnosis, and laboratory data for 69,939 patients with positive COVID-19 PCR tests across three countries from 1 January 2020 through 30 September 2021. We analyzed missing laboratory measurements across sites, missingness stratification by demographic variables, temporal trends of missingness, correlations between labs based on missingness indicators over time, and clustering of groups of labs based on their missingness/ordering pattern., Results: With these analyses, we identified mapping issues faced in seven out of 15 sites. We also identified nuances in data collection and variable definition for the various sites. Temporal trend analyses may support the use of laboratory test result missingness patterns in identifying severe COVID-19 patients. Lastly, using missingness patterns, we determined relationships between various labs that reflect clinical behaviors., Conclusion: In this work, we use computational approaches to relate missingness patterns to hospital treatment capacity and highlight the heterogeneity of looking at COVID-19 over time and at multiple sites, where there might be different phases, policies, etc. Changes in missingness could suggest a change in a patient's condition, and patterns of missingness among laboratory measurements could potentially identify clinical outcomes. This allows sites to consider missing data as informative to analyses and help researchers identify which sites are better poised to study particular questions., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
14. Measuring Performance on the ABCDEF Bundle During Interprofessional Rounds via a Nurse-Based Assessment Tool.
- Author
-
King AJ, Potter KM, Seaman JB, Chiyka EA, Hileman BA, Cooper GF, Mowery DL, Angus DC, and Kahn JM
- Subjects
- Humans, Critical Care, Intensive Care Units, Reproducibility of Results, Checklist, Communication
- Abstract
Background: Nurse-led rounding checklists are a common strategy for facilitating evidence-based practice in the intensive care unit (ICU). To streamline checklist workflow, some ICUs have the nurse or another individual listen to the conversation and customize the checklist for each patient. Such customizations assume that individuals can reliably assess whether checklist items have been addressed., Objective: To evaluate whether 1 critical care nurse can reliably assess checklist items on rounds., Methods: Two nurses performed in-person observation of multidisciplinary ICU rounds. Using a standardized paper-based assessment tool, each nurse indicated whether 17 items related to the ABCDEF bundle were discussed during rounds. For each item, generalizability coefficients were used as a measure of reliability, with a single-rater value of 0.70 or greater considered sufficient to support its assessment by 1 nurse., Results: The nurse observers assessed 118 patient discussions across 15 observation days. For 11 of 17 items (65%), the generalizability coefficient for a single rater met or exceeded the 0.70 threshold. The generalizability coefficients (95% CIs) of a single rater for key items were as follows: pain, 0.86 (0.74-0.97); delirium score, 0.74 (0.64-0.83); agitation score, 0.72 (0.33-1.00); spontaneous awakening trial, 0.67 (0.49-0.83); spontaneous breathing trial, 0.80 (0.70-0.89); mobility, 0.79 (0.69-0.87); and family (future/past) engagement, 0.82 (0.73-0.90)., Conclusion: Using a paper-based assessment tool, a single trained critical care nurse can reliably assess the discussion of elements of the ABCDEF bundle during multidisciplinary rounds., (©2023 American Association of Critical-Care Nurses.)
- Published
- 2023
- Full Text
- View/download PDF
15. Toward Predicting 30-Day Readmission Among Oncology Patients: Identifying Timely and Actionable Risk Factors.
- Author
-
Hwang S, Urbanowicz R, Lynch S, Vernon T, Bresz K, Giraldo C, Kennedy E, Leabhart M, Bleacher T, Ripchinski MR, Mowery DL, and Oyer RA
- Subjects
- Humans, Retrospective Studies, Hospitalization, Risk Factors, Patient Readmission, Neoplasms
- Abstract
Purpose: Predicting 30-day readmission risk is paramount to improving the quality of patient care. In this study, we compare sets of patient-, provider-, and community-level variables that are available at two different points of a patient's inpatient encounter (first 48 hours and the full encounter) to train readmission prediction models and identify possible targets for appropriate interventions that can potentially reduce avoidable readmissions., Methods: Using electronic health record data from a retrospective cohort of 2,460 oncology patients and a comprehensive machine learning analysis pipeline, we trained and tested models predicting 30-day readmission on the basis of data available within the first 48 hours of admission and from the entire hospital encounter., Results: Leveraging all features, the light gradient boosting model produced higher, but comparable performance (area under receiver operating characteristic curve [AUROC]: 0.711) with the Epic model (AUROC: 0.697). Given features in the first 48 hours, the random forest model produces higher AUROC (0.684) than the Epic model (AUROC: 0.676). Both models flagged patients with a similar distribution of race and sex; however, our light gradient boosting and random forest models were more inclusive, flagging more patients among younger age groups. The Epic models were more sensitive to identifying patients with an average lower zip income. Our 48-hour models were powered by novel features at various levels: patient (weight change over 365 days, depression symptoms, laboratory values, and cancer type), hospital (winter discharge and hospital admission type), and community (zip income and marital status of partner)., Conclusion: We developed and validated models comparable with the existing Epic 30-day readmission models with several novel actionable insights that could create service interventions deployed by the case management or discharge planning teams that may decrease readmission rates over time.
- Published
- 2023
- Full Text
- View/download PDF
16. Acute respiratory distress syndrome after SARS-CoV-2 infection on young adult population: International observational federated study based on electronic health records through the 4CE consortium.
- Author
-
Moal B, Orieux A, Ferté T, Neuraz A, Brat GA, Avillach P, Bonzel CL, Cai T, Cho K, Cossin S, Griffier R, Hanauer DA, Haverkamp C, Ho YL, Hong C, Hutch MR, Klann JG, Le TT, Loh NHW, Luo Y, Makoudjou A, Morris M, Mowery DL, Olson KL, Patel LP, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Verdy G, Visweswaran S, Wang X, Weber GM, Xia Z, Yuan W, Zhang HG, Zöller D, Kohane IS, Boyer A, and Jouhet V
- Subjects
- Humans, Young Adult, Aged, Adolescent, Adult, Middle Aged, SARS-CoV-2, Cohort Studies, Retrospective Studies, Electronic Health Records, Obesity complications, COVID-19 complications, COVID-19 epidemiology, Respiratory Distress Syndrome etiology, Respiratory Distress Syndrome complications
- Abstract
Purpose: In young adults (18 to 49 years old), investigation of the acute respiratory distress syndrome (ARDS) after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection has been limited. We evaluated the risk factors and outcomes of ARDS following infection with SARS-CoV-2 in a young adult population., Methods: A retrospective cohort study was conducted between January 1st, 2020 and February 28th, 2021 using patient-level electronic health records (EHR), across 241 United States hospitals and 43 European hospitals participating in the Consortium for Clinical Characterization of COVID-19 by EHR (4CE). To identify the risk factors associated with ARDS, we compared young patients with and without ARDS through a federated analysis. We further compared the outcomes between young and old patients with ARDS., Results: Among the 75,377 hospitalized patients with positive SARS-CoV-2 PCR, 1001 young adults presented with ARDS (7.8% of young hospitalized adults). Their mortality rate at 90 days was 16.2% and they presented with a similar complication rate for infection than older adults with ARDS. Peptic ulcer disease, paralysis, obesity, congestive heart failure, valvular disease, diabetes, chronic pulmonary disease and liver disease were associated with a higher risk of ARDS. We described a high prevalence of obesity (53%), hypertension (38%- although not significantly associated with ARDS), and diabetes (32%)., Conclusion: Trough an innovative method, a large international cohort study of young adults developing ARDS after SARS-CoV-2 infection has been gather. It demonstrated the poor outcomes of this population and associated risk factor., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2023 Moal et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
17. Long-term kidney function recovery and mortality after COVID-19-associated acute kidney injury: An international multi-centre observational cohort study.
- Author
-
Tan BWL, Tan BWQ, Tan ALM, Schriver ER, Gutiérrez-Sacristán A, Das P, Yuan W, Hutch MR, García Barrio N, Pedrera Jimenez M, Abu-El-Rub N, Morris M, Moal B, Verdy G, Cho K, Ho YL, Patel LP, Dagliati A, Neuraz A, Klann JG, South AM, Visweswaran S, Hanauer DA, Maidlow SE, Liu M, Mowery DL, Batugo A, Makoudjou A, Tippmann P, Zöller D, Brat GA, Luo Y, Avillach P, Bellazzi R, Chiovato L, Malovini A, Tibollo V, Samayamuthu MJ, Serrano Balazote P, Xia Z, Loh NHW, Chiudinelli L, Bonzel CL, Hong C, Zhang HG, Weber GM, Kohane IS, Cai T, Omenn GS, Holmes JH, and Ngiam KY
- Abstract
Background: While acute kidney injury (AKI) is a common complication in COVID-19, data on post-AKI kidney function recovery and the clinical factors associated with poor kidney function recovery is lacking., Methods: A retrospective multi-centre observational cohort study comprising 12,891 hospitalized patients aged 18 years or older with a diagnosis of SARS-CoV-2 infection confirmed by polymerase chain reaction from 1 January 2020 to 10 September 2020, and with at least one serum creatinine value 1-365 days prior to admission. Mortality and serum creatinine values were obtained up to 10 September 2021., Findings: Advanced age (HR 2.77, 95%CI 2.53-3.04, p < 0.0001), severe COVID-19 (HR 2.91, 95%CI 2.03-4.17, p < 0.0001), severe AKI (KDIGO stage 3: HR 4.22, 95%CI 3.55-5.00, p < 0.0001), and ischemic heart disease (HR 1.26, 95%CI 1.14-1.39, p < 0.0001) were associated with worse mortality outcomes. AKI severity (KDIGO stage 3: HR 0.41, 95%CI 0.37-0.46, p < 0.0001) was associated with worse kidney function recovery, whereas remdesivir use (HR 1.34, 95%CI 1.17-1.54, p < 0.0001) was associated with better kidney function recovery. In a subset of patients without chronic kidney disease, advanced age (HR 1.38, 95%CI 1.20-1.58, p < 0.0001), male sex (HR 1.67, 95%CI 1.45-1.93, p < 0.0001), severe AKI (KDIGO stage 3: HR 11.68, 95%CI 9.80-13.91, p < 0.0001), and hypertension (HR 1.22, 95%CI 1.10-1.36, p = 0.0002) were associated with post-AKI kidney function impairment. Furthermore, patients with COVID-19-associated AKI had significant and persistent elevations of baseline serum creatinine 125% or more at 180 days (RR 1.49, 95%CI 1.32-1.67) and 365 days (RR 1.54, 95%CI 1.21-1.96) compared to COVID-19 patients with no AKI., Interpretation: COVID-19-associated AKI was associated with higher mortality, and severe COVID-19-associated AKI was associated with worse long-term post-AKI kidney function recovery., Funding: Authors are supported by various funders, with full details stated in the acknowledgement section., Competing Interests: Dr Hanauer reported having developed an electronic resource of clinical synonyms, EMERSE, that is licensed by the University of Michigan and receiving a portion of the licensing fees for this resource outside the submitted work. Dr Omenn reported being an early investor and serving on the board of Angion Biomedica Corporation, New York, which has conducted clinical trials of drug candidates for overcoming acute kidney injury following cardiopulmonary surgery or kidney transplantation. The former was terminated early based on unsatisfactory efficacy/adverse effects assessment; the latter had insufficient benefit to warrant proposing a Phase III trial. The company is moving in other directions, to be determined. No further work on kidney is anticipated. Dr Holmes disclosed participation as an NIH/NIDDK T2 Coach R01DK113189. Dr Malovini disclosed being a shareholder of Biomeris s.r.l. Dr Bellazzi reported receiving honoraria from Pfizer, and disclosed being a shareholder of University of Pavia spin-off Biomeris. Dr Klann reports consulting fees from i2b2 tranSMART foundation, for work to enhance open-source data warehouse platform. He reports no direct relationship to this work, except that the data model for analysis in this manuscript was inspired by this platform., (© 2022 Published by Elsevier Ltd.)
- Published
- 2022
- Full Text
- View/download PDF
18. SurvMaximin: Robust federated approach to transporting survival risk prediction models.
- Author
-
Wang X, Zhang HG, Xiong X, Hong C, Weber GM, Brat GA, Bonzel CL, Luo Y, Duan R, Palmer NP, Hutch MR, Gutiérrez-Sacristán A, Bellazzi R, Chiovato L, Cho K, Dagliati A, Estiri H, García-Barrio N, Griffier R, Hanauer DA, Ho YL, Holmes JH, Keller MS, Klann MEng JG, L'Yi S, Lozano-Zahonero S, Maidlow SE, Makoudjou A, Malovini A, Moal B, Moore JH, Morris M, Mowery DL, Murphy SN, Neuraz A, Yuan Ngiam K, Omenn GS, Patel LP, Pedrera-Jiménez M, Prunotto A, Jebathilagam Samayamuthu M, Sanz Vidorreta FJ, Schriver ER, Schubert P, Serrano-Balazote P, South AM, Tan ALM, Tan BWL, Tibollo V, Tippmann P, Visweswaran S, Xia Z, Yuan W, Zöller D, Kohane IS, Avillach P, Guo Z, and Cai T
- Subjects
- Humans, Privacy, Proportional Hazards Models, Survival Analysis, Algorithms, Electronic Health Records
- Abstract
Objective: For multi-center heterogeneous Real-World Data (RWD) with time-to-event outcomes and high-dimensional features, we propose the SurvMaximin algorithm to estimate Cox model feature coefficients for a target population by borrowing summary information from a set of health care centers without sharing patient-level information., Materials and Methods: For each of the centers from which we want to borrow information to improve the prediction performance for the target population, a penalized Cox model is fitted to estimate feature coefficients for the center. Using estimated feature coefficients and the covariance matrix of the target population, we then obtain a SurvMaximin estimated set of feature coefficients for the target population. The target population can be an entire cohort comprised of all centers, corresponding to federated learning, or a single center, corresponding to transfer learning., Results: Simulation studies and a real-world international electronic health records application study, with 15 participating health care centers across three countries (France, Germany, and the U.S.), show that the proposed SurvMaximin algorithm achieves comparable or higher accuracy compared with the estimator using only the information of the target site and other existing methods. The SurvMaximin estimator is robust to variations in sample sizes and estimated feature coefficients between centers, which amounts to significantly improved estimates for target sites with fewer observations., Conclusions: The SurvMaximin method is well suited for both federated and transfer learning in the high-dimensional survival analysis setting. SurvMaximin only requires a one-time summary information exchange from participating centers. Estimated regression vectors can be very heterogeneous. SurvMaximin provides robust Cox feature coefficient estimates without outcome information in the target population and is privacy-preserving., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2022. Published by Elsevier Inc.)
- Published
- 2022
- Full Text
- View/download PDF
19. Initial Development of an Automated Platform for Assessing Trainee Performance on Case Presentations.
- Author
-
King AJ, Kahn JM, Brant EB, Cooper GF, and Mowery DL
- Abstract
Background: Oral case presentation is a crucial skill of physicians and a key component of team-based care. However, consistent and objective assessment and feedback on presentations during training are infrequent., Objective: To determine the potential value of applying natural language processing, computer software that extracts meaning from text, to transcripts of oral case presentations as a strategy to assess their quality automatically and objectively., Methods: We transcribed a collection of simulated oral case presentations. The presentations were from eight critical care fellows and one critical care attending. They were instructed to review the medical charts of 11 real intensive care unit patient cases and to audio record themselves, presenting each case as if they were doing so on morning rounds. We then used natural language processing to convert the transcripts from human-readable text into machine-readable numbers. These numbers represent details of the presentation style and content. The distance between the numeric representation of two different transcripts negatively correlates with the similarity of those two transcripts. We ranked fellows on the basis of how similar their presentations were to the attending's presentations., Results: The 99 presentations included 260 minutes of audio (mean length: 2.6 ± 1.24 min per case). On average, 23.88 ± 2.65 sentences were spoken, and each sentence had 14.10 ± 0.67 words, 3.62 ± 0.15 medical concepts, and 0.75 ± 0.09 medical adjectives. When ranking fellows on the basis of how similar their presentations were to the attending's presentation, we found a gap between the five fellows with the most similar presentations and the three fellows with the least similar presentations (average group similarity scores of 0.62 ± 0.01 and 0.53 ± 0.01, respectively). Rankings were sensitive to whether presentation style or content information were weighted more heavily when calculating transcript similarity., Conclusion: Natural language processing enabled the ranking of case presentations on the basis of how similar they were to a reference presentation. Although additional work is needed to convert these rankings, and underlying similarity scores, into actionable feedback for trainees, these methods may support new tools for improving medical education., (Copyright © 2022 by the American Thoracic Society.)
- Published
- 2022
- Full Text
- View/download PDF
20. Changes in laboratory value improvement and mortality rates over the course of the pandemic: an international retrospective cohort study of hospitalised patients infected with SARS-CoV-2.
- Author
-
Hong C, Zhang HG, L'Yi S, Weber G, Avillach P, Tan BWQ, Gutiérrez-Sacristán A, Bonzel CL, Palmer NP, Malovini A, Tibollo V, Luo Y, Hutch MR, Liu M, Bourgeois F, Bellazzi R, Chiovato L, Sanz Vidorreta FJ, Le TT, Wang X, Yuan W, Neuraz A, Benoit V, Moal B, Morris M, Hanauer DA, Maidlow S, Wagholikar K, Murphy S, Estiri H, Makoudjou A, Tippmann P, Klann J, Follett RW, Gehlenborg N, Omenn GS, Xia Z, Dagliati A, Visweswaran S, Patel LP, Mowery DL, Schriver ER, Samayamuthu MJ, Kavuluru R, Lozano-Zahonero S, Zöller D, Tan ALM, Tan BWL, Ngiam KY, Holmes JH, Schubert P, Cho K, Ho YL, Beaulieu-Jones BK, Pedrera-Jiménez M, García-Barrio N, Serrano-Balazote P, Kohane I, South A, Brat GA, and Cai T
- Subjects
- Hospitalization, Humans, Retrospective Studies, SARS-CoV-2, COVID-19, Pandemics
- Abstract
Objective: To assess changes in international mortality rates and laboratory recovery rates during hospitalisation for patients hospitalised with SARS-CoV-2 between the first wave (1 March to 30 June 2020) and the second wave (1 July 2020 to 31 January 2021) of the COVID-19 pandemic., Design, Setting and Participants: This is a retrospective cohort study of 83 178 hospitalised patients admitted between 7 days before or 14 days after PCR-confirmed SARS-CoV-2 infection within the Consortium for Clinical Characterization of COVID-19 by Electronic Health Record, an international multihealthcare system collaborative of 288 hospitals in the USA and Europe. The laboratory recovery rates and mortality rates over time were compared between the two waves of the pandemic., Primary and Secondary Outcome Measures: The primary outcome was all-cause mortality rate within 28 days after hospitalisation stratified by predicted low, medium and high mortality risk at baseline. The secondary outcome was the average rate of change in laboratory values during the first week of hospitalisation., Results: Baseline Charlson Comorbidity Index and laboratory values at admission were not significantly different between the first and second waves. The improvement in laboratory values over time was faster in the second wave compared with the first. The average C reactive protein rate of change was -4.72 mg/dL vs -4.14 mg/dL per day (p=0.05). The mortality rates within each risk category significantly decreased over time, with the most substantial decrease in the high-risk group (42.3% in March-April 2020 vs 30.8% in November 2020 to January 2021, p<0.001) and a moderate decrease in the intermediate-risk group (21.5% in March-April 2020 vs 14.3% in November 2020 to January 2021, p<0.001)., Conclusions: Admission profiles of patients hospitalised with SARS-CoV-2 infection did not differ greatly between the first and second waves of the pandemic, but there were notable differences in laboratory improvement rates during hospitalisation. Mortality risks among patients with similar risk profiles decreased over the course of the pandemic. The improvement in laboratory values and mortality risk was consistent across multiple countries., Competing Interests: Competing interests: None declared., (© Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.)
- Published
- 2022
- Full Text
- View/download PDF
21. International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality.
- Author
-
Weber GM, Hong C, Xia Z, Palmer NP, Avillach P, L'Yi S, Keller MS, Murphy SN, Gutiérrez-Sacristán A, Bonzel CL, Serret-Larmande A, Neuraz A, Omenn GS, Visweswaran S, Klann JG, South AM, Loh NHW, Cannataro M, Beaulieu-Jones BK, Bellazzi R, Agapito G, Alessiani M, Aronow BJ, Bell DS, Benoit V, Bourgeois FT, Chiovato L, Cho K, Dagliati A, DuVall SL, Barrio NG, Hanauer DA, Ho YL, Holmes JH, Issitt RW, Liu M, Luo Y, Lynch KE, Maidlow SE, Malovini A, Mandl KD, Mao C, Matheny ME, Moore JH, Morris JS, Morris M, Mowery DL, Ngiam KY, Patel LP, Pedrera-Jimenez M, Ramoni RB, Schriver ER, Schubert P, Balazote PS, Spiridou A, Tan ALM, Tan BWL, Tibollo V, Torti C, Trecarichi EM, Wang X, Kohane IS, Cai T, and Brat GA
- Abstract
Given the growing number of prediction algorithms developed to predict COVID-19 mortality, we evaluated the transportability of a mortality prediction algorithm using a multi-national network of healthcare systems. We predicted COVID-19 mortality using baseline commonly measured laboratory values and standard demographic and clinical covariates across healthcare systems, countries, and continents. Specifically, we trained a Cox regression model with nine measured laboratory test values, standard demographics at admission, and comorbidity burden pre-admission. These models were compared at site, country, and continent level. Of the 39,969 hospitalized patients with COVID-19 (68.6% male), 5717 (14.3%) died. In the Cox model, age, albumin, AST, creatine, CRP, and white blood cell count are most predictive of mortality. The baseline covariates are more predictive of mortality during the early days of COVID-19 hospitalization. Models trained at healthcare systems with larger cohort size largely retain good transportability performance when porting to different sites. The combination of routine laboratory test values at admission along with basic demographic features can predict mortality in patients hospitalized with COVID-19. Importantly, this potentially deployable model differs from prior work by demonstrating not only consistent performance but also reliable transportability across healthcare systems in the US and Europe, highlighting the generalizability of this model and the overall approach., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
22. Using Natural Language Processing to Classify Serious Illness Communication with Oncology Patients.
- Author
-
Davoudi A, Tissot H, Doucette A, Gabriel PE, Parikh R, Mowery DL, and Miranda SP
- Abstract
One core measure of healthcare quality set forth by the Institute of Medicine is whether care decisions match patient goals. High-quality "serious illness communication" about patient goals and prognosis is required to support patient-centered decision-making, however current methods are not sensitive enough to measure the quality of this communication or determine whether care delivered matches patient priorities. Natural language processing (NLP) offers an efficient method for identification and evaluation of documented serious illness communication, which could serve as the basis for future quality metrics in oncology and other forms of serious illness. In this study, we trained NLP algorithms to identify and characterize serious illness communication with oncology patients., (©2022 AMIA - All rights reserved.)
- Published
- 2022
23. Determinants of hospital outcomes for patients with COVID-19 in the University of Pennsylvania Health System.
- Author
-
Shaw PA, Yang JB, Mowery DL, Schriver ER, Mahoney KB, Bar KJ, and Ellenberg SS
- Subjects
- Aftercare, Hospitals, Humans, Patient Discharge, SARS-CoV-2, COVID-19 epidemiology, COVID-19 therapy
- Abstract
There is growing evidence that racial and ethnic minorities bear a disproportionate burden from COVID-19. Temporal changes in the pandemic epidemiology and diversity in the clinical course require careful study to identify determinants of poor outcomes. We analyzed 6255 hospitalized individuals with PCR-confirmed SARS-CoV-2 infection from one of 5 hospitals in the University of Pennsylvania Health System between March 2020 and March 2021, using electronic health records to assess risk factors and outcomes through 8 weeks post-admission. Discharge, readmission and mortality outcomes were analyzed in a multi-state model with multivariable Cox models for each transition. Mortality varied markedly over time, with cumulative incidence (95% CI) 30 days post-admission of 19.1% (16.9, 21.3) in March-April 2020, 5.7% (4.2, 7.5) in July-October 2020 and 10.5% (9.1,12.0) in January-March 2021; 26% of deaths occurred after discharge. Average age (SD) at admission varied from 62.7 (17.6) to 54.8 (19.9) to 60.5 (18.1); mechanical ventilation use declined from 21.3% to 9-11%. Compared to Caucasian, Black race was associated with more severe disease at admission, higher rates of co-morbidities and residing in a low-income zip code. Between-race risk differences in mortality risk diminished in multivariable models; while admitting hospital, increasing age, admission early in the pandemic, and severe disease and low blood pressure at admission were associated with increased mortality hazard. Hispanic ethnicity was associated with fewer baseline co-morbidities and lower mortality hazard (0.57, 95% CI: 0.37, .087). Multi-state modeling allows for a unified framework to analyze multiple outcomes throughout the disease course. Morbidity and mortality for hospitalized COVID-19 patients varied over time but post-discharge mortality remained non-trivial. Black race was associated with more risk factors for morbidity and with treatment at hospitals with lower mortality. Multivariable models suggest there are not between-race differences in outcomes. Future work is needed to better understand the identified between-hospital differences in mortality., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2022
- Full Text
- View/download PDF
24. Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing.
- Author
-
Poulsen MN, Freda PJ, Troiani V, Davoudi A, and Mowery DL
- Subjects
- Analgesics, Opioid, Hospitals, Humans, Patient Discharge, Pilot Projects, Natural Language Processing, Opioid-Related Disorders
- Abstract
Background: Opioid use disorder (OUD) is underdiagnosed in health system settings, limiting research on OUD using electronic health records (EHRs). Medical encounter notes can enrich structured EHR data with documented signs and symptoms of OUD and social risks and behaviors. To capture this information at scale, natural language processing (NLP) tools must be developed and evaluated. We developed and applied an annotation schema to deeply characterize OUD and related clinical, behavioral, and environmental factors, and automated the annotation schema using machine learning and deep learning-based approaches., Methods: Using the MIMIC-III Critical Care Database, we queried hospital discharge summaries of patients with International Classification of Diseases (ICD-9) OUD diagnostic codes. We developed an annotation schema to characterize problematic opioid use, identify individuals with potential OUD, and provide psychosocial context. Two annotators reviewed discharge summaries from 100 patients. We randomly sampled patients with their associated annotated sentences and divided them into training (66 patients; 2,127 annotated sentences) and testing (29 patients; 1,149 annotated sentences) sets. We used the training set to generate features, employing three NLP algorithms/knowledge sources. We trained and tested prediction models for classification with a traditional machine learner (logistic regression) and deep learning approach (Autogluon based on ELECTRA's replaced token detection model). We applied a five-fold cross-validation approach to reduce bias in performance estimates., Results: The resulting annotation schema contained 32 classes. We achieved moderate inter-annotator agreement, with F
1 -scores across all classes increasing from 48 to 66%. Five classes had a sufficient number of annotations for automation; of these, we observed consistently high performance (F1 -scores) across training and testing sets for drug screening (training: 91-96; testing: 91-94) and opioid type (training: 86-96; testing: 86-99). Performance dropped from training and to testing sets for other drug use (training: 52-65; testing: 40-48), pain management (training: 72-78; testing: 61-78) and psychiatric (training: 73-80; testing: 72). Autogluon achieved the highest performance., Conclusion: This pilot study demonstrated that rich information regarding problematic opioid use can be manually identified by annotators. However, more training samples and features would improve our ability to reliably identify less common classes from clinical text, including text from outpatient settings., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2022 Poulsen, Freda, Troiani, Davoudi and Mowery.)- Published
- 2022
- Full Text
- View/download PDF
25. Performance of polygenic risk scores for cancer prediction in a racially diverse academic biobank.
- Author
-
Wang L, Desai H, Verma SS, Le A, Hausler R, Verma A, Judy R, Doucette A, Gabriel PE, Nathanson KL, Damrauer SM, Mowery DL, Ritchie MD, Kember RL, and Maxwell KN
- Subjects
- Biological Specimen Banks, Black People genetics, Female, Genetic Predisposition to Disease, Humans, Male, Risk Factors, White People genetics, Genome-Wide Association Study, Multifactorial Inheritance, Neoplasms ethnology, Neoplasms genetics
- Abstract
Purpose: Genome-wide association studies have identified hundreds of single nucleotide variations (formerly single nucleotide polymorphisms) associated with several cancers, but the predictive ability of polygenic risk scores (PRSs) is unclear, especially among non-Whites., Methods: PRSs were derived from genome-wide significant single-nucleotide variations for 15 cancers in 20,079 individuals in an academic biobank. We evaluated the improvement in discriminatory accuracy by including cancer-specific PRS in patients of genetically-determined African and European ancestry., Results: Among the individuals of European genetic ancestry, PRSs for breast, colon, melanoma, and prostate were significantly associated with their respective cancers. Among the individuals of African genetic ancestry, PRSs for breast, colon, prostate, and thyroid were significantly associated with their respective cancers. The area under the curve of the model consisting of age, sex, and principal components was 0.621 to 0.710, and it increased by 1% to 4% with the inclusion of PRS in individuals of European genetic ancestry. In individuals of African genetic ancestry, area under the curve was overall higher in the model without the PRS (0.723-0.810) but increased by <1% with the inclusion of PRS for most cancers., Conclusion: PRS moderately increased the ability to discriminate the cancer status in individuals of European but not African ancestry. Further large-scale studies are needed to identify ancestry-specific genetic factors in non-White populations to incorporate PRS into cancer risk assessment., Competing Interests: Conflict of Interest S.M.D. receives research support to the University of Pennsylvania from RenalytixAI and consulting fees from Calico Labs, both outside the current work. S.M.D. is named as a coinventor on a Government-owned US Patent application related to the use of genetic risk prediction for venous thromboembolic disease filed by the US Department of Veterans Affairs in accordance with Federal regulatory requirements. M.D.R. is on the scientific advisory board for Goldfinch Bio and Cipherome. The remaining authors declare no conflicts of interest., (Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2022
- Full Text
- View/download PDF
26. Authorship Correction: International Changes in COVID-19 Clinical Trajectories Across 315 Hospitals and 6 Countries: Retrospective Cohort Study.
- Author
-
Weber GM, Zhang HG, L'Yi S, Bonzel CL, Hong C, Avillach P, Gutiérrez-Sacristán A, Palmer NP, Tan ALM, Wang X, Yuan W, Gehlenborg N, Alloni A, Amendola DF, Bellasi A, Bellazzi R, Beraghi M, Bucalo M, Chiovato L, Cho K, Dagliati A, Estiri H, Follett RW, García Barrio N, Hanauer DA, Henderson DW, Ho YL, Holmes JH, Hutch MR, Kavuluru R, Kirchoff K, Klann JG, Krishnamurthy AK, Le TT, Liu M, Loh NHW, Lozano-Zahonero S, Luo Y, Maidlow S, Makoudjou A, Malovini A, Martins MR, Moal B, Morris M, Mowery DL, Murphy SN, Neuraz A, Ngiam KY, Okoshi MP, Omenn GS, Patel LP, Pedrera Jiménez M, Prudente RA, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Serrano Balazote P, Tan BW, Tanni SE, Tibollo V, Visweswaran S, Wagholikar KB, Xia Z, Zöller D, Kohane IS, Cai T, South AM, and Brat GA
- Abstract
[This corrects the article DOI: 10.2196/31400.]., (©Griffin M Weber, Harrison G Zhang, Sehi L'Yi, Clara-Lea Bonzel, Chuan Hong, Paul Avillach, Alba Gutiérrez-Sacristán, Nathan P Palmer, Amelia Li Min Tan, Xuan Wang, William Yuan, Nils Gehlenborg, Anna Alloni, Danilo F Amendola, Antonio Bellasi, Riccardo Bellazzi, Michele Beraghi, Mauro Bucalo, Luca Chiovato, Kelly Cho, Arianna Dagliati, Hossein Estiri, Robert W Follett, Noelia García Barrio, David A Hanauer, Darren W Henderson, Yuk-Lam Ho, John H Holmes, Meghan R Hutch, Ramakanth Kavuluru, Katie Kirchoff, Jeffrey G Klann, Ashok K Krishnamurthy, Trang T Le, Molei Liu, Ne Hooi Will Loh, Sara Lozano-Zahonero, Yuan Luo, Sarah Maidlow, Adeline Makoudjou, Alberto Malovini, Marcelo Roberto Martins, Bertrand Moal, Michele Morris, Danielle L Mowery, Shawn N Murphy, Antoine Neuraz, Kee Yuan Ngiam, Marina P Okoshi, Gilbert S Omenn, Lav P Patel, Miguel Pedrera Jiménez, Robson A Prudente, Malarkodi Jebathilagam Samayamuthu, Fernando J Sanz Vidorreta, Emily R Schriver, Petra Schubert, Pablo Serrano Balazote, Byorn WL Tan, Suzana E Tanni, Valentina Tibollo, Shyam Visweswaran, Kavishwar B Wagholikar, Zongqi Xia, Daniela Zöller, The Consortium for Clinical Characterization of COVID-19 by EHR (4CE), Isaac S Kohane, Tianxi Cai, Andrew M South, Gabriel A Brat. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 30.11.2021.)
- Published
- 2021
- Full Text
- View/download PDF
27. Multinational characterization of neurological phenotypes in patients hospitalized with COVID-19.
- Author
-
Le TT, Gutiérrez-Sacristán A, Son J, Hong C, South AM, Beaulieu-Jones BK, Loh NHW, Luo Y, Morris M, Ngiam KY, Patel LP, Samayamuthu MJ, Schriver E, Tan ALM, Moore J, Cai T, Omenn GS, Avillach P, Kohane IS, Visweswaran S, Mowery DL, and Xia Z
- Subjects
- Adolescent, Adult, Aged, Aged, 80 and over, Child, Child, Preschool, Cross-Sectional Studies, Female, Humans, Infant, Infant, Newborn, Male, Middle Aged, Prevalence, Severity of Illness Index, Young Adult, COVID-19 complications, COVID-19 epidemiology, Nervous System Diseases epidemiology, Nervous System Diseases etiology, Pandemics
- Abstract
Neurological complications worsen outcomes in COVID-19. To define the prevalence of neurological conditions among hospitalized patients with a positive SARS-CoV-2 reverse transcription polymerase chain reaction test in geographically diverse multinational populations during early pandemic, we used electronic health records (EHR) from 338 participating hospitals across 6 countries and 3 continents (January-September 2020) for a cross-sectional analysis. We assessed the frequency of International Classification of Disease code of neurological conditions by countries, healthcare systems, time before and after admission for COVID-19 and COVID-19 severity. Among 35,177 hospitalized patients with SARS-CoV-2 infection, there was an increase in the proportion with disorders of consciousness (5.8%, 95% confidence interval [CI] 3.7-7.8%, p
FDR < 0.001) and unspecified disorders of the brain (8.1%, 5.7-10.5%, pFDR < 0.001) when compared to the pre-admission proportion. During hospitalization, the relative risk of disorders of consciousness (22%, 19-25%), cerebrovascular diseases (24%, 13-35%), nontraumatic intracranial hemorrhage (34%, 20-50%), encephalitis and/or myelitis (37%, 17-60%) and myopathy (72%, 67-77%) were higher for patients with severe COVID-19 when compared to those who never experienced severe COVID-19. Leveraging a multinational network to capture standardized EHR data, we highlighted the increased prevalence of central and peripheral neurological phenotypes in patients hospitalized with COVID-19, particularly among those with severe disease., (© 2021. The Author(s).)- Published
- 2021
- Full Text
- View/download PDF
28. International Changes in COVID-19 Clinical Trajectories Across 315 Hospitals and 6 Countries: Retrospective Cohort Study.
- Author
-
Weber GM, Zhang HG, L'Yi S, Bonzel CL, Hong C, Avillach P, Gutiérrez-Sacristán A, Palmer NP, Tan ALM, Wang X, Yuan W, Gehlenborg N, Alloni A, Amendola DF, Bellasi A, Bellazzi R, Beraghi M, Bucalo M, Chiovato L, Cho K, Dagliati A, Estiri H, Follett RW, García Barrio N, Hanauer DA, Henderson DW, Ho YL, Holmes JH, Hutch MR, Kavuluru R, Kirchoff K, Klann JG, Krishnamurthy AK, Le TT, Liu M, Loh NHW, Lozano-Zahonero S, Luo Y, Maidlow S, Makoudjou A, Malovini A, Martins MR, Moal B, Morris M, Mowery DL, Murphy SN, Neuraz A, Ngiam KY, Okoshi MP, Omenn GS, Patel LP, Pedrera Jiménez M, Prudente RA, Samayamuthu MJ, Sanz Vidorreta FJ, Schriver ER, Schubert P, Serrano Balazote P, Tan BW, Tanni SE, Tibollo V, Visweswaran S, Wagholikar KB, Xia Z, Zöller D, Kohane IS, Cai T, South AM, and Brat GA
- Subjects
- Adult, Aged, Female, Hospitalization, Hospitals, Humans, Male, Middle Aged, Retrospective Studies, SARS-CoV-2, COVID-19, Pandemics
- Abstract
Background: Many countries have experienced 2 predominant waves of COVID-19-related hospitalizations. Comparing the clinical trajectories of patients hospitalized in separate waves of the pandemic enables further understanding of the evolving epidemiology, pathophysiology, and health care dynamics of the COVID-19 pandemic., Objective: In this retrospective cohort study, we analyzed electronic health record (EHR) data from patients with SARS-CoV-2 infections hospitalized in participating health care systems representing 315 hospitals across 6 countries. We compared hospitalization rates, severe COVID-19 risk, and mean laboratory values between patients hospitalized during the first and second waves of the pandemic., Methods: Using a federated approach, each participating health care system extracted patient-level clinical data on their first and second wave cohorts and submitted aggregated data to the central site. Data quality control steps were adopted at the central site to correct for implausible values and harmonize units. Statistical analyses were performed by computing individual health care system effect sizes and synthesizing these using random effect meta-analyses to account for heterogeneity. We focused the laboratory analysis on C-reactive protein (CRP), ferritin, fibrinogen, procalcitonin, D-dimer, and creatinine based on their reported associations with severe COVID-19., Results: Data were available for 79,613 patients, of which 32,467 were hospitalized in the first wave and 47,146 in the second wave. The prevalence of male patients and patients aged 50 to 69 years decreased significantly between the first and second waves. Patients hospitalized in the second wave had a 9.9% reduction in the risk of severe COVID-19 compared to patients hospitalized in the first wave (95% CI 8.5%-11.3%). Demographic subgroup analyses indicated that patients aged 26 to 49 years and 50 to 69 years; male and female patients; and black patients had significantly lower risk for severe disease in the second wave than in the first wave. At admission, the mean values of CRP were significantly lower in the second wave than in the first wave. On the seventh hospital day, the mean values of CRP, ferritin, fibrinogen, and procalcitonin were significantly lower in the second wave than in the first wave. In general, countries exhibited variable changes in laboratory testing rates from the first to the second wave. At admission, there was a significantly higher testing rate for D-dimer in France, Germany, and Spain., Conclusions: Patients hospitalized in the second wave were at significantly lower risk for severe COVID-19. This corresponded to mean laboratory values in the second wave that were more likely to be in typical physiological ranges on the seventh hospital day compared to the first wave. Our federated approach demonstrated the feasibility and power of harmonizing heterogeneous EHR data from multiple international health care systems to rapidly conduct large-scale studies to characterize how COVID-19 clinical trajectories evolve., (©Griffin M Weber, Harrison G Zhang, Sehi L'Yi, Clara-Lea Bonzel, Chuan Hong, Paul Avillach, Alba Gutiérrez-Sacristán, Nathan P Palmer, Amelia Li Min Tan, Xuan Wang, William Yuan, Nils Gehlenborg, Anna Alloni, Danilo F Amendola, Antonio Bellasi, Riccardo Bellazzi, Michele Beraghi, Mauro Bucalo, Luca Chiovato, Kelly Cho, Arianna Dagliati, Hossein Estiri, Robert W Follett, Noelia García Barrio, David A Hanauer, Darren W Henderson, Yuk-Lam Ho, John H Holmes, Meghan R Hutch, Ramakanth Kavuluru, Katie Kirchoff, Jeffrey G Klann, Ashok K Krishnamurthy, Trang T Le, Molei Liu, Ne Hooi Will Loh, Sara Lozano-Zahonero, Yuan Luo, Sarah Maidlow, Adeline Makoudjou, Alberto Malovini, Marcelo Roberto Martins, Bertrand Moal, Michele Morris, Danielle L Mowery, Shawn N Murphy, Antoine Neuraz, Kee Yuan Ngiam, Marina P Okoshi, Gilbert S Omenn, Lav P Patel, Miguel Pedrera Jiménez, Robson A Prudente, Malarkodi Jebathilagam Samayamuthu, Fernando J Sanz Vidorreta, Emily R Schriver, Petra Schubert, Pablo Serrano Balazote, Byorn WL Tan, Suzana E Tanni, Valentina Tibollo, Shyam Visweswaran, Kavishwar B Wagholikar, Zongqi Xia, Daniela Zöller, The Consortium For Clinical Characterization Of COVID-19 By EHR (4CE), Isaac S Kohane, Tianxi Cai, Andrew M South, Gabriel A Brat. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 11.10.2021.)
- Published
- 2021
- Full Text
- View/download PDF
29. Validation of an internationally derived patient severity phenotype to support COVID-19 analytics from electronic health record data.
- Author
-
Klann JG, Estiri H, Weber GM, Moal B, Avillach P, Hong C, Tan ALM, Beaulieu-Jones BK, Castro V, Maulhardt T, Geva A, Malovini A, South AM, Visweswaran S, Morris M, Samayamuthu MJ, Omenn GS, Ngiam KY, Mandl KD, Boeker M, Olson KL, Mowery DL, Follett RW, Hanauer DA, Bellazzi R, Moore JH, Loh NW, Bell DS, Wagholikar KB, Chiovato L, Tibollo V, Rieg S, Li ALLJ, Jouhet V, Schriver E, Xia Z, Hutch M, Luo Y, Kohane IS, Brat GA, and Murphy SN
- Subjects
- Hospitalization, Humans, Machine Learning, Prognosis, ROC Curve, Sensitivity and Specificity, COVID-19 classification, Electronic Health Records, Severity of Illness Index
- Abstract
Objective: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing coronavirus disease 2019 (COVID-19) with federated analyses of electronic health record (EHR) data. We sought to develop and validate a computable phenotype for COVID-19 severity., Materials and Methods: Twelve 4CE sites participated. First, we developed an EHR-based severity phenotype consisting of 6 code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of intensive care unit (ICU) admission and/or death. We also piloted an alternative machine learning approach and compared selected predictors of severity with the 4CE phenotype at 1 site., Results: The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability-up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean area under the curve of 0.903 (95% confidence interval, 0.886-0.921), compared with an area under the curve of 0.956 (95% confidence interval, 0.952-0.959) for the machine learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared with chart review., Discussion: We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly owing to heterogeneous pandemic conditions., Conclusions: We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites., (© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.)
- Published
- 2021
- Full Text
- View/download PDF
30. Effect of Targeted Messaging on Return to In-Person Visits During the COVID-19 Pandemic: A Randomized Clinical Trial.
- Author
-
Cappola AR, Schriver ER, Mowery DL, Wollack C, Ives CT, Gonzales R, and Cappella JN
- Subjects
- Adolescent, Adult, Aged, Aged, 80 and over, Correspondence as Topic, Electronic Mail, Female, Humans, Male, Middle Aged, Office Visits, SARS-CoV-2, Telemedicine, Young Adult, Ambulatory Care, Appointments and Schedules, COVID-19, Communication, Pandemics, Patient Acceptance of Health Care
- Published
- 2021
- Full Text
- View/download PDF
31. Association of Neighborhood-Level Factors and COVID-19 Infection Patterns in Philadelphia Using Spatial Regression.
- Author
-
Boland MR, Liu J, Balocchi C, Meeker J, Bai R, Mellis I, Mowery DL, and Herman D
- Subjects
- Humans, Philadelphia epidemiology, Residence Characteristics, SARS-CoV-2, Spatial Regression, United States epidemiology, COVID-19
- Abstract
As of August 2020, there were ~6 million COVID-19 cases in the United States of America, resulting in ~200,000 deaths. Informatics approaches are needed to better understand the role of individual and community risk factors for COVID-19. We developed an informatics method to integrate SARS-CoV-2 data with multiple neighborhood-level factors from the American Community Survey and opendataphilly.org. We assessed the spatial association between neighborhood-level factors and the frequency of SARS-CoV-2 positivity, separately across all patients and across asymptomatic patients. We found that neighborhoods with higher proportions of individuals with a high-school degree and/or who were identified as Hispanic/Latinx were more likely to have higher SARS-CoV-2 positivity rates, after adjusting for other neighborhood covariates. Patients from neighborhoods with higher proportions of individuals receiving public assistance and/or identified as White were less likely to test positive for SARS-CoV-2. Our approach and its findings could inform future public health efforts., (©2021 AMIA - All rights reserved.)
- Published
- 2021
32. Why Is the Electronic Health Record So Challenging for Research and Clinical Care?
- Author
-
Holmes JH, Beinlich J, Boland MR, Bowles KH, Chen Y, Cook TS, Demiris G, Draugelis M, Fluharty L, Gabriel PE, Grundmeier R, Hanson CW, Herman DS, Himes BE, Hubbard RA, Kahn CE Jr, Kim D, Koppel R, Long Q, Mirkovic N, Morris JS, Mowery DL, Ritchie MD, Urbanowicz R, and Moore JH
- Subjects
- Delivery of Health Care, Health Personnel, Humans, Electronic Health Records, Health Information Systems
- Abstract
Background: The electronic health record (EHR) has become increasingly ubiquitous. At the same time, health professionals have been turning to this resource for access to data that is needed for the delivery of health care and for clinical research. There is little doubt that the EHR has made both of these functions easier than earlier days when we relied on paper-based clinical records. Coupled with modern database and data warehouse systems, high-speed networks, and the ability to share clinical data with others are large number of challenges that arguably limit the optimal use of the EHR OBJECTIVES: Our goal was to provide an exhaustive reference for those who use the EHR in clinical and research contexts, but also for health information systems professionals as they design, implement, and maintain EHR systems., Methods: This study includes a panel of 24 biomedical informatics researchers, information technology professionals, and clinicians, all of whom have extensive experience in design, implementation, and maintenance of EHR systems, or in using the EHR as clinicians or researchers. All members of the panel are affiliated with Penn Medicine at the University of Pennsylvania and have experience with a variety of different EHR platforms and systems and how they have evolved over time., Results: Each of the authors has shared their knowledge and experience in using the EHR in a suite of 20 short essays, each representing a specific challenge and classified according to a functional hierarchy of interlocking facets such as usability and usefulness, data quality, standards, governance, data integration, clinical care, and clinical research., Conclusion: We provide here a set of perspectives on the challenges posed by the EHR to clinical and research users., Competing Interests: T.S.C. reports grants from ACRIN, NIH, ACR, and RSNA, as well as royalties from the Osler Institute for lectures in 2013, outside the submitted work. D.S.H. reports grants and nonfinancial support from Roche Diagnostics, outside the submitted work. R.A.H. reports grants from Johnson & Johnson, Merck, and Pfizer, outside the submitted work. Q.L. reports grants from NIH, during the conduct of the study; grants from Pfizer and Bayer, outside the submitted work., (Thieme. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
33. International Comparisons of Harmonized Laboratory Value Trajectories to Predict Severe COVID-19: Leveraging the 4CE Collaborative Across 342 Hospitals and 6 Countries: A Retrospective Cohort Study.
- Author
-
Weber GM, Hong C, Palmer NP, Avillach P, Murphy SN, Gutiérrez-Sacristán A, Xia Z, Serret-Larmande A, Neuraz A, Omenn GS, Visweswaran S, Klann JG, South AM, Loh NHW, Cannataro M, Beaulieu-Jones BK, Bellazzi R, Agapito G, Alessiani M, Aronow BJ, Bell DS, Bellasi A, Benoit V, Beraghi M, Boeker M, Booth J, Bosari S, Bourgeois FT, Brown NW, Bucalo M, Chiovato L, Chiudinelli L, Dagliati A, Devkota B, DuVall SL, Follett RW, Ganslandt T, García Barrio N, Gradinger T, Griffier R, Hanauer DA, Holmes JH, Horki P, Huling KM, Issitt RW, Jouhet V, Keller MS, Kraska D, Liu M, Luo Y, Lynch KE, Malovini A, Mandl KD, Mao C, Maram A, Matheny ME, Maulhardt T, Mazzitelli M, Milano M, Moore JH, Morris JS, Morris M, Mowery DL, Naughton TP, Ngiam KY, Norman JB, Patel LP, Pedrera Jimenez M, Ramoni RB, Schriver ER, Scudeller L, Sebire NJ, Serrano Balazote P, Spiridou A, Tan AL, Tan BW, Tibollo V, Torti C, Trecarichi EM, Vitacca M, Zambelli A, Zucco C, Kohane IS, Cai T, and Brat GA
- Abstract
Objectives: To perform an international comparison of the trajectory of laboratory values among hospitalized patients with COVID-19 who develop severe disease and identify optimal timing of laboratory value collection to predict severity across hospitals and regions., Design: Retrospective cohort study., Setting: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE), an international multi-site data-sharing collaborative of 342 hospitals in the US and in Europe., Participants: Patients hospitalized with COVID-19, admitted before or after PCR-confirmed result for SARS-CoV-2., Primary and Secondary Outcome Measures: Patients were categorized as "ever-severe" or "never-severe" using the validated 4CE severity criteria. Eighteen laboratory tests associated with poor COVID-19-related outcomes were evaluated for predictive accuracy by area under the curve (AUC), compared between the severity categories. Subgroup analysis was performed to validate a subset of laboratory values as predictive of severity against a published algorithm. A subset of laboratory values (CRP, albumin, LDH, neutrophil count, D-dimer, and procalcitonin) was compared between North American and European sites for severity prediction., Results: Of 36,447 patients with COVID-19, 19,953 (43.7%) were categorized as ever-severe. Most patients (78.7%) were 50 years of age or older and male (60.5%). Longitudinal trajectories of CRP, albumin, LDH, neutrophil count, D-dimer, and procalcitonin showed association with disease severity. Significant differences of laboratory values at admission were found between the two groups. With the exception of D-dimer, predictive discrimination of laboratory values did not improve after admission. Sub-group analysis using age, D-dimer, CRP, and lymphocyte count as predictive of severity at admission showed similar discrimination to a published algorithm (AUC=0.88 and 0.91, respectively). Both models deteriorated in predictive accuracy as the disease progressed. On average, no difference in severity prediction was found between North American and European sites., Conclusions: Laboratory test values at admission can be used to predict severity in patients with COVID-19. Prediction models show consistency across international sites highlighting the potential generalizability of these models., Competing Interests: COMPETING INTEREST STATEMENT There are no competing interests to report.
- Published
- 2021
- Full Text
- View/download PDF
34. Multinational Prevalence of Neurological Phenotypes in Patients Hospitalized with COVID-19.
- Author
-
Le TT, Gutiérrez-Sacristán A, Son J, Hong C, South AM, Beaulieu-Jones BK, Loh NHW, Luo Y, Morris M, Ngiam KY, Patel LP, Samayamuthu MJ, Schriver E, Tan AL, Moore J, Cai T, Omenn GS, Avillach P, Kohane IS, Visweswaran S, Mowery DL, and Xia Z
- Abstract
Objective: Neurological complications can worsen outcomes in COVID-19. We defined the prevalence of a wide range of neurological conditions among patients hospitalized with COVID-19 in geographically diverse multinational populations., Methods: Using electronic health record (EHR) data from 348 participating hospitals across 6 countries and 3 continents between January and September 2020, we performed a cross-sectional study of hospitalized adult and pediatric patients with a positive SARS-CoV-2 reverse transcription polymerase chain reaction test, both with and without severe COVID-19. We assessed the frequency of each disease category and 3-character International Classification of Disease (ICD) code of neurological diseases by countries, sites, time before and after admission for COVID-19, and COVID-19 severity., Results: Among the 35,177 hospitalized patients with SARS-CoV-2 infection, there was increased prevalence of disorders of consciousness (5.8%, 95% confidence interval [CI]: 3.7%-7.8%, p
FDR <.001) and unspecified disorders of the brain (8.1%, 95%CI: 5.7%-10.5%, pFDR <.001), compared to pre-admission prevalence. During hospitalization, patients who experienced severe COVID-19 status had 22% (95%CI: 19%-25%) increase in the relative risk (RR) of disorders of consciousness, 24% (95%CI: 13%-35%) increase in other cerebrovascular diseases, 34% (95%CI: 20%-50%) increase in nontraumatic intracranial hemorrhage, 37% (95%CI: 17%-60%) increase in encephalitis and/or myelitis, and 72% (95%CI: 67%-77%) increase in myopathy compared to those who never experienced severe disease., Interpretation: Using an international network and common EHR data elements, we highlight an increase in the prevalence of central and peripheral neurological phenotypes in patients hospitalized with SARS-CoV-2 infection, particularly among those with severe disease.- Published
- 2021
- Full Text
- View/download PDF
35. A Preliminary Characterization of Canonicalized and Non-Canonicalized Section Headers Across Variable Clinical Note Types.
- Author
-
Wang J, Yu S, Davoudi A, and Mowery DL
- Subjects
- Artificial Intelligence, Humans, Medical Records Systems, Computerized, Pilot Projects, Vocabulary, Controlled, Documentation methods, Electronic Health Records standards, Natural Language Processing, Terminology as Topic
- Abstract
In the electronic health record, the majority of clinically relevant information is stored within clinical notes. Most clinical notes follow a set organizational structure composed of canonicalized section headers that facilitate clinical review and information gathering. Standardized section header terminologies such as the SecTag terminology permit the identification and standardization of headers to a canonicalized form. Although the SecTag terminology has been evaluated extensively for history & physical notes, the coverage of canonical section header terms has not been assessed across other note types. For this pilot study, we conducted a coverage study and characterization of canonical section headers across 5 common, clinical note types and a generalizability study of canonical section headers detected within two types of clinical notes from Penn Medicine., (©2020 AMIA - All rights reserved.)
- Published
- 2021
36. Annotation and extraction of age and temporally-related events from clinical histories.
- Author
-
Hong J, Davoudi A, Yu S, and Mowery DL
- Subjects
- Aged, 80 and over, Cohort Studies, Humans, Natural Language Processing
- Abstract
Background: Age and time information stored within the histories of clinical notes can provide valuable insights for assessing a patient's disease risk, understanding disease progression, and studying therapeutic outcomes. However, details of age and temporally-specified clinical events are not well captured, consistently codified, and readily available to research databases for study., Methods: We expanded upon existing annotation schemes to capture additional age and temporal information, conducted an annotation study to validate our expanded schema, and developed a prototypical, rule-based Named Entity Recognizer to extract our novel clinical named entities (NE). The annotation study was conducted on 138 discharge summaries from the pre-annotated 2014 ShARe/CLEF eHealth Challenge corpus. In addition to existing NE classes (TIMEX3, SUBJECT_CLASS, DISEASE_DISORDER), our schema proposes 3 additional NEs (AGE, PROCEDURE, OTHER_EVENTS). We also propose new attributes, e.g., "degree_relation" which captures the degree of biological relation for subjects annotated under SUBJECT_CLASS. As a proof of concept, we applied the schema to 49 H&P notes to encode pertinent history information for a lung cancer cohort study., Results: An abundance of information was captured under the new OTHER_EVENTS, PROCEDURE and AGE classes, with 23%, 10% and 8% of all annotated NEs belonging to the above classes, respectively. We observed high inter-annotator agreement of >80% for AGE and TIMEX3; the automated NLP system achieved F1 scores of 86% (AGE) and 86% (TIMEX3). Age and temporally-specified mentions within past medical, family, surgical, and social histories were common in our lung cancer data set; annotation is ongoing to support this translational research study., Conclusions: Our annotation schema and NLP system can encode historical events from clinical notes to support clinical and translational research studies.
- Published
- 2020
- Full Text
- View/download PDF
37. Patient Interaction Phenotypes With an Automated Remote Hypertension Monitoring Program and Their Association With Blood Pressure Control: Observational Study.
- Author
-
Davoudi A, Lee NS, Chivers C, Delaney T, Asch EL, Reitz C, Mehta SJ, Chaiyachati KH, and Mowery DL
- Subjects
- Adolescent, Adult, Aged, Female, Humans, Male, Middle Aged, Phenotype, Young Adult, Blood Pressure physiology, Hypertension therapy, Monitoring, Physiologic methods, Text Messaging standards
- Abstract
Background: Automated texting platforms have emerged as a tool to facilitate communication between patients and health care providers with variable effects on achieving target blood pressure (BP). Understanding differences in the way patients interact with these communication platforms can inform their use and design for hypertension management., Objective: Our primary aim was to explore the unique phenotypes of patient interactions with an automated text messaging platform for BP monitoring. Our secondary aim was to estimate associations between interaction phenotypes and BP control., Methods: This study was a secondary analysis of data from a randomized controlled trial for adults with poorly controlled hypertension. A total of 201 patients with established primary care were assigned to the automated texting platform; messages exchanged throughout the 4-month program were analyzed. We used the k-means clustering algorithm to characterize two different interaction phenotypes: program conformity and engagement style. First, we identified unique clusters signifying differences in program conformity based on the frequency over time of error alerts, which were generated to patients when they deviated from the requested text message format (eg, ###/## for BP). Second, we explored overall engagement styles, defined by error alerts and responsiveness to text prompts, unprompted messages, and word count averages. Finally, we applied the chi-square test to identify associations between each interaction phenotype and achieving the target BP., Results: We observed 3 categories of program conformity based on their frequency of error alerts: those who immediately and consistently submitted texts without system errors (perfect users, 51/201), those who did so after an initial learning period (adaptive users, 66/201), and those who consistently submitted messages generating errors to the platform (nonadaptive users, 38/201). Next, we observed 3 categories of engagement style: the enthusiast, who tended to submit unprompted messages with high word counts (17/155); the student, who inconsistently engaged (35/155); and the minimalist, who engaged only when prompted (103/155). Of all 6 phenotypes, we observed a statistically significant association between patients demonstrating the minimalist communication style (high adherence, few unprompted messages, limited information sharing) and achieving target BP (P<.001)., Conclusions: We identified unique interaction phenotypes among patients engaging with an automated text message platform for remote BP monitoring. Only the minimalist communication style was associated with achieving target BP. Identifying and understanding interaction phenotypes may be useful for tailoring future automated texting interactions and designing future interventions to achieve better BP control., (©Anahita Davoudi, Natalie S Lee, Corey Chivers, Timothy Delaney, Elizabeth L Asch, Catherine Reitz, Shivan J Mehta, Krisda H Chaiyachati, Danielle L Mowery. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 03.12.2020.)
- Published
- 2020
- Full Text
- View/download PDF
38. Identifying risk factors for suicidal ideation across a large community healthcare system.
- Author
-
Schriver E, Lieblich S, AlRabiah R, Mowery DL, and Brown LA
- Subjects
- Adolescent, Community Health Services, Humans, Risk Factors, Suicide, Attempted, Obsessive-Compulsive Disorder, Suicidal Ideation
- Abstract
Background: Suicide is the tenth leading cause of death in the United States. Several studies have leveraged electronic health record (EHR) data to predict suicide risk in veteran and military samples; however, few studies have investigated suicide risk factors in a large-scale community health population., Methods: Clinical data was queried for 9,811 patients from the Penn Medicine Health System who had completed a Patient Health Questionnaire-9 (PHQ-9) documented in the EHR between January 2017 and June 2019. Patient demographics, PHQ-9 scores, and psychiatric comorbidities were extracted from the EHR. Univariate and multivariable logistic regressions were applied to determine significant risk factors associated with suicide ideation responses from the PHQ-9., Results: One-quarter (25.8%% of patients endorsed suicide ideation. Univariate analysis found 22 risk factors of suicide ideation. Multivariable logistic regression found significant positive associations (Odds Ratio, (95% Confidence Interval)) with the following: younger ages less than 18 years: 2.1, (1.69, 2.60) and 19-24 years: 1.55, (1.29, 1.87)), single marital status (1.22, (1.08, 1.38)), African American (1.22, (1.08, 1.38)), non-commercial insurance (1.16, (1.03, 1.31)), multiple comorbidities (1 comorbidity (1.65, (1.32, 2.07); 2 comorbidities (2.07, (1.61, 2.64)), 3+ comorbidities (2.49, (1.87, 3.33))), bipolar disorders (Type I: 1.38, (1.14, 1.67) and Type II: 1.94, (1.52, 2.49)), depressive disorders (1.70, (1.49, 1.94)), obsessive compulsive disorder (OCD) (1.43, (1.08, 1.90)), and stress disorders (1.53, (1.33, 1.76))., Conclusion: Community EHR information can be used to predict suicidal ideation. This information can be used to design tools for identifying patients at risk for suicide in real-time., (Copyright © 2020 Elsevier B.V. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
39. Risk of Persistent Opioid Use following Major Surgery in Matched Samples of Patients with and without Cancer.
- Author
-
Falcone M, Luo C, Chen Y, Birtwell D, Cheatle M, Duan R, Gabriel PE, He L, Ko EM, Lenz HJ, Mirkovic N, Mowery DL, Ochroch EA, Paulson EC, Schriver E, Schnoll RA, Bekelman JE, and Lerman C
- Subjects
- Analgesics, Opioid pharmacology, Female, Humans, Male, Middle Aged, Analgesics, Opioid therapeutic use, Neoplasms drug therapy, Neoplasms surgery, Surgical Procedures, Operative adverse effects
- Abstract
Background: The opioid crisis has reached epidemic proportions, yet risk of persistent opioid use following curative intent surgery for cancer and factors influencing this risk are not well understood., Methods: We used electronic health record data from 3,901 adult patients who received a prescription for an opioid analgesic related to hysterectomy or large bowel surgery from January 1, 2013, through June 30, 2018. Patients with and without a cancer diagnosis were matched on the basis of demographic, clinical, and procedural variables and compared for persistent opioid use., Results: Cancer diagnosis was associated with greater risk for persistent opioid use after hysterectomy [18.9% vs. 9.6%; adjusted OR (aOR), 2.26; 95% confidence interval (CI), 1.38-3.69; P = 0.001], but not after large bowel surgery (28.3% vs. 24.1%; aOR 1.25; 95% CI, 0.97-1.59; P = 0.09). In the cancer hysterectomy cohort, persistent opioid use was associated with cancer stage (increased rates among those with stage III cancer compared with stage I) and use of neoadjuvant or adjuvant chemotherapy; however, these factors were not associated with persistent opioid use in the large bowel cohort., Conclusions: Patients with cancer may have an increased risk of persistent opioid use following hysterectomy., Impact: Risks and benefits of opioid analgesia for surgical pain among patients with cancer undergoing hysterectomy should be carefully considered., (©2020 American Association for Cancer Research.)
- Published
- 2020
- Full Text
- View/download PDF
40. International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium.
- Author
-
Brat GA, Weber GM, Gehlenborg N, Avillach P, Palmer NP, Chiovato L, Cimino J, Waitman LR, Omenn GS, Malovini A, Moore JH, Beaulieu-Jones BK, Tibollo V, Murphy SN, Yi SL, Keller MS, Bellazzi R, Hanauer DA, Serret-Larmande A, Gutierrez-Sacristan A, Holmes JJ, Bell DS, Mandl KD, Follett RW, Klann JG, Murad DA, Scudeller L, Bucalo M, Kirchoff K, Craig J, Obeid J, Jouhet V, Griffier R, Cossin S, Moal B, Patel LP, Bellasi A, Prokosch HU, Kraska D, Sliz P, Tan ALM, Ngiam KY, Zambelli A, Mowery DL, Schiver E, Devkota B, Bradford RL, Daniar M, Daniel C, Benoit V, Bey R, Paris N, Serre P, Orlova N, Dubiel J, Hilka M, Jannot AS, Breant S, Leblanc J, Griffon N, Burgun A, Bernaux M, Sandrin A, Salamanca E, Cormont S, Ganslandt T, Gradinger T, Champ J, Boeker M, Martel P, Esteve L, Gramfort A, Grisel O, Leprovost D, Moreau T, Varoquaux G, Vie JJ, Wassermann D, Mensch A, Caucheteux C, Haverkamp C, Lemaitre G, Bosari S, Krantz ID, South A, Cai T, and Kohane IS
- Abstract
We leveraged the largely untapped resource of electronic health record data to address critical clinical and epidemiological questions about Coronavirus Disease 2019 (COVID-19). To do this, we formed an international consortium (4CE) of 96 hospitals across five countries (www.covidclinical.net). Contributors utilized the Informatics for Integrating Biology and the Bedside (i2b2) or Observational Medical Outcomes Partnership (OMOP) platforms to map to a common data model. The group focused on temporal changes in key laboratory test values. Harmonized data were analyzed locally and converted to a shared aggregate form for rapid analysis and visualization of regional differences and global commonalities. Data covered 27,584 COVID-19 cases with 187,802 laboratory tests. Case counts and laboratory trajectories were concordant with existing literature. Laboratory tests at the time of diagnosis showed hospital-level differences equivalent to country-level variation across the consortium partners. Despite the limitations of decentralized data generation, we established a framework to capture the trajectory of COVID-19 disease in patients and their response to interventions., Competing Interests: Competing interestsR.B. and A.M. are shareholders of Biomeris s.r.l., (© The Author(s) 2020.)
- Published
- 2020
- Full Text
- View/download PDF
41. Ideas for how informaticians can get involved with COVID-19 research.
- Author
-
Moore JH, Barnett I, Boland MR, Chen Y, Demiris G, Gonzalez-Hernandez G, Herman DS, Himes BE, Hubbard RA, Kim D, Morris JS, Mowery DL, Ritchie MD, Shen L, Urbanowicz R, and Holmes JH
- Abstract
The coronavirus disease 2019 (COVID-19) pandemic has had a significant impact on population health and wellbeing. Biomedical informatics is central to COVID-19 research efforts and for the delivery of healthcare for COVID-19 patients. Critical to this effort is the participation of informaticians who typically work on other basic science or clinical problems. The goal of this editorial is to highlight some examples of COVID-19 research areas that could benefit from informatics expertise. Each research idea summarizes the COVID-19 application area, followed by an informatics methodology, approach, or technology that could make a contribution. It is our hope that this piece will motivate and make it easy for some informaticians to adopt COVID-19 research projects., Competing Interests: Competing interestsThe authors declare that they have no competing interests., (© The Author(s) 2020.)
- Published
- 2020
- Full Text
- View/download PDF
42. A novel tool for standardizing clinical data in a semantically rich model.
- Author
-
Freedman HG, Williams H, Miller MA, Birtwell D, Mowery DL, and Stoeckert CJ Jr
- Abstract
Standardizing clinical information in a semantically rich data model is useful for promoting interoperability and facilitating high quality research. Semantic Web technologies such as Resource Description Framework can be utilized to their full potential when a model accurately reflects the semantics of the clinical situation it describes. To this end, ontologies that abide by sound organizational principles can be used as the building blocks of a semantically rich model for the storage of clinical data. However, it is a challenge to programmatically define such a model and load data from disparate sources. The PennTURBO Semantic Engine is a tool developed at the University of Pennsylvania that transforms concise RDF data into a source-independent, semantically rich model. This system sources classes from an application ontology and specifically defines how instances of those classes may relate to each other. Additionally, the system defines and executes RDF data transformations by launching dynamically generated SPARQL update statements. The Semantic Engine was designed as a generalizable data standardization tool, and is able to work with various data models and incoming data sources. Its human-readable configuration files can easily be shared between institutions, providing the basis for collaboration on a standard data model., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
43. Carnival: A Graph-Based Data Integration and Query Tool to Support Patient Cohort Generation for Clinical Research.
- Author
-
Birtwell D, Williams H, Pyeritz R, Damrauer S, and Mowery DL
- Subjects
- Case-Control Studies, Cohort Studies, Humans, Data Analysis, Electronic Health Records, Information Storage and Retrieval
- Abstract
Clinical research studies often leverage various heterogeneous data sources including patient electronic health record, online survey, and genomic data. We introduce a graph-based, data integration and query tool called Carnival. We demonstrate its powerful ability to unify data from these disparate data sources to create datasets for two studies: prevalence and incidence case/control matches for coronary artery disease and controls for Marfan syndrome. We conclude with future directions for Carnival development.
- Published
- 2019
- Full Text
- View/download PDF
44. Determining Onset for Familial Breast and Colorectal Cancer from Family History Comments in the Electronic Health Record.
- Author
-
Mowery DL, Kawamoto K, Bradshaw R, Kohlmann W, Schiffman JD, Weir C, Borbolla D, Chapman WW, and Del Fiol G
- Abstract
Background . Family health history (FHH) can be used to identify individuals at elevated risk for familial cancers. Risk criteria for common cancers rely on age of onset, which is documented inconsistently as structured and unstructured data in electronic health records (EHRs). Objective . To investigate a natural language processing (NLP) approach to extract age of onset and age of death from free-text EHR fields. Methods . Using 474,651 FHH entries from 89,814 patients, we investigated two methods - frequent patterns (baseline) and NLP classifier. Results . For age of onset, the NLP classifier outperformed the baseline in precision (96% vs. 83%; 95% CI [94, 97] and [80, 86]) with equivalent recall (both 93%; 95% CI [91, 95]). When applied to the full dataset, the NLP approach increased the percentage of FHH entries for which cancer risk criteria could be applied from 10% to 15%. Conclusion . NLP combined with structured data may improve the computation of familial cancer risk criteria for various use cases.
- Published
- 2019
45. Preparing next-generation scientists for biomedical big data: artificial intelligence approaches.
- Author
-
Moore JH, Boland MR, Camara PG, Chervitz H, Gonzalez G, Himes BE, Kim D, Mowery DL, Ritchie MD, Shen L, Urbanowicz RJ, and Holmes JH
- Subjects
- Big Data, Clinical Decision-Making, Data Mining, Electronic Health Records, Humans, Machine Learning, Natural Language Processing, Data Science methods, Precision Medicine methods
- Abstract
Personalized medicine is being realized by our ability to measure biological and environmental information about patients. Much of these data are being stored in electronic health records yielding big data that presents challenges for its management and analysis. Here, we review several areas of knowledge that are necessary for next-generation scientists to fully realize the potential of biomedical big data. We begin with an overview of big data and its storage and management. We then review statistics and data science as foundational topics followed by a core curriculum of artificial intelligence, machine learning and natural language processing that are needed to develop predictive models for clinical decision making. We end with some specific training recommendations for preparing next-generation scientists for biomedical big data.
- Published
- 2019
- Full Text
- View/download PDF
46. Moonstone: a novel natural language processing system for inferring social risk from clinical narratives.
- Author
-
Conway M, Keyhani S, Christensen L, South BR, Vali M, Walter LC, Mowery DL, Abdelrahman S, and Chapman WW
- Subjects
- Health, Humans, Risk Factors, Natural Language Processing, Social Environment
- Abstract
Background: Social risk factors are important dimensions of health and are linked to access to care, quality of life, health outcomes and life expectancy. However, in the Electronic Health Record, data related to many social risk factors are primarily recorded in free-text clinical notes, rather than as more readily computable structured data, and hence cannot currently be easily incorporated into automated assessments of health. In this paper, we present Moonstone, a new, highly configurable rule-based clinical natural language processing system designed to automatically extract information that requires inferencing from clinical notes. Our initial use case for the tool is focused on the automatic extraction of social risk factor information - in this case, housing situation, living alone, and social support - from clinical notes. Nursing notes, social work notes, emergency room physician notes, primary care notes, hospital admission notes, and discharge summaries, all derived from the Veterans Health Administration, were used for algorithm development and evaluation., Results: An evaluation of Moonstone demonstrated that the system is highly accurate in extracting and classifying the three variables of interest (housing situation, living alone, and social support). The system achieved positive predictive value (i.e. precision) scores ranging from 0.66 (homeless/marginally housed) to 0.98 (lives at home/not homeless), accuracy scores ranging from 0.63 (lives in facility) to 0.95 (lives alone), and sensitivity (i.e. recall) scores ranging from 0.75 (lives in facility) to 0.97 (lives alone)., Conclusions: The Moonstone system is - to the best of our knowledge - the first freely available, open source natural language processing system designed to extract social risk factors from clinical text with good (lives in facility) to excellent (lives alone) performance. Although developed with the social risk factor identification task in mind, Moonstone provides a powerful tool to address a range of clinical natural language processing tasks, especially those tasks that require nuanced linguistic processing in conjunction with inference capabilities.
- Published
- 2019
- Full Text
- View/download PDF
47. Documentation of ENDS Use in the Veterans Affairs Electronic Health Record (2008-2014).
- Author
-
Conway M, Mowery DL, South BR, Stoddard GJ, Chapman WW, Patterson OV, and Zhu SH
- Subjects
- Adolescent, Adult, Aged, Aged, 80 and over, Electronic Nicotine Delivery Systems statistics & numerical data, Female, Humans, Male, Middle Aged, United States, Young Adult, Documentation standards, Electronic Health Records standards, Tobacco Smoking epidemiology, United States Department of Veterans Affairs
- Published
- 2019
- Full Text
- View/download PDF
48. Detecting Evidence of Intra-abdominal Surgical Site Infections from Radiology Reports Using Natural Language Processing.
- Author
-
Chapman AB, Mowery DL, Swords DS, Chapman WW, and Bucher BT
- Subjects
- Abdomen diagnostic imaging, Abdomen surgery, Datasets as Topic, Humans, Radiology Information Systems, Reference Standards, Vocabulary, Controlled, Electronic Health Records, Machine Learning, Natural Language Processing, Radiography, Surgical Wound Infection diagnostic imaging
- Abstract
Free-text reports in electronic health records (EHRs) contain medically significant information - signs, symptoms, findings, diagnoses - recorded by clinicians during patient encounters. These reports contain rich clinical information which can be leveraged for surveillance of disease and occurrence of adverse events. In order to gain meaningful knowledge from these text reports to support surveillance efforts, information must first be converted into a structured, computable format. Traditional methods rely on manual review of charts, which can be costly and inefficient. Natural language processing (NLP) methods offer an efficient, alternative approach to extracting the information and can achieve a similar level of accuracy. We developed an NLP system to automatically identify mentions of surgical site infections in radiology reports and classify reports containing evidence of surgical site infections leveraging these mentions. We evaluated our system using a reference standard of reports annotated by domain experts, administrative data generated for each patient encounter, and a machine learning-based approach.
- Published
- 2018
49. Understanding patient satisfaction with received healthcare services: A natural language processing approach.
- Author
-
Doing-Harris K, Mowery DL, Daniels C, Chapman WW, and Conway M
- Subjects
- Bayes Theorem, Humans, Outcome and Process Assessment, Health Care, Terminology as Topic, Delivery of Health Care, Natural Language Processing, Patient Satisfaction
- Abstract
Important information is encoded in free-text patient comments. We determine the most common topics in patient comments, design automatic topic classifiers, identify comments ' sentiment, and find new topics in negative comments. Our annotation scheme consisted of 28 topics, with positive and negative sentiment. Within those 28 topics, the seven most frequent accounted for 63% of annotations. For automated topic classification, we developed vocabulary-based and Naive Bayes ' classifiers. For sentiment analysis, another Naive Bayes ' classifier was used. Finally, we used topic modeling to search for unexpected topics within negative comments. The seven most common topics were appointment access, appointment wait, empathy, explanation, friendliness, practice environment, and overall experience. The best F-measures from our classifier were 0.52(NB), 0.57(NB), 0.36(Vocab), 0.74(NB), 0.40(NB), and 0.44(Vocab), respectively. F- scores ranged from 0.16 to 0.74. The sentiment classification F-score was 0.84. Negative comment topic modeling revealed complaints about appointment access, appointment wait, and time spent with physician.
- Published
- 2017
50. Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2.
- Author
-
Mowery DL, South BR, Christensen L, Leng J, Peltonen LM, Salanterä S, Suominen H, Martinez D, Velupillai S, Elhadad N, Savova G, Pradhan S, and Chapman WW
- Subjects
- Humans, Biological Ontologies, Natural Language Processing, Telemedicine
- Abstract
Background: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term., Methods: In this study, we evaluate 1) accuracy of participating systems' normalizing short forms compared to a majority sense baseline approach, 2) performance of participants' systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems' normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms., Results: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy., Conclusion: Short form normalization continues to be a challenging problem. Short form normalization systems perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base with missed concept unique identifiers from the ShARe test set to further support patient understanding of unfamiliar medical terms.
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.