1. Individualized decision making in on-scene resuscitation time for out-of-hospital cardiac arrest using reinforcement learning.
- Author
-
Choi, Dong Hyun, Lim, Min Hyuk, Hong, Ki Jeong, Kim, Young Gyun, Park, Jeong Ho, Song, Kyoung Jun, Do Shin, Sang, and Kim, Sungwan
- Subjects
REINFORCEMENT (Psychology) ,RESEARCH funding ,STATISTICAL sampling ,KRUSKAL-Wallis Test ,LOGISTIC regression analysis ,DECISION making ,LEARNING ,EMERGENCY medicine ,RETROSPECTIVE studies ,CHI-squared test ,MANN Whitney U Test ,LONGITUDINAL method ,REWARD (Psychology) ,MEDICAL records ,ACQUISITION of data ,CARDIAC arrest ,CARDIOPULMONARY resuscitation ,SURVIVAL analysis (Biometry) ,DATA analysis software - Abstract
On-scene resuscitation time is associated with out-of-hospital cardiac arrest (OHCA) outcomes. We developed and validated reinforcement learning models for individualized on-scene resuscitation times, leveraging nationwide Korean data. Adult OHCA patients with a medical cause of arrest were included (N = 73,905). The optimal policy was derived from conservative Q-learning to maximize survival. The on-scene return of spontaneous circulation hazard rates estimated from the Random Survival Forest were used as intermediate rewards to handle sparse rewards, while patients' historical survival was reflected in the terminal rewards. The optimal policy increased the survival to hospital discharge rate from 9.6% to 12.5% (95% CI: 12.2–12.8) and the good neurological recovery rate from 5.4% to 7.5% (95% CI: 7.3–7.7). The recommended maximum on-scene resuscitation times for patients demonstrated a bimodal distribution, varying with patient, emergency medical services, and OHCA characteristics. Our survival analysis-based approach generates explainable rewards, reducing subjectivity in reinforcement learning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF