Bin Zhang, Shengsheng Huang, Chenxing Zhou, Jichong Zhu, Tianyou Chen, Sitan Feng, Chengqian Huang, Zequn Wang, Shaofeng Wu, Chong Liu, and Xinli Zhan
Background Machine learning (ML), a subset of artificial intelligence (AI), uses algorithms to analyze data and predict outcomes without extensive human intervention. In healthcare, ML is gaining attention for enhancing patient outcomes. This study focuses on predicting additional hospital days (AHD) for patients with cervical spondylosis (CS), a condition affecting the cervical spine. The research aims to develop an ML-based nomogram model analyzing clinical and demographic factors to estimate hospital length of stay (LOS). Accurate AHD predictions enable efficient resource allocation, improved patient care, and potential cost reduction in healthcare.Methods The study selected CS patients undergoing cervical spine surgery and investigated their medical data. A total of 945 patients were recruited, with 570 males and 375 females. The mean number of LOS calculated for the total sample was 8.64 ± 3.7 days. A LOS equal to or 8.64 days comprised the AHD-positive group (n = 406). The collected data was randomly divided into training and validation cohorts using a 7:3 ratio. The parameters included their general conditions, chronic diseases, preoperative clinical scores, and preoperative radiographic data including ossification of the anterior longitudinal ligament (OALL), ossification of the posterior longitudinal ligament (OPLL), cervical instability and magnetic resonance imaging T2-weighted imaging high signal (MRI T2WIHS), operative indicators and complications. ML-based models like Lasso regression, random forest (RF), and support vector machine (SVM) recursive feature elimination (SVM-RFE) were developed for predicting AHD-related risk factors. The intersections of the variables screened by the aforementioned algorithms were utilized to construct a nomogram model for predicting AHD in patients. The area under the curve (AUC) of the receiver operating characteristic (ROC) curve and C-index were used to evaluate the performance of the nomogram. Calibration curve and decision curve analysis (DCA) were performed to test the calibration performance and clinical utility.Results For these participants, 25 statistically significant parameters were identified as risk factors for AHD. Among these, nine factors were obtained as the intersection factors of these three ML algorithms and were used to develop a nomogram model. These factors were gender, age, body mass index (BMI), American Spinal Injury Association (ASIA) scores, magnetic resonance imaging T2-weighted imaging high signal (MRI T2WIHS), operated segment, intraoperative bleeding volume, the volume of drainage, and diabetes. After model validation, the AUC was 0.753 in the training cohort and 0.777 in the validation cohort. The calibration curve exhibited a satisfactory agreement between the nomogram predictions and actual probabilities. The C-index was 0.788 (95% confidence interval: 0.73214–0.84386). On the decision curve analysis (DCA), the threshold probability of the nomogram ranged from 1 to 99% (training cohort) and 1 to 75% (validation cohort).Conclusion We successfully developed an ML model for predicting AHD in patients undergoing cervical spine surgery, showcasing its potential to support clinicians in AHD identification and enhance perioperative treatment strategies.