1. Development of a risk prediction model for radiation dermatitis following proton radiotherapy in head and neck cancer using ensemble machine learning
- Author
-
Tsair-Fwu Lee, Yen-Hsien Liu, Chu-Ho Chang, Chien-Liang Chiu, Chih-Hsueh Lin, Jen-Chung Shao, Yu-Cheng Yen, Guang-Zhi Lin, Jack Yang, Chin-Dar Tseng, Fu-Min Fang, Pei-Ju Chao, and Shen-Hao Lee
- Subjects
Radiation dermatitis ,Proton radiotherapy ,Head and neck cancer ,Ensemble machine learning ,Predictive modeling ,Feature selection ,Medical physics. Medical radiology. Nuclear medicine ,R895-920 ,Neoplasms. Tumors. Oncology. Including cancer and carcinogens ,RC254-282 - Abstract
Abstract Purpose This study aims to develop an ensemble machine learning-based (EML-based) risk prediction model for radiation dermatitis (RD) in patients with head and neck cancer undergoing proton radiotherapy, with the goal of achieving superior predictive performance compared to traditional models. Materials and methods Data from 57 head and neck cancer patients treated with intensity-modulated proton therapy at Kaohsiung Chang Gung Memorial Hospital were analyzed. The study incorporated 11 clinical and 9 dosimetric parameters. Pearson’s correlation was used to eliminate highly correlated variables, followed by feature selection via LASSO to focus on potential RD predictors. Model training involved traditional logistic regression (LR) and advanced ensemble methods such as Random Forest and XGBoost, which were optimized through hyperparameter tuning. Results Feature selection identified six key predictors, including smoking history and specific dosimetric parameters. Ensemble machine learning models, particularly XGBoost, demonstrated superior performance, achieving the highest AUC of 0.890. Feature importance was assessed using SHAP (SHapley Additive exPlanations) values, which underscored the relevance of various clinical and dosimetric factors in predicting RD. Conclusion The study confirms that EML methods, especially XGBoost with its boosting algorithm, provide superior predictive accuracy, enhanced feature selection, and improved data handling compared to traditional LR. While LR offers greater interpretability, the precision and broader applicability of EML make it more suitable for complex medical prediction tasks, such as predicting radiation dermatitis. Given these advantages, EML is highly recommended for further research and application in clinical settings.
- Published
- 2024
- Full Text
- View/download PDF