Back to Search Start Over

Prediction of lung cancer risk in Chinese population with genetic‐environment factor using extreme gradient boosting

Authors :
Yutao Li
Zixiu Zou
Zhunyi Gao
Yi Wang
Man Xiao
Chang Xu
Gengxi Jiang
Haijian Wang
Li Jin
Jiucun Wang
Huai Zhou Wang
Shicheng Guo
Junjie Wu
Source :
Cancer Medicine, Vol 11, Iss 23, Pp 4469-4478 (2022)
Publication Year :
2022
Publisher :
Wiley, 2022.

Abstract

Abstract Background Detecting early‐stage lung cancer is critical to reduce the lung cancer mortality rate; however, existing models based on germline variants perform poorly, and new models are needed. This study aimed to use extreme gradient boosting to develop a predictive model for the early diagnosis of lung cancer in a multicenter case–control study. Materials and Methods A total of 974 cases and 1005 controls in Shanghai and Taizhou were recruited, and 61 single nucleotide polymorphisms (SNPs) were genotyped. Multivariate logistic regression was used to calculate the association between signal SNPs and lung cancer risk. Logistic regression (LR) and extreme gradient boosting (XGBoost) algorithms, a large‐scale machine learning algorithm, were adopted to build the lung cancer risk model. In both models, 10‐fold cross‐validation was performed, and model predictive performance was evaluated by the area under the curve (AUC). Results After FDR adjustment, TYMS rs3819102 and BAG6 rs1077393 were significantly associated with lung cancer risk (p

Details

Language :
English
ISSN :
20457634
Volume :
11
Issue :
23
Database :
Directory of Open Access Journals
Journal :
Cancer Medicine
Publication Type :
Academic Journal
Accession number :
edsdoj.5345a1dea55f48e280c4b3c5e39f5ef9
Document Type :
article
Full Text :
https://doi.org/10.1002/cam4.4800