1. Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
- Author
-
Jia-Xin Li, Li Li, Xuemei Zhong, Shu-Jun Fan, Tao Cen, Jianquan Wang, Chuanjiang He, Zhoubin Zhang, Ya-Na Luo, Xiao-Xuan Liu, Li-Xin Hu, Yi-Dan Zhang, Hui-Ling Qiu, Guang-Hui Dong, Xiao-Guang Zou, and Bo-Yi Yang
- Subjects
Cardiovascular disease (CVD) ,Prediction ,Prominent factors ,Machine learning ,Kashgar prefecture ,Public aspects of medicine ,RA1-1270 - Abstract
Abstract Background Identifying factors associated with cardiovascular disease (CVD) is critical for its prevention, but this topic is scarcely investigated in Kashgar prefecture, Xinjiang, northwestern China. We thus explored the CVD epidemiology and identified prominent factors associated with CVD in this region. Methods A total of 1,887,710 adults at baseline (in 2017) of the Kashgar Prospective Cohort Study were included in the analysis. Sixteen candidate factors, including seven demographic factors, 4 lifestyle factors, and 5 clinical factors, were collected from a questionnaire and health examination records. CVD was defined according to International Clinical Diagnosis (ICD-10) codes. We first used logistic regression models to investigate the association between each of the candidate factors and CVD. Then, we employed 3 machine learning methods—Random Forest, Random Ferns, and Extreme Gradient Boosting—to rank and identify prominent factors associated with CVD. Stratification analyses by sex, ethnicity, education level, economic status, and residential setting were also performed to test the consistency of the ranking. Results The prevalence of CVD in Kashgar prefecture was 8.1%. All the 16 candidate factors were confirmed to be significantly associated with CVD (odds ratios ranged from 1.03 to 2.99, all p values
- Published
- 2022
- Full Text
- View/download PDF