Jiao Wang, Meng-Yang Wang, Hui Wang, Hong-Wei Liu, Rui Lu, Tong-Qing Duan, Chang-Ping Li, Zhuang Cui, Yuan-Yuan Liu, Yuan-Jun Lyu, Jun Ma, and Yuan-Yuan Ji
Background: Blood glucose control is closely related to type 2 diabetes mellitus (T2DM) prognosis. This multicenter study aimed to investigate blood glucose control among patients with insulin-treated T2DM in North China and explore the application value of combining an elastic network (EN) with a machine-learning algorithm to predict glycemic control. Methods: Basic information, biochemical indices, and diabetes-related data were collected via questionnaire from 2787 consecutive participants recruited from 27 centers in six cities between January 2016 and December 2017. An EN regression was used to address variable collinearity. Then, three common machine learning algorithms (random forest [RF], support vector machine [SVM], and back propagation artificial neural network [BP-ANN]) were used to simulate and predict blood glucose status. Additionally, a stepwise logistic regression was performed to compare the machine learning models. Results: The well-controlled blood glucose rate was 45.82% in North China. The multivariable analysis found that hypertension history, atherosclerotic cardiovascular disease history, exercise, and total cholesterol were protective factors in glycosylated hemoglobin (HbA1c) control, while central adiposity, family history, T2DM duration, complications, insulin dose, blood pressure, and hypertension were risk factors for elevated HbA1c. Before the dimensional reduction in the EN, the areas under the curve of RF, SVM, and BP were 0.73, 0.61, and 0.70, respectively, while these figures increased to 0.75, 0.72, and 0.72, respectively, after dimensional reduction. Moreover, the EN and machine learning models had higher sensitivity and accuracy than the logistic regression models (the sensitivity and accuracy of logistic were 0.52 and 0.56; RF: 0.79, 0.70; SVM: 0.84, 0.73; BP-ANN: 0.78, 0.73, respectively). Conclusions: More than half of T2DM patients in North China had poor glycemic control and were at a higher risk of developing diabetic complications. The EN and machine learning algorithms are alternative choices, in addition to the traditional logistic model, for building predictive models of blood glucose control in patients with T2DM. Key words: Type 2 diabetes; Blood glucose; HbA1c; Elastic network; Machine learning more...