Back to Search Start Over

Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data.

Authors :
Yang, Hong
Li, Xinxin
Cao, Hongyan
Cui, Yuehua
Luo, Yanhong
Liu, Jinchun
Zhang, Yanbo
Source :
Computer Methods & Programs in Biomedicine. Nov2021, Vol. 211, pN.PAG-N.PAG. 1p.
Publication Year :
2021

Abstract

• In this paper, we found that the weighted random forest (WRF) model can better predict the incidence of hepatic encephalopathy (HE) in cirrhotic patients. • For unbalanced data, we use the random forest (RF) and support vector machine (SVM) algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. • This work is the first study to predict cirrhosis in patients with hepatic encephalopathy based on unbalanced data. Hepatic encephalopathy (HE) is among the most common complications of cirrhosis. Data for cirrhosis with HE is typically unbalanced. Traditional statistical methods and machine learning algorithms thus cannot identify a few classes. In this paper, we use machine learning algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. We collected medical data from 1,256 patients with cirrhosis and performed preprocessing to extract 81 features from these irregular data. To predict HE in cirrhotic patients, we compared several classification methods: logistic regression, weighted random forest (WRF), SVM, and weighted SVM (WSVM). We also used an additional 722 patients with cirrhosis for external validation of the model. The WRF, WSVM, and logistic regression models exhibited better recognition ability for patients with HE than traditional machine learning models (sensitivity> 0.70), but their ability to identify patients with uncomplicated HE was slightly lower (specificity approximately 85%). The comprehensive evaluation index of the traditional model was higher than those of other models (G-means> 0.80 and F-measure> 0.40). For the WRF, the G-means (0.82), F-measure (0.46), and AUC (0.82) were superior to those of the logistic regression and WSVM models, which means that it can better predict the incidence of HE in patients. The WRF model is more suitable for the classification of unbalanced medical data and can be used to construct a risk prediction and evaluation system for liver cirrhosis complicated with HE. The probabilistic prediction models of WRF can help clinicians identify high-risk patients with HE. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01692607
Volume :
211
Database :
Academic Search Index
Journal :
Computer Methods & Programs in Biomedicine
Publication Type :
Academic Journal
Accession number :
153173170
Full Text :
https://doi.org/10.1016/j.cmpb.2021.106420