Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data. (November 2021)
- Record Type:
- Journal Article
- Title:
- Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data. (November 2021)
- Main Title:
- Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data
- Authors:
- Yang, Hong
Li, Xinxin
Cao, Hongyan
Cui, Yuehua
Luo, Yanhong
Liu, Jinchun
Zhang, Yanbo - Abstract:
- Highlights: In this paper, we found that the weighted random forest (WRF) model can better predict the incidence of hepatic encephalopathy (HE) in cirrhotic patients. For unbalanced data, we use the random forest (RF) and support vector machine (SVM) algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. This work is the first study to predict cirrhosis in patients with hepatic encephalopathy based on unbalanced data. Abstract: Objective: Hepatic encephalopathy (HE) is among the most common complications of cirrhosis. Data for cirrhosis with HE is typically unbalanced. Traditional statistical methods and machine learning algorithms thus cannot identify a few classes. In this paper, we use machine learning algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. Method: We collected medical data from 1, 256 patients with cirrhosis and performed preprocessing to extract 81 features from these irregular data. To predict HE in cirrhotic patients, we compared several classification methods: logistic regression, weighted random forest (WRF), SVM, and weighted SVM (WSVM). We also used an additional 722 patients with cirrhosis for external validation of the model. Results: The WRF, WSVM, and logistic regression models exhibited better recognition ability for patients with HE than traditional machine learning models (sensitivity> 0.70),Highlights: In this paper, we found that the weighted random forest (WRF) model can better predict the incidence of hepatic encephalopathy (HE) in cirrhotic patients. For unbalanced data, we use the random forest (RF) and support vector machine (SVM) algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. This work is the first study to predict cirrhosis in patients with hepatic encephalopathy based on unbalanced data. Abstract: Objective: Hepatic encephalopathy (HE) is among the most common complications of cirrhosis. Data for cirrhosis with HE is typically unbalanced. Traditional statistical methods and machine learning algorithms thus cannot identify a few classes. In this paper, we use machine learning algorithms to construct a risk prediction model for liver cirrhosis complicated by HE to improve the efficiency of its prediction. Method: We collected medical data from 1, 256 patients with cirrhosis and performed preprocessing to extract 81 features from these irregular data. To predict HE in cirrhotic patients, we compared several classification methods: logistic regression, weighted random forest (WRF), SVM, and weighted SVM (WSVM). We also used an additional 722 patients with cirrhosis for external validation of the model. Results: The WRF, WSVM, and logistic regression models exhibited better recognition ability for patients with HE than traditional machine learning models (sensitivity> 0.70), but their ability to identify patients with uncomplicated HE was slightly lower (specificity approximately 85%). The comprehensive evaluation index of the traditional model was higher than those of other models (G-means> 0.80 and F-measure> 0.40). For the WRF, the G-means (0.82), F-measure (0.46), and AUC (0.82) were superior to those of the logistic regression and WSVM models, which means that it can better predict the incidence of HE in patients. Conclusion: The WRF model is more suitable for the classification of unbalanced medical data and can be used to construct a risk prediction and evaluation system for liver cirrhosis complicated with HE. The probabilistic prediction models of WRF can help clinicians identify high-risk patients with HE. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 211(2021)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 211(2021)
- Issue Display:
- Volume 211, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 211
- Issue:
- 2021
- Issue Sort Value:
- 2021-0211-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-11
- Subjects:
- Cost sensitivity -- Hepatic encephalopathy -- Disease risk prediction -- Weighted random forest -- Weighted support vector machine
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2021.106420 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20051.xml