Back to Search
Start Over
Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study.
- Source :
-
Nutrition, metabolism, and cardiovascular diseases : NMCD [Nutr Metab Cardiovasc Dis] 2024 Jun; Vol. 34 (6), pp. 1456-1466. Date of Electronic Publication: 2024 Feb 15. - Publication Year :
- 2024
-
Abstract
- Background and Aims: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver disease, which lacks effective drug treatments. This study aimed to construct an eXtreme Gradient Boosting (XGBoost) prediction model to identify or evaluate potential NAFLD patients.<br />Methods and Results: We conducted a longitudinal study of 22,140 individuals from the Beijing Health Management Cohort. Variable filtering was performed using the least absolute shrinkage and selection operator. Random Over Sampling Examples was used to address imbalanced data. Next, the XGBoost model and the other three machine learning (ML) models were built using balanced data. Finally, the variable importance of the XGBoost model was ranked. Among four ML algorithms, we got that the XGBoost model outperformed the other models with the following results: accuracy of 0.835, sensitivity of 0.835, specificity of 0.834, Youden index of 0.669, precision of 0.831, recall of 0.835, F-1 score of 0.833, and an area under the curve of 0.914. The top five variables with the greatest impact on the onset of NAFLD were aspartate aminotransferase, cardiometabolic index, body mass index, alanine aminotransferase, and triglyceride-glucose index.<br />Conclusion: The predictive model based on the XGBoost algorithm enables early prediction of the onset of NAFLD. Additionally, assessing variable importance provides valuable insights into the prevention and treatment of NAFLD.<br />Competing Interests: Declaration of competing interest The authors declared no conflict of interest.<br /> (Copyright © 2024 The Italian Diabetes Society, the Italian Society for the Study of Atherosclerosis, the Italian Society of Human Nutrition and the Department of Clinical Medicine and Surgery, Federico II University. Published by Elsevier B.V. All rights reserved.)
- Subjects :
- Humans
Longitudinal Studies
Male
Female
Middle Aged
Adult
Risk Assessment
Beijing epidemiology
Prognosis
Reproducibility of Results
Decision Support Techniques
Risk Factors
Diagnosis, Computer-Assisted
Non-alcoholic Fatty Liver Disease diagnosis
Non-alcoholic Fatty Liver Disease epidemiology
Non-alcoholic Fatty Liver Disease blood
Machine Learning
Predictive Value of Tests
Biomarkers blood
Subjects
Details
- Language :
- English
- ISSN :
- 1590-3729
- Volume :
- 34
- Issue :
- 6
- Database :
- MEDLINE
- Journal :
- Nutrition, metabolism, and cardiovascular diseases : NMCD
- Publication Type :
- Academic Journal
- Accession number :
- 38508988
- Full Text :
- https://doi.org/10.1016/j.numecd.2024.02.004