five

Performance of models by different algorithms.

收藏
Figshare2025-11-12 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Performance_of_models_by_different_algorithms_/30602687
下载链接
链接失效反馈
官方服务:
资源简介:
ObjectiveWith the global increase in obesity rates and lifestyle changes, metabolic dysfunction-associated steatotic liver disease (MASLD) has become a prevalent chronic liver disorder, affecting approximately 25% of the global population. This disease can progress to cirrhosis and liver cancer, posing a significant threat to public health. To facilitate early diagnosis and intervention, this study aims to develop an efficient and reliable prediction model for MASLD using machine learning algorithm.MethodsThis study included 9,232 participants aged 20 years and older from the 2017–2020 National Health and Nutrition Examination Survey (NHANES). After excluding individuals with frequent alcohol consumption, hepatitis B/C infection, those lacking liver ultrasound examinations, and samples with missing data, a total of 2,460 subjects were ultimately included. The dataset was split into training and testing sets in an 80:20 ratio. Five machine learning algorithms—XGBoost, Random Forest (RF), and Logistic Regression (LR), among others—were utilized to build prediction models, while Recursive Feature Elimination (RFE) was employed to identify key predictive factors.ResultsComparison of the five algorithms revealed that the XGBoost algorithm performed the best. Twelve key features were selected through Recursive Feature Elimination (RFE), and the model achieved an AUC of 0.8740 on the testing set, demonstrating excellent predictive accuracy and discriminative ability. SHAP plot analysis of the model showed that waist circumference, BMI, and other factors played a pivotal role in the prediction of MASLD.ConclusionThe prediction model developed using the XGBoost algorithm and the 12 selected features demonstrates high efficiency and stability in assessing MASLD risk. This model offers innovative technical solutions and data-driven support for the clinical early identification of high-risk populations, with the potential to optimize and refine MASLD prevention and control strategies.
创建时间:
2025-11-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作