Performance of predictive models on test set.

Figshare2025-02-24 更新2026-04-28 收录

下载链接：

https://figshare.com/articles/dataset/Performance_of_predictive_models_on_test_set_/28474841

下载链接

链接失效反馈

官方服务：

资源简介：

ObjectiveThis study aimed to develop and compare machine learning models for predicting diabetic retinopathy (DR) using clinical and biochemical data, specifically logistic regression, random forest, XGBoost, and neural networks.MethodsA dataset of 3,000 diabetic patients, including 1,500 with DR, was obtained from the National Population Health Science Data Center. Significant predictors were identified, and four predictive models were developed. Model performance was assessed using accuracy, precision, recall, F1-score, and area under the curve (AUC).ResultsRandom forest and XGBoost demonstrated superior performance, achieving accuracies of 95.67% and 94.67%, respectively, with AUC values of 0.991 and 0.989. Logistic regression yielded an accuracy of 76.50% (AUC: 0.828), while neural networks achieved 82.67% accuracy (AUC: 0.927). Key predictors included 24-hour urinary microalbumin, HbA1c, and serum creatinine.ConclusionThe study highlights random forest and XGBoost as effective tools for early DR detection, emphasizing the importance of renal and glycemic markers in risk assessment. These findings support the integration of machine learning models into clinical decision-making for improved patient outcomes in diabetes management.

创建时间：

2025-02-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集