
RF model key performance metrics.

Source: Figshare | Updated: 2026-03-30 | Indexed: 2026-04-28
Download link:
https://figshare.com/articles/dataset/_p_RF_model_key_performance_metrics_p_/31893336
Description:

Background
Neglected Tropical Diseases (NTDs) affect 1.5 billion people worldwide, with 39% of the global burden occurring in Africa. In Kenya, NTDs remain endemic despite control efforts, and the co-endemicity of soil-transmitted helminths (STH), schistosomiasis (SCH), and lymphatic filariasis (LF) complicates intervention strategies. This study developed machine learning models to predict high-risk co-endemic areas using demographic and Water, Sanitation, and Hygiene (WASH) indicators.

Methodology
The study analyzed Kenya’s 2022 NTD co-endemicity data from the Expanded Special Project for Elimination of Neglected Tropical Diseases, incorporating WASH and population variables. Three machine learning algorithms, Random Forest (RF), Gradient Boosting Machine (GBM), and Extreme Gradient Boosting (XGBoost), were trained to classify regions by STH prevalence level and co-endemicity status. Model performance was evaluated using cross-validation, the area under the Receiver Operating Characteristic curve (AUC), and variable importance analysis.

Results
The RF model achieved the highest predictive performance (AUC = 0.70), followed by XGBoost (AUC = 0.66) and GBM (AUC = 0.62). Key predictors included improved sanitation access (mean importance score: 0.24), population density (0.21), and co-endemicity with LF/SCH (0.18). Spatial analysis identified Eastern and North-Eastern Kenya as persistent hotspots, correlating with low WASH coverage (

Conclusion
Machine learning models effectively identified high-risk NTD co-endemic areas in Kenya, with RF outperforming the other models. These findings support targeted interventions that integrate WASH improvements with mass drug administration in the identified hotspots. We propose a real-time dashboard for dynamic risk mapping to optimize resource allocation, a strategy aligned with Kenya’s NTD Elimination Strategic Plan 2030.
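The AUC values reported above can be read as the probability that a model scores a randomly chosen high-risk area above a randomly chosen area that is not high-risk. The following is a minimal sketch of that calculation in plain Python. It is not the study's code: the function name `roc_auc` and the toy labels and scores are hypothetical and serve only to illustrate the metric.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC via pairwise comparison (Mann-Whitney U formulation).

    Returns the fraction of (positive, negative) pairs in which the
    positive example has the higher score; tied pairs count as half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative label")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data (not from the dataset):
# label 1 = high-risk area, label 0 = not high-risk.
toy_labels = [1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.4, 0.7, 0.5, 0.3, 0.2]

toy_auc = roc_auc(toy_labels, toy_scores)
print(f"AUC = {toy_auc:.2f}")  # 9 of 12 pairs ranked correctly -> 0.75
```

On this scale, 0.5 corresponds to random ranking and 1.0 to perfect ranking, so the reported RF value of 0.70 means roughly 70% of such pairs are ordered correctly. The pairwise loop is quadratic in the number of areas; it is chosen here for readability rather than speed.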
Created: 2026-03-30