five

Hyperparameters explored in RFRs.

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Hyperparameters_explored_in_RFRs_/30212542
下载链接
链接失效反馈
官方服务:
资源简介:
Household surveys have been the foundation for poverty measurement in developing countries for the past half-century, but the spatial and temporal gaps in these survey data often limit how well anti-poverty programs can be targeted, monitored, or evaluated. To fill in these gaps, analysts and policymakers increasingly turn to machine learning (ML) methods to predict indices of asset wealth from satellite-based and other geospatial data. However, to date, the potential for gender-related differences in these methods’ performance has not been investigated. We implement a frequently used class of ML models (random forests) relying on readily accessible geospatial data and trained on and validated against a widely used source of asset holdings (a recent round of the Demographic & Health Survey in Ghana). By separately aggregating the asset holdings of female- and male-headed households within each survey cluster, we are able to estimate the distinctions in performance of ML models trained on each of these gender-based asset indices. We find that models trained on data from male-headed households achieve an impressive level of predictive accuracy (R2 = 0.85), while those trained on data from female-headed households achieve reasonable but notably lower accuracy (R2 = 0.75). Roughly half of this gap appears to be driven in large part by the relatively smaller number of female-headed households in the survey sample. While we cannot rule out that the ML models themselves play a role in creating differences in performance across gender, it appears that these gaps may largely be a reflection of the sampling designs of the underlying survey data used as inputs for these models. Our findings confirm that ML models can be used to extend the spatial and temporal scope of these survey data to populations that were not randomly sampled, even while encouraging larger samples of female-headed households in survey designs to improve the predictive accuracy of ML models for female-headed households.
创建时间:
2025-09-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作