World - Synthetic Data for an Imaginary Country, Full Population, 2023
收藏WORLD BANK GROUP2023-07-02 更新2026-03-28 收录
下载链接:
https://datacatalog.worldbank.org/search/dataset/0064618
下载链接
链接失效反馈官方服务:
资源简介:
The dataset is a relational dataset of 10,003,891 individuals (2,501,755 households), representing the entire population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country. A sample dataset of 8000 households was created out of this full-population dataset, and is also distributed as open data.



