World - Synthetic Data for an Imaginary Country, Sample, 2023
收藏WORLD BANK GROUP2024-11-01 更新2026-03-28 收录
下载链接:
https://datacatalog.worldbank.org/search/dataset/0064619
下载链接
链接失效反馈官方服务:
资源简介:
The dataset is a relational dataset of 8,000 households households, representing a sample of the population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country. The full-population dataset (with about 10 million individuals) is also distributed as open data.



