Replication data for "svy: A Python Package for Complex Survey Design and Analysis"
收藏DataCite Commons2026-05-05 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20039376
下载链接
链接失效反馈官方服务:
资源简介:
Replication data for: "svy: A Python Package for Complex Survey Design and Analysis"
This record contains the dataset required to reproduce the results in Diallo, 2026, submitted to the Journal of Statistical Software. The accompanying code is available at https://github.com/MamadouSDiallo/svy-jss-replication.
## Contents
hld_pop_wb_2023.parquet (91.9 MB): A synthetic household-level dataset of 2,501,755 records representing the full population of households in an imaginary middle-income country, containing variables typical of population censuses and household surveys (dwelling characteristics, water and sanitation, asset ownership, household expenditure, and household composition), generated using REaLTabFormer for training and simulation purposes.
## Usage
The replication script replication.py in the code repository expects these files to be placed in a data/ directory at the project root.
## Source and processing
This dataset is derived from synthetic survey data published by the World Bank Development Data Group:
World Bank (2023a). Synthetic Data for an Imaginary Country, Full Population, 2023. https://doi.org/10.48529/78M1-AE09
Note: The underlying datasets are entirely synthetic and are not intended for real-world applications. They are provided solely for educational and methodological purposes. The concepts, definitions, targets, and other specifications used in this work do not necessarily reflect those of the World Bank, the World Health Organization (WHO), UNICEF, or any other referenced institutions.
## License
Released under [CC-BY 4.0]. When using this dataset, please cite both the paper and this Zenodo record.
提供机构:
Zenodo
创建时间:
2026-05-05



