Domestic Electrical Load Survey Secure Data 1994-2014 - South Africa
收藏www.datafirst.uct.ac.za2019-06-20 更新2025-01-09 收录
下载链接:
https://www.datafirst.uct.ac.za/dataportal/index.php/catalog/757
下载链接
链接失效反馈官方服务:
资源简介:
Abstract
---------------------------
This dataset contains sensitive data that has not been disclosed in the online version of the Domestic Electrical Load Survey (DELS) 1994-2014 dataset. In contrast to the DELS dataset, the DELS Secure Data contains partially anonymised survey responses with only the names of respondents and home owners removed. The DELSS contains street and postal addresses, as well as GPS level location data for households from 2000 onwards. The GPS data is obtained through an auxiliary dataset, the Site Reference database. Like the DELS, the DELSS dataset has been retrieved and anonymised from the original SQL database with the python package delretrieve.
Geographic coverage
---------------------------
The study had national coverage.
Analysis unit
---------------------------
Households and individuals
Universe
---------------------------
The survey covers electrified households that received electricity either directly from Eskom or from their local municipality. Particular attention was devoted to rural and low income households, as well as surveying households electrified over a range of years, thus having had access to electricity from recent times to several decades.
Kind of data
---------------------------
Sample survey data
Sampling procedure
---------------------------
See sampling procedure for DELS 1994-2014
Mode of data collection
---------------------------
Face-to-face [f2f]
Cleaning operations
---------------------------
This dataset has been produced by extracting only the survey responses from the original NRS Load Research SQL database using the saveAnswers function from the delretrieve python package (https://github.com/wiebket/delretrieve: release v1.0). Full instructions on how to use delretrieve to extract data are in the README file contained in the package.
PARTIAL DE-IDENTIFICATION
Partial de-identification was done in the process of extracting the data from the SQL database with the delretrieve package. Only the names of respondents and home owners have been removed from the survey responses by replacing responses with an 'a' in the dataset. Documents with full details of the variables that have been anonymised are included as external resources.
MISSING VALUES
Other than partial de-identification no post-processing was done and all database records, including missing values, are stored exactly as retrieved.
Data appraisal
---------------------------
See notes on data quality for DELS 1994-2014
摘要
---------------------------
本数据集包含未在1994-2014年国内电力负荷调查(DELS)在线版本中公开的敏感数据。与DELS数据集相比,DELS安全数据集包含部分匿名化的调查响应,仅移除了受访者和房主的名字。自2000年起,DELS安全数据集还包含了街道和邮政地址,以及家庭的位置数据,这些数据达到了GPS级别的精确度。GPS数据是通过辅助数据集——场地参考数据库获得的。与DELS类似,DELS安全数据集也是通过python包delretrieve从原始SQL数据库中检索并匿名化的。
地理覆盖范围
---------------------------
研究具有全国性覆盖。
分析单元
---------------------------
家庭和个人
总体
---------------------------
调查涵盖了直接从南非电力公司(Eskom)或当地市政府获得电力的通电家庭。特别关注农村和低收入家庭,以及对在不同年份通电的家庭进行调查,这些家庭从近期到几十年前都有电力供应的历史。
数据类型
---------------------------
样本调查数据
抽样程序
---------------------------
参见DELS 1994-2014的抽样程序
数据收集方式
---------------------------
面对面访谈 [f2f]
清洗操作
---------------------------
本数据集是通过使用delretrieve python包中的saveAnswers函数,仅从原始NRS负荷研究SQL数据库中提取调查响应而制作的(https://github.com/wiebket/delretrieve: 版本 v1.0)。如何使用delretrieve提取数据的完整说明包含在包中的README文件中。
部分去标识化
在从SQL数据库中提取数据的过程中进行了部分去标识化。通过在数据集中用'a'替换响应,仅从调查响应中移除了受访者和房主的名字。包含已匿名化变量完整详情的文件作为外部资源提供。
缺失值
除了部分去标识化外,没有进行任何后处理,数据库中的所有记录,包括缺失值,都按照检索到的原始状态存储。
数据评估
---------------------------
参见DELS 1994-2014的数据质量说明
提供机构:
www.datafirst.uct.ac.za



