Social Environment Characteristics of Bogota, Colombia, 2005 & 2018
收藏DataCite Commons2025-05-12 更新2025-04-15 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/LGHZE5
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is part of the ESCALA (Study of Urban Health and Climate Change in Informal Settlements in Latin America) project that was funded by the Lacuna Fund of the Meridian Institute https://lacunafund.org/.
This dataset contains sociodemographic data by city block from census data for Bogota, Colombia in 2005 and 2018 from DANE: National Administrative Department of Statistics (https://geoportal.dane.gov.co/) from the national population and demographic censuses. These data include proportion of individuals by sex, age, educational level, employment status from individual data, proportion of households in poverty or inadequate housing, and proportion of households with utility connections and dwelling quality within a city block. Data cleaning included: (1) Census data were provided at the level of persons, households, dwellings and spatial data (city blocks). To relate non-spatial and spatial data, city block codes (22 characters) were generated by concatenating the department code (2 characters), municipality (3 characters), class (1 character), rural sector (3 characters),rural section (2 characters), population center (3 characters), urban sector (4 characters), urban section (2 characters) and city block (2 characters).These codes were linked to the persons database. (2) The 2005 and 2018 census had some records with missing information on water and sewer connection which were filled with the category "Not reported". Regarding the wall material variable, the 2005 census did not report this information, so for that year this variable was filled in its entirety by the category “Not reported”. That same variable had some missing records in the 2018 census, which were managed in the same way. (3) The 2005 and 2018 census data were merged into one dataset with the following attributes: city block code, census year, water connection, sewer connection and wall durability categories. Poverty and inadequate housing datasets were merged using the city block ID, and only the attributes of interest were kept.The 2005 and 2018 educational level and employment status census data had two additional categories with no clear definition in the census documentation ("Not applicable" and "Not reported"). Those categories were merged into the "Not reported" category. The 2005 and 2018 census data were merged into one dataset with the following attributes: city block code, census year, sex, educational level, and employment status, combining the multiple categories of socioeconomic variables.
本数据集隶属于ESCALA项目(拉丁美洲非正式定居点城市健康与气候变化研究,Study of Urban Health and Climate Change in Informal Settlements in Latin America),该项目由子午线研究所(Meridian Institute)的拉库纳基金(Lacuna Fund)资助,资助方官网为https://lacunafund.org/。
本数据集包含哥伦比亚波哥大2005年与2018年的街区级社会人口统计学数据,数据源自哥伦比亚国家行政统计部门(DANE: National Administrative Department of Statistics,https://geoportal.dane.gov.co/)发布的全国人口与人口普查数据。
此类数据涵盖:基于个体层面数据统计的按性别、年龄、受教育程度、就业状态划分的个体占比,处于贫困或住房条件不佳状态的家庭占比,以及街区内拥有公用设施接入与住房质量情况的家庭占比。
数据清理流程如下:
1. 原始普查数据分别涵盖个体、家庭、住房及空间数据(街区层面)。为实现非空间数据与空间数据的关联,通过拼接以下字段生成22位字符的街区编码:部门代码(2位)、市政代码(3位)、类别(1位)、农村区域代码(3位)、农村片区代码(2位)、人口中心代码(3位)、城市区域代码(4位)、城市片区代码(2位)以及街区代码(2位),并将该编码与个体数据库进行关联。
2. 2005年与2018年的普查数据中,部分记录存在供水与排污接入信息缺失的情况,此类缺失值均填充为"未报告"类别。针对墙体材料变量,2005年普查未提供该信息,因此当年该变量全部填充为"未报告"类别;2018年普查中该变量亦存在部分缺失记录,采用相同方式处理。
3. 将2005年与2018年的普查数据合并为单一数据集,包含以下属性:街区编码、普查年份、供水接入情况、排污接入情况与墙体耐用性类别。通过街区ID关联贫困与住房条件不佳数据集,仅保留目标分析属性。
2005年与2018年的受教育程度及就业状态普查数据中,存在两个在普查文档中未明确定义的额外类别("不适用"与"未报告"),现将这两个类别合并至"未报告"类别中。随后将2005年与2018年的普查数据合并为单一数据集,包含以下属性:街区编码、普查年份、性别、受教育程度与就业状态,整合了社会经济变量的多个类别。
提供机构:
Harvard Dataverse
创建时间:
2025-01-18



