STEP Skills Measurement Household Survey 2012 (Wave 1) - Colombia
收藏microdata.worldbank.org2025-01-15 收录
下载链接:
https://microdata.worldbank.org/index.php/catalog/2012
下载链接
链接失效反馈官方服务:
资源简介:
Abstract
---------------------------
The STEP (Skills Toward Employment and Productivity) Measurement program is the first ever initiative to generate internationally comparable data on skills available in developing countries. The program implements standardized surveys to gather information on the supply and distribution of skills and the demand for skills in labor market of low-income countries.
The uniquely-designed Household Survey includes modules that measure the cognitive skills (reading, writing and numeracy), socio-emotional skills (personality, behavior and preferences) and job-specific skills (subset of transversal skills with direct job relevance) of a representative sample of adults aged 15 to 64 living in urban areas, whether they work or not. The cognitive skills module also incorporates a direct assessment of reading literacy based on the Survey of Adults Skills instruments. Modules also gather information about family, health and language.
Geographic coverage
---------------------------
13 major metropolitan areas: Bogota, Medellin, Cali, Baranquilla, Bucaramanga, Cucuta, Cartagena, Pasto, Ibague, Pereira, Manizales, Monteira, and Villavicencio.
Analysis unit
---------------------------
The units of analysis are the individual respondents and households. A household roster is undertaken at the start of the survey and the individual respondent is randomly selected among all household members aged 15 to 64 included. The random selection process was designed by the STEP team and compliance with the procedure is carefully monitored during fieldwork.
Universe
---------------------------
The target population for the Colombia STEP survey is all non-institutionalized persons 15 to 64 years old (inclusive) living in private dwellings in urban areas of the country at the time of data collection. This includes all residents except foreign diplomats and non-nationals working for international organizations.
The following groups are excluded from the sample:
- residents of institutions (prisons, hospitals, etc.)
- residents of senior homes and hospices
- residents of other group dwellings such as college dormitories, halfway homes, workers' quarters, etc.
- persons living outside the country at the time of data collection.
Kind of data
---------------------------
Sample survey data [ssd]
Sampling procedure
---------------------------
Stratified 7-stage sample design was used in Colombia. The stratification variable is city-size category.
First Stage Sample
The primary sample unit (PSU) is a metropolitan area. A sample of 9 metropolitan areas was selected from the 13 metropolitan areas on the sample frame. The metropolitan areas were grouped according to city-size; the five largest metropolitan areas are included in Stratum 1 and the remaining 8 metropolitan areas are included in Stratum 2. The five metropolitan areas in Stratum 1 were selected with certainty; in Stratum 2, four metropolitan areas were selected with probability proportional to size (PPS), where the measure of size was the number of persons aged 15 to 64 in a metropolitan area.
Second Stage Sample
The second stage sample unit is a Section. At the second stage of sample selection, a PPS sample of 267 Sections was selected from the sampled metropolitan areas; the measure of size was the number of persons aged 15 to 64 in a Section. The sample of 267 Sections consisted of 243 initial Sections and 24 reserve Sections to be used in the event of complete non-response at the Section level.
Third Stage Sample
The third stage sample unit is a Block. Within each selected Section, a PPS sample of 4 blocks was selected; the measure of size was the number of persons aged 15 to 64 in a Block. Two sample Blocks were initially activated while the remaining two sample Blocks were reserved for use in cases where there was a refusal to cooperate at the Block level or cases where the block did not belong to the target population (e.g., parks, and commercial and industrial areas).
Fourth Stage Sample
The fourth stage sample unit is a Block Segment. Regarding the Block segmentation strategy, the Colombia document 'FINAL SAMPLING PLAN (ARD-397)' states "According to the 2005 population and housing census conducted by DANE, the average number of dwellings per block in the 13 large cities or metropolitan areas was approximately 42 dwellings. Based on this finding, the defined protocol was to report those cases in which 80 or more dwellings were present in a given block in order to partition block using a random selection algorithm." At the fourth stage of sample selection, 1 Block Segment was selected in each selected Block using a simple random sample (SRS) method.
Fifth Stage Sample
The fifth stage sample unit is a dwelling. At the fifth stage of sample selection, 5582 dwellings were selected from the sampled Blocks/Block Segments using a simple random sample (SRS) method. According to the Colombia document 'FINAL SAMPLING PLAN (ARD-397)', the selection of dwellings within a participant Block "was performed differentially amongst the different socioeconomic strata that the Colombian government uses for the generation of cross-subsidies for public utilities (in this case, the socioeconomic stratum used for the electricity bill was used). Given that it is known from previous survey implementations that refusal rates are highest amongst households of higher socioeconomic status, the number of dwellings to be selected increased with the socioeconomic stratum (1 being the poorest and 6 being the richest) that was most prevalent in a given block".
Sixth Stage Sample
The sixth stage sample unit is a household. At the sixth stage of sample selection, one household was selected in each selected dwelling using an SRS method.
Seventh Stage Sample
The seventh stage sample unit was an individual aged 15-64 (inclusive). The sampling objective was to select one individual with equal probability from each selected household.
Sampling methodologies are described for each country in two documents and are provided as external resources:
(i) the National Survey Design Planning Report (NSDPR)
(ii) the weighting documentation (available for all countries)
Mode of data collection
---------------------------
Face-to-face [f2f]
Research instrument
---------------------------
The STEP survey instruments include:
- The background questionnaire developed by the World Bank (WB) STEP team
- Reading Literacy Assessment developed by Educational Testing Services (ETS).
All countries adapted and translated both instruments following the STEP technical standards: two independent translators adapted and translated the STEP background questionnaire and Reading Literacy Assessment, while reconciliation was carried out by a third translator.
The survey instruments were piloted as part of the survey pre-test.
The background questionnaire covers such topics as respondents' demographic characteristics, dwelling characteristics, education and training, health, employment, job skill requirements, personality, behavior and preferences, language and family background.
The background questionnaire, the structure of the Reading Literacy Assessment and Reading Literacy Data Codebook are provided in the document "Colombia STEP Skills Measurement Survey Instruments", available in external resources.
Cleaning operations
---------------------------
STEP data management process:
1) Raw data is sent by the survey firm
2) The World Bank (WB) STEP team runs data checks on the background questionnaire data. Educational Testing Services (ETS) runs data checks on the Reading Literacy Assessment data. Comments and questions are sent back to the survey firm.
3) The survey firm reviews comments and questions. When a data entry error is identified, the survey firm corrects the data.
4) The WB STEP team and ETS check if the data files are clean. This might require additional iterations with the survey firm.
5) Once the data has been checked and cleaned, the WB STEP team computes the weights. Weights are computed by the STEP team to ensure consistency across sampling methodologies.
6) ETS scales the Reading Literacy Assessment data.
7) The WB STEP team merges the background questionnaire data with the Reading Literacy Assessment data and computes derived variables.
Detailed information on data processing in STEP surveys is provided in "STEP Guidelines for Data Processing", available in external resources. The template do-file used by the STEP team to check raw background questionnaire data is provided as an external resource, too.`
Response rate
---------------------------
An overall response rate of 48% was achieved in the Colombia STEP Survey.
摘要
---------------------------
STEP(就业与生产力技能测量)项目是首个旨在生成发展中国家技能国际可比数据的倡议。该项目通过实施标准化调查,收集关于低收入国家技能供给与分布以及技能在劳动力市场需求的资料。
独具匠心的家庭调查包括衡量代表样本成人(15至64岁,无论是否工作)认知技能(阅读、写作和计算)、社会情感技能(个性、行为和偏好)以及特定工作技能(与工作直接相关的跨技能子集)的模块。认知技能模块还结合了基于成人技能调查工具的直接阅读能力评估。模块还收集有关家庭、健康和语言的信息。
地理覆盖范围
---------------------------
13个主要大都市区:波哥大、麦德林、卡利、巴兰基亚、布卡拉曼加、库库塔、卡塔赫纳、帕斯托、伊巴瓜、佩雷拉、马尼萨莱斯、蒙特里亚和维利亚维西奥。
分析单元
---------------------------
分析单元为个体受访者和家庭。调查开始时进行家庭名单编制,并在所有年龄在15至64岁之间的家庭成员中随机选择个体受访者。随机选择过程由STEP团队设计,并在实地工作中严格监控其合规性。
总体
---------------------------
哥伦比亚STEP调查的目标总体为所有非机构化、年龄在15至64岁(含)之间、在数据收集时居住在私人住宅的城市地区的居民。这包括所有居民,但除外国外交官和国际组织的工作人员。
以下群体被排除在样本之外:
- 机构居民(监狱、医院等)
- 老年之家和安宁疗护机构居民
- 其他集体居住地居民,如大学宿舍、中途之家、工人宿舍等
- 数据收集时居住在国外的人员
数据类型
---------------------------
样本调查数据 [ssd]
抽样程序
---------------------------
在哥伦比亚使用了分层7级样本设计。分层变量为城市规模类别。
第一阶段样本
基本抽样单位(PSU)为大都市区。从样本框架中的13个大都市区中选择了9个大都市区样本。大都市区根据城市规模分组;前五个最大的大都市区包含在层1中,其余8个大都市区包含在层2中。层1中的五个大都市区被确定选中;在层2中,四个大都市区根据规模(以15至64岁人口数量衡量)以概率比例大小(PPS)被选中。
第二阶段样本
第二阶段样本单位为区域。在样本选择第二阶段,从样本大都市区中选择了267个区域的PPS样本;规模衡量标准为区域中15至64岁人口数量。267个区域样本包括243个初始区域和24个备用区域,以备区域层面完全无响应时使用。
第三阶段样本
第三阶段样本单位为街区。在每个选定的区域内,根据街区中15至64岁人口数量选择了4个街区的PPS样本。最初激活了两个样本街区,其余两个样本街区保留用于街区层面拒绝合作或街区不属于目标群体(例如,公园、商业和工业区)的情况。
第四阶段样本
第四阶段样本单位为街区段。关于街区分段策略,哥伦比亚文件《最终抽样计划(ARD-397)》中提到:“根据DANE于2005年进行的2005年人口和住房普查,13个大城市或大都市区中每个街区的平均住宅数量约为42套。基于这一发现,定义的协议是报告在给定街区中存在80套或更多住宅的情况,以使用随机选择算法对街区进行划分。”在样本选择第四阶段,每个选定的街区使用简单随机抽样(SRS)方法选择1个街区段。
第五阶段样本
第五阶段样本单位为住宅。在样本选择第五阶段,从样本街区/街区段中选择了5582套住宅,使用简单随机抽样(SRS)方法。根据哥伦比亚文件《最终抽样计划(ARD-397)》,在参与街区中选择住宅时,根据哥伦比亚政府用于公共事业交叉补贴的不同社会经济阶层(在这种情况下,用于电费的社会经济阶层)进行了差异化的选择。鉴于从前调查实施中得知,拒绝率在较高社会经济阶层的家庭中最高,因此根据给定街区中最普遍的社会经济阶层(1为最贫穷,6为最富裕)选择住宅的数量随着社会经济阶层而增加。
第六阶段样本
第六阶段样本单位为家庭。在样本选择第六阶段,在每个选定的住宅中使用SRS方法选择一个家庭。
第七阶段样本
第七阶段样本单位为15-64岁(含)的个人。抽样目标是选择每个选定的家庭中的一个个体,以等概率选择。
每个国家的抽样方法在两份文件中进行了描述,并作为外部资源提供:
(i) 国家调查设计规划报告(NSDPR)
(ii)加权文档(所有国家均可获得)
数据收集方式
---------------------------
面对面 [f2f]
研究工具
---------------------------
STEP调查工具包括:
- 世界银行(WB)STEP团队开发的背景问卷
- 由教育测试服务(ETS)开发的阅读能力评估。
所有国家都根据STEP技术标准对这两项工具进行了改编和翻译:两名独立的翻译者对STEP背景问卷和阅读能力评估进行了改编和翻译,而协调工作由第三名翻译者完成。
调查工具作为调查预测试的一部分进行了试点。
背景问卷涵盖了受访者的人口统计特征、住宅特征、教育和技术培训、健康、就业、工作技能要求、个性、行为和偏好、语言和家庭背景等主题。
背景问卷、阅读能力评估的结构和阅读能力数据代码簿在“哥伦比亚STEP技能测量调查工具”文件中提供,该文件可在外部资源中找到。
数据清洗操作
---------------------------
STEP数据管理流程:
1) 原始数据由调查公司发送
2) 世界银行(WB)STEP团队对背景问卷数据进行数据检查。教育测试服务(ETS)对阅读能力评估数据进行数据检查。将评论和问题发送回调查公司。
3) 调查公司审查评论和问题。当发现数据输入错误时,调查公司更正数据。
4) WB STEP团队和ETS检查数据文件是否清洁。这可能需要与调查公司进行额外的迭代。
5) 数据经过检查和清洗后,WB STEP团队计算权重。权重由STEP团队计算以确保抽样方法的连贯性。
6) ETS缩放阅读能力评估数据。
7) WB STEP团队将背景问卷数据与阅读能力评估数据合并,并计算派生变量。
STEP调查中数据处理的具体信息在“STEP数据加工指南”中提供,该指南可在外部资源中找到。STEP团队用于检查原始背景问卷数据的模板文件也作为外部资源提供。"
响应率
---------------------------
哥伦比亚STEP调查的整体响应率为48%。
提供机构:
microdata.worldbank.org



