STEP Skills Measurement Household Survey 2012 (Wave 1), Yunnan Province - China
收藏microdata.worldbank.org2016-03-23 更新2025-01-16 收录
下载链接:
https://microdata.worldbank.org/index.php/catalog/2019
下载链接
链接失效反馈官方服务:
资源简介:
Abstract
---------------------------
The STEP (Skills Toward Employment and Productivity) Measurement program is the first ever initiative to generate internationally comparable data on skills available in developing countries. The program implements standardized surveys to gather information on the supply and distribution of skills and the demand for skills in labor market of low-income countries.
The uniquely-designed Household Survey includes modules that measure the cognitive skills (reading, writing and numeracy), socio-emotional skills (personality, behavior and preferences) and job-specific skills (subset of transversal skills with direct job relevance) of a representative sample of adults aged 15 to 64 living in urban areas, whether they work or not. The cognitive skills module also incorporates a direct assessment of reading literacy based on the Survey of Adults Skills instruments. Modules also gather information about family, health and language.
Geographic coverage
---------------------------
Areas are classified as urban based on each country's official definition.Some STEP surveys had narrower urban sampling. In Yunnan Province the sample covered the urban areas of Kunming.
- Detailed information is provided in the weighting documentation.
Analysis unit
---------------------------
The units of analysis are the individual respondents and households. A household roster is undertaken at the start of the survey and the individual respondent is randomly selected among all household members aged 15 to 64 included. The random selection process was designed by the STEP team and compliance with the procedure is carefully monitored during fieldwork.
Universe
---------------------------
The STEP target population is the urban population aged 15 to 64 included, living in urban areas, as defined by each country's statistical office.
The target population for the China-Yunnan STEP survey comprised all non-institutionalized persons 15 to 64 years of age (inclusive) living in private dwellings in urban areas of Kunming at the time of data collection.
The following are excluded from the sample:
- Residents of institutions (prisons, hospitals, etc)
- Residents of senior homes and hospices
- Residents of other group dwellings such as college dormitories, halfway homes, workers' quarters, etc
- Persons living outside the country at the time of data collection
In some countries, extremely remote villages or conflict-ridden regions could not be surveyed. These cases are listed in the weighting documentation.
Kind of data
---------------------------
Sample survey data [ssd]
Sampling procedure
---------------------------
The China-Yunnan survey firm implemented a partial literacy assessment design. The partial assessment required each selected person to attempt to complete a General Booklet comprising Reading Components and a set of Core Literacy Items. The partial assessment sampling objective was to have a minimum of about 2000 selected persons attempt the General Booklet. The target population for the China-Yunnan STEP survey comprised all non-institutionalized persons 15 to 64 years of age (inclusive) living in private dwellings in urban areas of Kunming at the time of data collection. The sample frame for the selection of first stage sample units was the Excel file 'sampling frame for STEP _CHINA' that was provided by the China-Yunnan survey firm. The frame is a complete list of first stage sampling units in the urban areas of Kunming. The source of this sample frame is the National Population Census, November, 2010. The sample frame includes 5564 PSUs in 299 Census Enumeration Areas. According to the sample frame, there are 1,067,256 households in the 5564 PSUs.
The China-Yunnan sample design was a 3 stage cluster sample design.
First Stage Sample
The primary sample unit (PSU) is a Census Enumeration Area (CEA) Block. The sampling objective was to conduct interviews in 135 CEA Blocks. At the first stage of sample selection, 27 additional PSUs were also selected as reserve PSUs to be used in the event that it was impossible to obtain any interviews in one or more of the initial PSUs. A total of 162 PSUs were selected with probability proportional to size, where the measure of size was the number of households in a PSU. Subsequently, from the file of 162 sampled PSUs, a PPS sample of 135 PSUs was selected to be the 'Initial' PSU sample. Note that none of the 27 reserve PSUs was activated during data collection.
Second Stage Sample
The second stage sample unit (SSU) is a household. The sampling objective was to obtain interviews at 15 households within each selected PSU. At the second stage of sample selection, 30 households were selected in each PSU using a systematic random method. The 30 households were randomly divided into 15 'Initial' households, and 15 'Reserve' households that were ranked according to the random sample selection order.
Third Stage Sample
The third stage sample unit was an individual aged 15-64 (inclusive). The sampling objective was to select one individual with equal probability from each selected household.
Mode of data collection
---------------------------
Face-to-face [f2f]
Research instrument
---------------------------
The STEP survey instruments include:
- The background Questionnaire developed by the WB STEP team
- Reading Literacy Assessment developed by Educational Testing Services (ETS).
All countries adapted and translated both instruments following the STEP Technical Standards: 2 independent translators adapted and translated the Background Questionnaire and Reading Literacy Assessment, while reconciliation was carried out by a third translator.
The WB STEP team and ETS collaborated closely with the Chinese survey firm during the process and reviewed the adaptation and translation to Mandarin using a back translation.
The survey instruments were both piloted as part of the survey pretest.
The adapted Background Questionnaires are provided in English as external resources. The Reading Literacy Assessment is protected by copyright and will not be published.
Cleaning operations
---------------------------
STEP Data Management Process:
1) Raw data is sent by the survey firm
2) The WB STEP team runs data checks on the Background Questionnaire data.
- ETS runs data checks on the Reading Literacy Assessment data.
- Comments and questions are sent back to the survey firm.
3) The survey firm reviews comments and questions. When a data entry error is identified, the survey firm corrects the data.
4) The WB STEP team and ETS check the data files are clean. This might require additional iterations with the survey firm.
5) Once the data has been checked and cleaned, the WB STEP team computes the weights. Weights are computed by the STEP team to ensure consistency across sampling methodologies.
6) ETS scales the Reading Literacy Assessment data.
7) The WB STEP team merges the Background Questionnaire data with the Reading Literacy Assessment data and computes derived variables.
Detailed information data processing in STEP surveys is provided in the 'Guidelines for STEP Data Entry Programs' document provided as an external resource. The template do-file used by the STEP team to check the raw background questionnaire data is provided as an external resource.
Response rate
---------------------------
The response rate for Yunnan Province (urban) was 98% (See STEP Methodology Note Table 4)
Sampling error estimates
---------------------------
A weighting documentation was prepared for each participating country and provides some information on sampling errors.
All country weighting documentations are provided as an external resource.
摘要
---------------------------
STEP(就业与生产力技能测量)项目是首个旨在生成发展中国家技能可用性的国际可比数据的倡议。该项目通过实施标准化调查,收集低收入国家劳动力市场中技能供应与分配以及技能需求的信息。
该独特设计的家庭调查包括衡量15至64岁成年人认知技能(阅读、写作和计算)、社会情感技能(个性、行为和偏好)以及特定职业技能(与直接工作相关的跨技能子集)的模块。认知技能模块还结合了基于成人技能调查工具的直接阅读素养评估。模块还收集有关家庭、健康和语言的信息。
地理覆盖范围
---------------------------
地区根据每个国家的官方定义划分为城市。一些STEP调查具有较窄的城市抽样范围。在云南省,样本涵盖了昆明市的城区。
- 详细信息可在加权文档中找到。
分析单位
---------------------------
分析单位为个人受访者和家庭。调查开始时进行家庭名单登记,并在所有15至64岁的家庭成员中随机选择个人受访者。随机选择过程由STEP团队设计,并在实地工作中严格监控程序遵守情况。
总体
---------------------------
STEP的目标总体为15至64岁的城市人口,包括生活在城市地区的居民,其定义由各国的统计办公室确定。
中国-云南STEP调查的目标总体为所有非机构化人员,包括在数据收集时居住在昆明市城区私人住宅中的15至64岁(含)人员。
以下人员不包括在样本中:
- 机构(监狱、医院等)的居民
- 老年院和安宁疗养院的居民
- 其他集体住宅的居民,如大学宿舍、中途之家、工人宿舍等
- 数据收集时居住在国外的人员
在一些国家,偏远村庄或冲突地区无法进行调查。这些情况列在加权文档中。
数据类型
---------------------------
样本调查数据 [ssd]
抽样程序
---------------------------
中国-云南调查公司实施了部分阅读能力评估设计。部分评估要求每个选定的人员尝试完成一份包括阅读组件和一组核心阅读能力项目的通用手册。部分评估抽样目标是让至少约2000名选定的人员尝试完成通用手册。中国-云南STEP调查的目标总体为所有非机构化人员,包括在数据收集时居住在昆明市城区私人住宅中的15至64岁(含)人员。
样本选择的第一阶段样本单位的框架是由中国-云南调查公司提供的'STEP _CHINA抽样框架'Excel文件。该框架是昆明市城区第一阶段抽样单位的完整清单。该样本框架的来源是2010年11月的全国人口普查。样本框架包括5564个抽样单位,分布在299个普查登记区。根据样本框架,5564个抽样单位中有1067256户家庭。
中国-云南的样本设计是一个三阶段集群样本设计。
第一阶段样本
主要样本单位(PSU)是普查登记区(CEA)块。抽样目标是采访135个CEA块。在第一阶段样本选择中,还选择了27个额外的PSU作为储备PSU,以备在无法从一个或多个初始PSU中获得任何访谈的情况下使用。总共选择了162个PSU,其选择概率与规模成比例,其中规模的衡量标准是PSU中的家庭数量。随后,从162个抽样PSU的文件中,选择了135个PSU的PPS样本作为“初始”PSU样本。请注意,在数据收集期间,27个储备PSU都没有被激活。
第二阶段样本
第二阶段样本单位(SSU)是家庭。抽样目标是获取每个选定PSU中15个家庭的访谈。在第二阶段样本选择中,每个PSU使用了系统随机方法选择了30个家庭。30个家庭被随机分为15个“初始”家庭和15个根据随机样本选择顺序排名的“储备”家庭。
第三阶段样本
第三阶段样本单位是15至64岁(含)的个人。抽样目标是从每个选定家庭中随机选择一个个人。
数据收集方式
---------------------------
面对面 [f2f]
研究工具
---------------------------
STEP调查工具包括:
- 由世界银行STEP团队开发的背景问卷
- 由教育测试服务(ETS)开发的阅读素养评估。
所有国家都根据STEP技术标准对这两套工具进行了改编和翻译:2名独立的翻译者对背景问卷和阅读素养评估进行了改编和翻译,而协调工作则由第三位翻译者完成。
WB STEP团队和ETS在过程中与中国的调查公司紧密合作,并使用回译对改编和翻译成普通话的过程进行了审查。
调查工具作为调查预测试的一部分进行了试点。
改编的背景问卷以英语作为外部资源提供。阅读素养评估受版权保护,将不予发表。
数据清理操作
---------------------------
STEP数据管理流程:
1) 调查公司发送原始数据
2) WB STEP团队对背景问卷数据进行数据检查。
- ETS对阅读素养评估数据进行数据检查。
- 将评论和问题发送回调查公司。
3) 调查公司审查评论和问题。当发现数据输入错误时,调查公司会纠正数据。
4) WB STEP团队和ETS检查数据文件是否干净。这可能需要与调查公司进行额外的迭代。
5) 一旦数据经过检查和清理,WB STEP团队就会计算权重。STEP团队计算权重以确保抽样方法之间的一致性。
6) ETS对阅读素养评估数据进行缩放。
7) WB STEP团队将背景问卷数据与阅读素养评估数据合并,并计算衍生变量。
详细的数据处理信息可在提供的'STEP数据输入程序指南'文档中找到。STEP团队用于检查原始背景问卷数据的模板do-file作为外部资源提供。
响应率
---------------------------
云南省(城市)的响应率为98%(见STEP方法论注释表4)
抽样误差估计
---------------------------
为每个参与国准备了加权文档,并提供了一些有关抽样误差的信息。
所有国家的加权文档都作为外部资源提供。
提供机构:
microdata.worldbank.org



