COVID-19 High Frequency Phone Survey of Households 2020 - Viet Nam
收藏microdata.worldbank.org2023-10-26 更新2025-01-15 收录
下载链接:
https://microdata.worldbank.org/index.php/catalog/3813
下载链接
链接失效反馈官方服务:
资源简介:
Abstract
---------------------------
The main objective of this project is to collect household data for the ongoing assessment and monitoring of the socio-economic impacts of COVID-19 on households and family businesses in Vietnam. The estimated field work and sample size of households in each round is as follows:
Round 1 June fieldwork- approximately 6300 households (at least 1300 minority households)
Round 2 August fieldwork - approximately 4000 households (at least 1000 minority households)
Round 3 September fieldwork- approximately 4000 households (at least 1000 minority households)
Round 4 December- approximately 4000 households (at least 1000 minority households)
Round 5 - pending discussion
Geographic coverage
---------------------------
National, regional
Analysis unit
---------------------------
Households
Kind of data
---------------------------
Sample survey data [ssd]
Sampling procedure
---------------------------
The 2020 Vietnam COVID-19 High Frequency Phone Survey of Households (VHFPS) uses a nationally representative household survey from 2018 as the sampling frame. The 2018 baseline survey includes 46980 households from 3132 communes (about 25% of total communes in Vietnam). In each commune, one EA is randomly selected and then 15 households are randomly selected in each EA for interview. Out of the 15 households, 3 households have information collected on both income and expenditure (large module) as well as many other aspects. The remaining 12 other households have information collected on income, but do not have information collected on expenditure (small module). Therefore, estimation of large module includes 9396 households and are representative at regional and national levels, while the whole sample is representative at the provincial level.
We use the large module of to select the households for official interview of the VHFPS survey and the small module households as reserve for replacement. The sample size of large module has 9396 households, of which, there are 7951 households having phone number (cell phone or line phone).
After data processing, the final sample size is 6,213 households.
Mode of data collection
---------------------------
Computer Assisted Telephone Interview [cati]
Research instrument
---------------------------
The questionnaire for Round 1 consisted of the following sections
Section 2. Behavior
Section 3. Health
Section 4. Education & Child caring
Section 5A. Employment (main respondent)
Section 5B. Employment (other household member)
Section 6. Coping
Section 7. Safety Nets
Section 8. FIES
Cleaning operations
---------------------------
Data cleaning began during the data collection process. Inputs for the cleaning process include available interviewers’ note following each question item, interviewers’ note at the end of the tablet form as well as supervisors’ note during monitoring. The data cleaning process was conducted in following steps:
• Append households interviewed in ethnic minority languages with the main dataset interviewed in Vietnamese.
• Remove unnecessary variables which were automatically calculated by SurveyCTO
• Remove household duplicates in the dataset where the same form is submitted more than once.
• Remove observations of households which were not supposed to be interviewed following the identified replacement procedure.
• Format variables as their object type (string, integer, decimal, etc.)
• Read through interviewers’ note and make adjustment accordingly. During interviews, whenever interviewers find it difficult to choose a correct code, they are recommended to choose the most appropriate one and write down respondents’ answer in detail so that the survey management team will justify and make a decision which code is best suitable for such answer.
• Correct data based on supervisors’ note where enumerators entered wrong code.
• Recode answer option “Other, please specify”. This option is usually followed by a blank line allowing enumerators to type or write texts to specify the answer. The data cleaning team checked thoroughly this type of answers to decide whether each answer needed recoding into one of the available categories or just keep the answer originally recorded. In some cases, that answer could be assigned a completely new code if it appeared many times in the survey dataset.
• Examine data accuracy of outlier values, defined as values that lie outside both 5th and 95th percentiles, by listening to interview recordings.
• Final check on matching main dataset with different sections, where information is asked on individual level, are kept in separate data files and in long form.
• Label variables using the full question text.
• Label variable values where necessary.
Response rate
---------------------------
The target for Round 1 is to complete interviews for 6300 households, of which 1888 households are located in urban area and 4475 households in rural area. In addition, at least 1300 ethnic minority households are to be interviewed. A random selection of 6300 households was made out of 7951 households for official interview and the rest as for replacement. However, the refusal rate of the survey was about 27 percent, and households from the small module in the same EA were contacted for replacement and these households are also randomly selected.
摘要
---------------------------
本项目的核心目标是收集越南家庭及家族企业受COVID-19社会经济影响评估与监控所需的家庭数据。各轮次现场工作和家庭样本量估算如下:
第一轮次(六月)现场工作 - 约计6300户家庭(至少1300户少数民族家庭)
第二轮次(八月)现场工作 - 约计4000户家庭(至少1000户少数民族家庭)
第三轮次(九月)现场工作 - 约计4000户家庭(至少1000户少数民族家庭)
第四轮次(十二月) - 约计4000户家庭(至少1000户少数民族家庭)
第五轮次 - 待定
地理覆盖范围
---------------------------
全国、区域
分析单元
---------------------------
家庭
数据类型
---------------------------
样本调查数据 [ssd]
抽样程序
---------------------------
2020年越南COVID-19高频电话家庭调查(VHFPS)采用2018年全国代表性家庭调查作为抽样框架。2018年基线调查包括来自3132个乡(约占越南总乡数的25%)的46980户家庭。在每个乡中,随机选择一个EA,然后在每个EA中随机选择15户家庭进行访谈。在15户家庭中,有3户家庭收集了收入和支出(大模块)以及许多其他方面的信息。其余12户其他家庭仅收集了收入信息,而没有收集支出信息(小模块)。因此,大模块的估计包括9396户家庭,并在区域和国家层面上具有代表性,而整个样本在省级层面上具有代表性。
我们使用大模块选择家庭进行VHFPS调查的官方访谈,并将小模块家庭作为备用。大模块的样本量为9396户,其中7951户有电话号码(手机或固定电话)。
数据处理后,最终样本量为6213户。
数据收集方式
---------------------------
计算机辅助电话访谈 [cati]
研究工具
---------------------------
第一轮次的问卷包括以下部分
第二节:行为
第三节:健康
第四节:教育与儿童照料
第五节A:就业(主要受访者)
第五节B:就业(其他家庭成员)
第六节:应对策略
第七节:安全网
第八节:FIES
数据清洗操作
---------------------------
数据清洗在数据收集过程中开始。清洗过程的输入包括每个问题项后的访谈员笔记、平板表单末尾的访谈员笔记以及监控过程中的监督员笔记。数据清洗过程按以下步骤进行:
• 将使用少数民族语言访谈的家庭与主要使用越南语访谈的数据库合并。
• 删除由SurveyCTO自动计算的无关变量。
• 删除数据集中提交了同一表格多次的家庭重复项。
• 删除不符合访谈程序的未访谈家庭观察值。
• 将变量格式化为它们的对象类型(字符串、整数、小数等)。
• 阅读访谈员笔记并根据需要进行调整。在访谈过程中,当访谈员发现难以选择正确的代码时,建议选择最合适的代码,并详细记录受访者的答案,以便调查管理团队对答案进行解释并作出决定。
• 根据监督员的笔记更正数据,当数据员输入错误代码时。
• 重新编码答案选项“其他,请说明”。此选项通常后跟一个空白行,允许访谈员输入或书写以指定答案。数据清洗团队将彻底检查此类答案,以确定每个答案是否需要重新编码到可用类别之一,或者仅保留原始记录的答案。在某些情况下,如果该答案在调查数据集中出现多次,则可以分配一个全新的代码。
• 通过听取访谈录音来检查异常值的准确性,异常值定义为位于第5百分位数和第95百分位数之外。
• 对主数据集与不同部分进行最终检查,其中个体层面的信息被分别保存在单独的数据文件和长格式文件中。
• 对变量使用完整问题文本进行标记。
• 在必要时对变量值进行标记。
响应率
---------------------------
第一轮次的目标是完成6300户家庭的访谈,其中1888户位于城市地区,4475户位于农村地区。此外,还需访谈至少1300户少数民族家庭。从7951户家庭中随机选择6300户进行官方访谈,其余作为备用。然而,调查的拒绝率约为27%,来自同一EA的小模块家庭被联系以进行替换,这些家庭也被随机选择。
提供机构:
microdata.worldbank.org



