five

Multiple Indicator Cluster Survey 2005 - Tajikistan

收藏
catalog.ihsn.org2019-03-29 更新2025-03-22 收录
下载链接:
http://catalog.ihsn.org/catalog/966
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract --------------------------- The Multiple Indicator Cluster Survey (MICS) is a household survey programme developed by UNICEF to assist countries in filling data gaps for monitoring human development in general and the situation of children and women in particular. MICS is capable of producing statistically sound, internationally comparable estimates of social indicators. The current round of MICS is focused on providing a monitoring tool for the Millennium Development Goals (MDGs), the World Fit for Children (WFFC), as well as for other major international commitments, such as the United Nations General Assembly Special Session (UNGASS) on HIV/AIDS and the Abuja targets for malaria. Survey Objectives The 2005 Tajikistan Multiple Indicator Cluster Survey has as its primary objectives: - To provide up-to-date information for assessing the situation of children and women in Tajikistan - To furnish data needed for monitoring progress toward goals established in the Millennium Declaration, the goals of A World Fit For Children (WFFC), and other internationally agreed upon goals, as a basis for future action; - To contribute to the improvement of data and monitoring systems in Tajikistan and to strengthen technical expertise in the design, implementation, and analysis of such systems. Survey Content MICS questionnaires are designed in a modular fashion that can be easily customized to the needs of a country. They consist of a household questionnaire, a questionnaire for women aged 15-49 and a questionnaire for children under the age of five (to be administered to the mother or caretaker). Other than a set of core modules, countries can select which modules they want to include in each questionnaire. Survey Implementation The surveys was carried out by The State Committee on Statistics of the Republic of Tajikistan, with the support and assistance of UNICEF and other partners. Technical assistance and training for the surveys is provided through a series of regional workshops, covering questionnaire content, sampling and survey implementation; data processing; data quality and data analysis; report writing and dissemination. Survey results Results from the surveys, including national reports, standard sets of tabulations and micro level datasets will all be made widely available after completion of the surveys. Results from the surveys will also be made available in DevInfo format. DevInfo v5.0 is a powerful database system which has been adapted from UNICEF's ChildInfo technology to specifically monitor progress towards the Millennium Development Goals. MICS Results will also be available through UNICEF's web site dedicated to monitoring the situation of children and women at www.childinfo.org. Results of the prior round of MICS can already be found at this site. Geographic coverage --------------------------- The survey is nationally representative and covers the whole of Tajikistan. Analysis unit --------------------------- Households (defined as a group of persons who usually live and eat together) De jure household members (defined as memers of the household who usually live in the household, which may include people who did not sleep in the household the previous night, but does not include visitors who slept in the household the previous night but do not usually live in the household) Women aged 15-49 Children aged 0-4 Universe --------------------------- The survey covered all de jure household members (usual residents), all women aged 15-49 years resident in the household, and all children aged 0-4 years (under age 5) resident in the household. Kind of data --------------------------- Sample survey data [ssd] Sampling procedure --------------------------- The principal objective of the sample design was to provide current and reliable estimates on a set of indicators covering the four major areas of the World Fit for Children declaration, including promoting healthy lives; providing quality education; protecting against abuse, exploitation and violence; and combating HIV/AIDS. The population covered by the 2005-2006 MICS is defined as the universe of all women aged 15-49 and all children aged under 5. A sample of households was selected and all women aged 15-49 identified as usual residents of these households were interviewed. In addition, the mother or the caretaker of all children aged under 5 who were usual residents of the household were also interviewed about the child.The 2005 MICS collected data from a nationally representative sample of households, women and children. The primary focus of the 2005 MICS was to prodvide estimates of key population and health, education, child protection and HIV related indicators for the country as a whole, and for urban and rural areas separately. In additon, the sample was designed to provide estimates for each of the 5 regions for key indicators. Tajikistan is devided into 5 regions: Dushanbe (the capital), Direct Rule Districts (DRD), Sogd, Khatlon and Gorno Badakhshan (GBAO). Each of such division, for last census purposes, was further subdivided in convenient areas called “census section”, “instructor's sector” and “enumeration sector”(ESs). Each ES is either totally urban or rural. The list of ESs has census information on population information for each ES, and the ESs list are grouped by administrative unit. The size of the rural ES was 342 persons and the size of the urban ES was 378 persons. There were established 17923 ES in Tajikistan. The available demarcated cartographic material for each ES in localities with more than 5000 population made last census material as the most appropriate sample frame for the MICS2005. The primary sampling unit (PSU) - or cluster - for the MICS2005 is defined on basis of ES from the 2000 census frame, as having one (or more) ES per PSU. The number of clusters selected in each region was not allocated proportional to the size of the region, but approximately equal number of clusters was allocated to each region due to the need to present estimates for each of the 5 regions. A total of 290 clasters and 6,960 households were selected. The primary sampling unit variable is the cluster (HH1). The sampling stratas can be identified using a combination of the region (HH7) and area (HH6) variables, e.g. stratum = HH7*10 + HH6. A two-stage, stratified cluster sampling approach was used for the selection of the survey sample. The sampling domains are defined as urban and rural areas of each of the regions of Tajikistan. In the first stage, 290 PSU were selected in each stratum with equal probabilities. Within each cluster a fixed number of 24 households were selected. Updating of the list of households in each cluster was done prior to selection of the households. Since the distribution of clusters between sampling domains was not proportional to the census distribution of population and, consequently neither was the final household distribution, the sample is not self-weighting. For reporting national level results, sample weights are used. Following standard MICS data collection rules, if a household was actually more than one household when visited, then a) if the selected household contained two households, both were interviewed, or b) if the selected household contained 3 or more households, then only the household of the person named as the head was interviewd. No replacement of households was permitted in case of non-response or non-contactable households. Adjustments were made to the sampling weights to correct for non-response, according to MICS standard procedures. The sampling procedures are more fully described in the sampling design document and the sampling appendix of the final report. Sampling deviation --------------------------- No major deviations from the original sample design were made. All sample enumeration areas were accessed and successfully interviewed with good response rates. Mode of data collection --------------------------- Face-to-face [f2f] Research instrument --------------------------- The questionnaires for the Tajikistan MICS were structured questionnaires based on the MICS3 Model Questionnaire with some modifications and additions. A household questionnaire was administered in each household, which collected various information on household members including sex, age, relationship, and orphanhood status. The household questionnaire includes Household Listing, Education, Water and Sanitation, Household characteristics, Insecticide treated Net (ITN) , Child Labour, Child Discipline, Maternal Mortality, and Salt Iodization. In addition to a household questionnaire, questionnaires were administered in each household for women age 15-49 and children under age five. For children, the questionnaire was administered to the mother or caretaker of the child. The women's questionnaire include Women's Characteristics, Child Mortality, Maternal and Newborn Health, Marriage and Union, Contraception, Attitudes Towards Domestic Violence, HIV knowledge, and Tuberculosis knowledge. The children's questionnaire includes Children's Characteristics, Birth Registration and Early Learning, Child Development, Vitamin A, , Breastfeeding, Care of Illness, Malaria, Immunization, and Anthropometry. The questionnaires were developed in Russian from the MICS3 Model Questionnaires, and were translated into Tajik. After an initial review the questionnaires were translated back into English by an independent translator with no prior knowledge of the survey. The back translation from the Russian version was independently reviewed and compared to the English original. Differences in translation were reviewed and resolved in collaboration with the original translators. The Tajik and Russian questionnaires were both piloted as part of the survey pretest. All questionnaires and modules are provided as external resources. Cleaning operations --------------------------- Data were processed in clusters, with each cluster being processed as a complete unit through each stage of data processing. Each cluster goes through the following steps: 1) Questionnaire reception 2) Office editing and coding 3) Data entry 4) Structure and completeness checking 5) Verification entry 6) Comparison of verification data 7) Back up of raw data 8) Secondary editing 9) Edited data back up After all clusters are processed, all data is concatenated together and then the following steps are completed for all data files: 10) Export to SPSS in 4 files (hh - household, hl - household members, wm - women, ch - children under 5) 11) Recoding of variables needed for analysis 12) Adding of sample weights 13) Calculation of wealth quintiles and merging into data 14) Structural checking of SPSS files 15) Data quality tabulations 16) Production of analysis tabulations Details of each of these steps can be found in the data processing documentation, data editing guidelines, data processing programs in CSPro and SPSS, and tabulation guidelines. Please see http://www.childinfo.org/mics/mics3/manual.php Twelve data entry operators entered data using the CSPro software into twelve microcomputers. Two data entry supervisors supervised the whole process. All data entry was conducted at the State Committee on Statistics of the Republic of the Tajikistan head office using manual data entry. For data entry, CSPro version 2.6.007 was used with a highly structured data entry program, using system controlled approach, that controlled entry of each variable. All range checks and skips were controlled by the program and operators could not override these. A limited set of consistency checks were also included inthe data entry program. In addition, the calculation of anthropometric Z-scores was also included in the data entry programs for use during analysis. Open-ended responses ("Other" answers) were not entered or coded, except in rare circumstances where the response matched an existing code in the questionnaire. Structure and completeness checking ensured that all questionnaires for the cluster had been entered, were structurally sound, and that women's and children's questionnaires existed for each eligible woman and child. 100% verification of all variables was performed using independent verification, i.e. double entry of data, with separate comparison of data followed by modification of one or both datasets to correct keying errors by original operators who first keyed the files. After completion of all processing in CSPro, all individual cluster files were backed up before concatenating data together using the CSPro file concatenate utility. Data editing took place at a number of stages throughout the processing (see Other processing), including: a) Office editing and coding b) During data entry c) Structure checking and completeness d) Secondary editing e) Structural checking of SPSS data files Detailed documentation of the editing of data can be found in the data processing guidelines found in the MICS manual (please see http://www.childinfo.org/mics/mics3/manual.php) Response rate --------------------------- Of the 6.968 households selected for the sample, 6.961 were found to be occupied. Of these, 6.684 were successfully interviewed for a household response rate of 96 percent. In the interviewed households, 10.626 women (age 15-49) were identified. Of these, 10.243 were successfully interviewed, yielding a response rate of 96 percent. In addition, 4.370 children under the age of five were listed in the household questionnaire. Questionnaires were completed for 4.273 of these children, which corresponds to a response rate of 98 percent. Overall response rates of 93 and 94 percent are calculated for the women's and under-5's interviews respectively. The household response rates are slightly higher in rural than in urban areas, 97 compared to 94 percent. The response rates in Dushanbe are a little lower than in others regions, due to the busy lifestyle of the respondents living in the capital city. The lowest women's response rate is noted in GBAO region, 88 percent. Differences between number of sampled and occupied households almost does not exist, thanks to the previously updated household listings. Sampling error estimates --------------------------- Estimates from a sample survey are affected by two types of errors: 1) non-sampling errors and 2) sampling errors. Non-sampling errors are the results of mistakes made in the implementation of data collection and data processing. Numerous efforts were made during implementation of the 2005 MICS to minimize this type of error, however, non-sampling errors are impossible to avoid and difficult to evaluate statistically. Sampling errors can be evaluated statistically. The sample of respondents to the 2005 MICS is only one of many possible samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differe somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability in the results of the survey between all possible samples, and, although, the degree of variability is not known exactly, it can be estimated from the survey results. The sampling erros are measured in terms of the standard error for a particular statistic (mean or percentage), which is the square root of the variance. Confidence intervals are calculated for each statistic within which the true value for the population can be assumed to fall. Plus or minus two standard errors of the statistic is used for key statistics presented in MICS, equivalent to a 95 percent confidence interval. If the sample of respondents had been a simple random sample, it would have been possible to use straightforward formulae for calculating sampling errors. However, the 2005 MICS sample is the result of a multi-stage stratified design, and consequently needs to use more complex formulae. The SPSS complex samples module has been used to calculate sampling errors for the 2005 MICS. This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. This method is documented in the SPSS file CSDescriptives.pdf found under the Help, Algorithms options in SPSS. Sampling errors have been calculated for a select set of statistics (all of which are proportions due to the limitations of the Taylor linearization method) for the national sample, urban and rural areas, and for each of the five regions. For each statistic, the estimate, its standard error, the coefficient of variation (or relative error -- the ratio between the standard error and the estimate), the design effect, and the square root design effect (DEFT -- the ratio between the standard error using the given sample design and the standard error that would result if a simple random sample had been used), as well as the 95 percent confidence intervals (+/-2 standard errors). Details of the sampling errors are presented in the sampling errors appendix to the report and in the sampling errors table presented in the external resources. Data appraisal --------------------------- A series of data quality tables and graphs are available to review the quality of the data and include the following: Age distribution of the household population Age distribution of eligible women and interviewed women Age distribution of eligible children and children for whom the mother or caretaker was interviewed Age distribution of children under age 5 by 3 month groups Age and period ratios at boundaries of eligibility Percent of observations with missing information on selected variables Presence of mother inthe household and person interviewed for the under 5 questionnaire School attendance by single year age Sex ratio at birth among children ever born, surviving and dead by age of respondent Distribution of women by time since last birth Scatterplot of weight by height, weight by age and height by age Graph of male and female population by single years of age Population pyramid The results of each of these data quality tables is shown in the appendix of the final report and is also given in the external resources section. The general rule for presentation of missing data in the final report tabulations is that a column is presented for missing data if the percentage of cases with missing data is 1% or more. Cases with missing data on the background characteristics (e.g. education) are included in the tables, but the missing data rows are suppressed and noted at the bottom of the tables in the report (not in the SPSS output, however).

摘要 --------------------------- 联合国儿童基金会(UNICEF)开发的多指标集群调查(MICS)是一项旨在协助各国填补人类发展监测数据空白的家庭调查项目,尤其关注儿童和妇女的状况。MICS能够产生统计上可靠、国际可比的社会指标估计值。当前MICS的轮次旨在为千年发展目标(MDGs)、适合儿童的世界(WFFC)以及其他主要国际承诺(如联合国大会特别会议(UNGASS)关于艾滋病和 Abuja 恙虫病目标)提供监测工具。 调查目标 2005年塔吉克斯坦多指标集群调查的主要目标是: - 提供最新的信息,以评估塔吉克斯坦儿童和妇女的状况; - 提供数据,以监测实现千年宣言、适合儿童的世界(WFFC)和其他国际协议目标所取得的进展,作为未来行动的基础; - 为改善塔吉克斯坦的数据和监测系统做出贡献,并加强在设计、实施和分析此类系统方面的技术专长。 调查内容 MICS问卷以模块化方式设计,可轻松定制以满足各国的需求。它们包括家庭问卷、15-49岁女性问卷和5岁以下儿童问卷(由母亲或监护人填写)。除了核心模块外,各国可以选择在每个问卷中包含哪些模块。 调查实施 该调查由塔吉克斯坦共和国国家统计委员会实施,得到联合国儿童基金会和其他合作伙伴的支持和协助。通过一系列区域性研讨会提供调查的技术援助和培训,涵盖问卷内容、抽样和调查实施;数据处理;数据质量和数据分析;报告编写和传播。 调查结果 调查完成后,包括国家报告、标准汇总集和微观层面数据集的结果将广泛公布。调查结果也将以DevInfo格式提供。DevInfo v5.0是一个功能强大的数据库系统,它基于联合国儿童基金会的ChildInfo技术进行修改,专门用于监测实现千年发展目标的进展。MICS结果也将通过联合国儿童基金会监测儿童和妇女状况的网站www.childinfo.org提供。先前MICS轮次的结果已可在该网站找到。 地理覆盖范围 --------------------------- 该调查是全国代表性的,覆盖整个塔吉克斯坦。 分析单位 --------------------------- 家庭(定义为通常共同生活、共同进食的人员群体) 法定家庭成员(定义为通常居住在家庭中的人,可能包括前一天晚上没有在该家庭过夜但通常居住在该家庭的人,但不包括前一天晚上在该家庭过夜但通常不居住在该家庭的人) 15-49岁的妇女 0-4岁的儿童 总体 --------------------------- 该调查涵盖了所有法定家庭成员(常住居民)、所有在家庭中居住的15-49岁妇女和所有在家庭中居住的0-4岁儿童。 数据类型 --------------------------- 样本调查数据 [ssd] 抽样程序 --------------------------- 抽样设计的主要目标是提供关于涵盖《适合儿童的世界》宣言四大主要领域的指标的最新和可靠的估计值,包括促进健康生活、提供优质教育、防止虐待、剥削和暴力以及对抗艾滋病/艾滋病病毒。2005-2006年MICS涵盖的人口被定义为所有15-49岁妇女和所有5岁以下儿童的总体。选择了一组家庭样本,并采访了这些家庭中所有15-49岁的常住居民妇女。此外,还采访了所有5岁以下常住儿童的母亲或监护人,了解儿童的情况。2005年MICS从全国代表性的家庭、妇女和儿童样本中收集数据。2005年MICS的主要重点是提供关于整个国家以及城市和农村地区的关键人口和健康、教育、儿童保护和艾滋病相关指标的估计值。样本设计旨在为每个地区的关键指标提供估计值。塔吉克斯坦分为5个地区:杜尚别(首都)、直接统治区(DRD)、索格德、哈特隆和戈尔诺巴达赫尚(GBAO)。每个这样的分区,为了上一次普查的目的,进一步细分为方便的称为“普查区”、“指导员区”和“登记区”(ESs)的区域。每个ES要么完全是城市,要么完全是农村。ESs列表包含每个ES的人口信息,并按行政单位分组。农村ES的大小为342人,城市ES的大小为378人。在塔吉克斯坦建立了17,923个ES。对于人口超过5000的当地,可用的每个ES的划分地图材料作为MICS2005的最合适的样本框架。 MICS2005的主要抽样单位(PSU)或聚类,基于2000年普查框架的ES,每个PSU有一个(或多个)ES。每个地区选择的聚类数量并不按地区大小分配,而是由于需要为每个5个地区提供估计值,因此每个地区分配了大约相同数量的聚类。总共选择了290个聚类和6,960个家庭。主要抽样单位变量是聚类(HH1)。可以使用区域(HH7)和区域(HH6)变量的组合来识别抽样层,例如:层 = HH7 * 10 + HH6。 采用两阶段、分层聚类抽样方法选择调查样本。抽样领域定义为每个塔吉克斯坦地区城市和农村地区。在第一阶段,在每个层中按等概率选择了290个PSU。在每个聚类内,选择了一个固定数量的24个家庭。在家庭选择之前更新了每个聚类中的家庭名单。由于聚类在抽样领域之间的分布与人口普查分布不成比例,因此最终家庭分布也不成比例,样本不是自加权的。在报告国家层面结果时,使用样本权重。 根据标准MICS数据收集规则,如果访问时实际上有不止一个家庭,则:a)如果选定的家庭包含两个家庭,则两个家庭都接受采访;或者b)如果选定的家庭包含3个或更多家庭,则只采访被命名为户主的家庭。 在非响应或无法联系的家庭的情况下,不允许替换家庭。根据MICS标准程序对抽样权重进行调整,以纠正非响应。 抽样程序在抽样设计文档和最终报告的抽样附录中得到了更详细的描述。 抽样偏差 --------------------------- 没有对原始样本设计做出重大偏差。所有样本计数区都被访问并成功采访,响应率良好。 数据收集方式 --------------------------- 面对面 [f2f] 研究工具 --------------------------- 塔吉克斯坦MICS的问卷是基于MICS3模型问卷进行结构化设计的问卷,并进行了一些修改和补充。在每户家庭中进行了家庭问卷,收集了有关家庭成员的各种信息,包括性别、年龄、关系和孤儿状态。家庭问卷包括家庭登记、教育、水和卫生、家庭特征、杀虫剂处理的蚊帐(ITN)、儿童劳动、儿童纪律、孕产妇死亡和食盐加碘。 除了家庭问卷外,还为每户15-49岁的妇女和5岁以下的儿童进行了问卷。对于儿童,问卷由孩子的母亲或监护人填写。 妇女问卷包括妇女特征、儿童死亡率、孕产妇和新生儿保健、婚姻和联合、避孕、对家庭暴力的态度、艾滋病知识以及结核病知识。 儿童问卷包括儿童特征、出生登记和早期学习、儿童发展、维生素A、母乳喂养、疾病护理、疟疾、免疫接种和人体测量学。 问卷是从MICS3模型问卷以俄语开发的,并翻译成塔吉克语。在初步审查后,由一位没有事先了解调查的独立翻译员将问卷翻译回英语。独立审查了从俄语版本到英语版本的回译,并与英语原文进行了比较。翻译中的差异与原始翻译员合作进行了审查和解决。 塔吉克语和俄语问卷都作为调查预测试的一部分进行了试点。 所有问卷和模块都作为外部资源提供。 数据清理操作 --------------------------- 数据以聚类为单位进行处理,每个聚类都作为一个完整的单元通过数据处理的所有阶段。每个聚类都经过以下步骤:1)问卷接收;2)办公室编辑和编码;3)数据录入;4)结构和完整性检查;5)验证录入;6)验证数据比较;7)原始数据备份;8)二级编辑;9)编辑数据备份。所有聚类处理完成后,将所有数据连接起来,然后对所有数据文件执行以下步骤:10)导出到SPSS的4个文件(hh - 家庭,hl - 家庭成员,wm - 女性,ch - 5岁以下儿童);11)重新编码分析所需的变量;12)添加样本权重;13)计算财富五分位数并将其合并到数据中;14)SPSS文件的结构性检查;15)数据质量汇总;16)生产分析汇总。 每个这些步骤的详细信息可以在数据处理文档、数据编辑指南、CSPro和SPSS中的数据处理程序以及汇总指南中找到。请参阅http://www.childinfo.org/mics/mics3/manual.php 12个数据录入操作员使用CSPro软件在12台微型计算机上录入数据。两名数据录入监督员监督整个流程。所有数据录入都在塔吉克斯坦共和国国家统计委员会总部通过手动数据录入进行。对于数据录入,使用CSPro版本2.6.007,使用高度结构化的数据录入程序,采用系统控制方法,控制每个变量的录入。所有范围检查和跳过都由程序控制,操作员不能覆盖这些。 结构和完整性检查确保每个聚类都有问卷录入,结构良好,并且每个符合条件的妇女和儿童的问卷都存在。 100%的变量验证是通过独立验证进行的,即数据的双重录入,随后通过比较数据并进行修改来纠正原始操作员首次键入文件时的键入错误。 在CSPro中完成所有处理之后,在将数据连接起来之前,对所有单个聚类文件进行了备份。 在整个处理过程中,包括:a)办公室编辑和编码;b)数据录入期间;c)结构检查和完整性;d)二级编辑;e)SPSS数据文件的结构性检查。在数据处理的各个阶段都进行了数据编辑。 有关数据编辑的详细文档可以在MICS手册中的数据处理指南中找到(请参阅http://www.childinfo.org/mics/mics3/manual.php) 响应率 --------------------------- 在选定的6,968个样本家庭中,发现6,961个家庭有人居住。在这些家庭中,有6,684个家庭成功接受了采访,家庭响应率为96%。在采访的家庭中,确定了10,626名(15-49岁)妇女。在这些妇女中,有10,243人成功接受了采访,响应率为96%。此外,家庭问卷中列出了4,370名5岁以下的儿童。完成了4,273名这些儿童的问卷,对应于98%的响应率。妇女和5岁以下儿童的总体响应率分别计算为93%和94%。农村地区的家庭响应率略高于城市地区,为97%比94%。由于首都的受访者生活方式繁忙,杜尚别地区的响应率略低于其他地区。在GBAO地区,妇女的响应率最低,为88%。由于之前更新了家庭名单,样本家庭和占用家庭之间的数量差异几乎不存在。 抽样误差估计 --------------------------- 样本调查的估计受到两种类型错误的影响:1)非抽样误差和2)抽样误差。非抽样误差是由于数据收集和数据处理实施中的错误造成的。在实施2005年MICS期间,付出了许多努力来最大限度地减少此类错误,但是非抽样误差是不可能避免的,也难以进行统计评估。 抽样误差可以统计评估。2005年MICS的受访者样本只是从同一总体中按相同设计和预期规模选择的许多可能样本之一。这些样本中的每一个都会产生与实际样本选择的结果略有不同的结果。抽样误差是衡量所有可能样本之间调查结果变异性的一种度量,尽管变异性程度并不完全清楚,但可以从调查结果中估计出来。抽样误差是以特定统计量(均值或百分比)的标准误差来衡量的,这是方差的平方根。每个统计量都计算置信区间,其中可以假设总体真实值落在其中。MICS中展示的关键统计量的正负两个标准误差用于关键统计量。 如果受访者样本是简单随机样本,则可以使用简单的公式来计算抽样误差。然而,2005年MICS样本是多层次分层设计的产物,因此需要使用更复杂的公式。已经使用SPSS复杂样本模块计算了2005年MICS的抽样误差。该模块使用泰勒线性化方法估计调查估计的方差,该方法用于均值或比例。该方法在SPSS帮助选项下的算法选项中找到的SPSS文件CSDescriptives.pdf中进行了说明。 已为选定的统计量集合(所有这些统计量都是由于泰勒线性化方法的限制而成为比例)计算了国家样本、城市和农村地区以及每个5个地区的抽样误差。对于每个统计量,估计值、其标准误差、变异系数(或相对误差——标准误差与估计值的比率)、设计效应以及设计效应的平方根(DEFT——使用给定样本设计时的标准误差与如果使用简单随机样本将产生的标准误差之间的比率),以及95%置信区间(±2个标准误差)。 抽样误差的详细信息在报告的抽样误差附录中提供,并在外部资源部分提供的抽样误差表中提供。 数据评估 --------------------------- 一系列数据质量表和图表可用于审查数据质量,包括以下内容: - 家庭人口年龄分布 - 合格妇女和采访妇女的年龄分布 - 合格儿童和母亲或监护人接受采访的儿童的年龄分布 - 5岁以下儿童的年龄和时间段比率 - 具有选定变量缺失信息的观测值的百分比 - 家庭中是否有母亲以及为5岁以下儿童问卷接受采访的人 - 单一年龄的学龄儿童 - 儿童出生时男性和女性的性别比,按受访者的年龄分组 - 妇女按上次分娩时间的分布 - 体重与身高、体重与年龄以及身高与年龄的散点图 - 按单一年龄分组男性和女性人口的图表 - 人口金字塔 每个这些数据质量表的结果都显示在最终报告的附录中,也提供在外部资源部分。 在最终报告汇总中呈现缺失数据的一般规则是,如果缺失数据的百分比达到1%或更多,则呈现缺失数据列。背景特征(例如教育)的缺失数据案例包括在表中,但缺失数据行在报告的底部被抑制并注明(但在SPSS输出中不注明)。
提供机构:
catalog.ihsn.org
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作