five

COVID-19 Case Surveillance Public Use Data

收藏
data.cdc.gov2024-07-09 更新2025-03-25 收录
下载链接:
https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data/vbim-akqf
下载链接
链接失效反馈
官方服务:
资源简介:
<b>Note:</b> Reporting of new COVID-19 Case Surveillance data will be discontinued July 1, 2024, to align with the process of removing SARS-CoV-2 infections (COVID-19 cases) from the list of nationally notifiable diseases. Although these data will continue to be publicly available, the dataset will no longer be updated. Authorizations to collect certain public health data expired at the end of the U.S. public health emergency declaration on May 11, 2023. The following jurisdictions discontinued COVID-19 case notifications to CDC: Iowa (11/8/21), Kansas (5/12/23), Kentucky (1/1/24), Louisiana (10/31/23), New Hampshire (5/23/23), and Oklahoma (5/2/23). Please note that these jurisdictions will not routinely send new case data after the dates indicated. As of 7/13/23, case notifications from Oregon will only include pediatric cases resulting in death. This case surveillance public use dataset has 12 elements for all COVID-19 cases shared with CDC and includes demographics, any exposure history, disease severity indicators and outcomes, presence of any underlying medical conditions and risk behaviors, and no geographic data. <h4><b>CDC has three COVID-19 case surveillance datasets:</b></h4><ul><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4">COVID-19 Case Surveillance Public Use Data with Geography</a>: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements)</li><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data/vbim-akqf">COVID-19 Case Surveillance Public Use Data</a>: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements)</li><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Restricted-Access-Detai/mbd7-r32t">COVID-19 Case Surveillance Restricted Access Detailed Data</a>: Restricted access, patient-level dataset with clinical and symptom data, demographics, and state and county of residence. Access requires a registration process and a data use agreement. (33 data elements)</li></ul> The following apply to all three datasets: <ul><li>Data elements can be found on the COVID-19 case report form located at <a href="https://www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf">www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf</a>.</li><li>Data are considered provisional by CDC and are subject to change until the data are reconciled and verified with the state and territorial data providers.</li><li>Some data cells are suppressed to protect individual privacy.</li><li>The datasets will include all cases with the earliest date available in each record (date received by CDC or date related to illness/specimen collection) at least 14 days prior to the creation of the current datasets. This 14-day lag allows case reporting to be stabilized and ensures that time-dependent outcome data are accurately captured.</li><li>Datasets are updated monthly.</li><li>Datasets are created using CDC’s <a href="https://www.cdc.gov/grants/additional-requirements/ar-25.html">Policy on Public Health Research and Nonresearch Data Management and Access</a> and include protections designed to protect individual privacy.</li><li>For more information about data collection and reporting, please see <a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html">https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html.</a></li><li>For more information about the COVID-19 case surveillance data, please see <a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html"> https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html</a><br></li></ul> <h4><b>Overview</b></h4> The COVID-19 case surveillance database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of Columbia (D.C.), as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the <a href="https://ndc.services.cdc.gov/search-results-year/">Nationally Notifiable Condition List</a> and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (<a href="https://cdn.ymaws.com/www.cste.org/resource/resmgr/ps/positionstatement2020/Interim-20-ID-01_COVID-19_NO.pdf">Interim-20-ID-01</a>). CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification (<a href="https://cdn.ymaws.com/www.cste.org/resource/resmgr/ps/positionstatement2020/Interim-20-ID-02_COVID-19.pdf">Interim-20-ID-02</a>). The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and reported voluntarily to CDC. For more information: <a href="https://www.cdc.gov/nndss/action/stories/covid-19-response.html">NNDSS Supports the COVID-19 Response | CDC</a>.<br> The deidentified data in the “COVID-19 Case Surveillance Public Use Data” include demographic characteristics, any exposure history, disease severity indicators and outcomes, clinical data, laboratory diagnostic test results, and presence of any underlying medical conditions and risk behaviors. All data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf. <h4><b>COVID-19 Case Reports</b></h4> COVID-19 case reports have been routinely submitted using nationally standardized case reporting forms. On April 5, 2020, CSTE released an Interim Position Statement with national surveillance case definitions for COVID-19 included. Current versions of these case definitions are available here: https://ndc.services.cdc.gov/case-definitions/coronavirus-disease-2019-2021/. All cases reported on or after were requested to be shared by public health departments to CDC using the standardized case definitions for laboratory-confirmed or probable cases. On May 5, 2020, the standardized case reporting form was revised. Case reporting using this new form is ongoing among U.S. states and territories. <h4><b>Data are Considered Provisional</b></h4> <ul><li>The COVID-19 case surveillance data are dynamic; case reports can be modified at any time by the jurisdictions sharing COVID-19 data with CDC. CDC may update prior cases shared with CDC based on any updated information from jurisdictions. For instance, as new information is gathered about previously reported cases, health departments provide updated data to CDC. As more information and data become available, analyses might find changes in surveillance data and trends during a previously reported time window. Data may also be shared late with CDC due to the volume of COVID-19 cases.</li><li>Annual finalized data: To create the final NNDSS data used in the annual tables, CDC works carefully with the reporting jurisdictions to reconcile the data received during the year until each state or territorial epidemiologist confirms that the data from their area are correct.</li><li>Access <a href="https://preparedness.cste.org/wp-content/uploads/2022/04/RaceEthnicityData_FINAL.pdf">Addressing Gaps in Public Health Reporting of Race and Ethnicity for COVID-19</a>, a report from the Council of State and Territorial Epidemiologists, to better understand the challenges in completing race and ethnicity data for COVID-19 and recommendations for improvement.</li></ul> <h3><b>Data Limitations</b></h3> To learn more about the limitations in using case surveillance data, visit <a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html">FAQ: COVID-19 Data and Surveillance</a>. <h4><b>Data Quality Assurance Procedures</b></h4> CDC’s Case Surveillance Section routinely performs data quality assurance procedures (i.e., ongoing corrections and logic checks to address data errors). To date, the following data cleaning steps have been implemented: <ul><li>Questions that have been left unanswered (blank) on the case report form are reclassified to a Missing value, if applicable to the question. For example, in the question “Was the individual hospitalized?” where the possible answer choices include “Yes,” “No,” or “Unknown,” the blank value is recoded to Missing because the case report form did not include a response to the question.</li><li>Logic checks are performed for date data. If an illogical date has been provided, CDC reviews the data with the reporting jurisdiction. For example, if a symptom onset date in the future is reported to CDC, this value is set to null until the reporting jurisdiction updates the date appropriately.</li><li>Additional data quality processing to recode free text data is ongoing. Data on symptoms, race and ethnicity, and healthcare worker status have been prioritized.</li></ul> <h4><b>Data Suppression</b></h4> To prevent release of data that could be used to identify people, data cells are suppressed for low frequency (<5) records and indirect identifiers (e.g., date of first positive specimen). Suppression includes rare combinations of demographic characteristics (sex, age group, race/ethnicity). Suppressed values are re-coded to the NA answer option; records with data suppression are never removed. <b>For questions, please contact Ask SRRG (<a href="mailto:eocevent394@cdc.gov">eocevent394@cdc.gov</a>).</b> <h4><b>Additional COVID-19 Data</b></h4> COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths by state and by county. These and other COVID-19 data are available from multiple public locations: <a href="https://www.cdc.gov/covid-data-tracker/">COVID Data Tracker</a>; <a href="https://data.cdc.gov/Case-Surveillance/Weekly-United-States-COVID-19-Cases-and-Deaths-by-/pwn4-m3yp">United States COVID-19 Cases and Deaths by State</a>; <a href="https://www.cdc.gov/vaccines/covid-19/reporting/index.html">COVID-19 Vaccination Reporting Data Systems</a>; and <a href="https://www.cdc.gov/nchs/nvss/covid-19.htm">COVID-19 Death Data and Resources</a>. <b>Notes:</b> <b>March 1, 2022:</b> The "COVID-19 Case Surveillance Public Use Data" will be updated on a monthly basis. <b>April 7, 2022:</b> An adjustment was made to CDC’s cleaning algorithm for COVID-19 line level case notification data. An assumption in CDC's algorithm led to misclassifying deaths that were not COVID-19 related. The algorithm has since been revised, and this dataset update reflects corrected individual level information about death status for all cases collected to date. <b>June 25, 2024:</b> An adjustment was made to CDC’s cleaning algorithm for COVID-19 line level case notification data. An assumption in CDC's algorithm led to misclassification of pediatric deaths from New York State. The algorithm has since been revised, and this dataset update reflects corrected individual level information about death status for all cases collected to date for New York State. <b>June 26, 2024:</b> Reported COVID-19 cases resulting in death of Texas residents should be interpreted with caution. In November 2023, CDC and Texas Department of State Health Services (DSHS) identified an unresolved issue in the demographics of COVID-19 patient case deaths as reported to CDC by Texas DSHS. As a result, Texas case deaths reported to COVID-19 line level case notification data are disproportionate compared with reporting by Texas DSHS and National Center for Health Statistics (NCHS). Because of this issue, an adjustment could not be implemented to correct the misclassifications. However, provisional COVID-19 death counts from the National Center for Health Statistics (NCHS) are updated with information from death certificates and offers accurate death counts.”

<b>注意:</b> 自2024年7月1日起,将停止报告新的COVID-19病例监测数据,以与从国家报告疾病清单中移除SARS-CoV-2感染(COVID-19病例)的过程相一致。尽管这些数据将继续公开可用,但数据集将不再更新。 美国公共卫生紧急状态声明于2023年5月11日结束,因此收集某些公共卫生数据的授权已到期。以下司法管辖区已停止向美国疾病控制与预防中心(CDC)报告COVID-19病例:爱荷华州(2021年11月8日)、堪萨斯州(2023年5月12日)、肯塔基州(2024年1月1日)、路易斯安那州(2023年10月31日)、新罕布什尔州(2023年5月23日)和俄克拉荷马州(2023年5月2日)。请注意,这些司法管辖区将在所指示日期之后不再常规发送新的病例数据。截至2023年7月13日,俄勒冈州的病例报告仅包括导致死亡的儿童病例。 此病例监测公共使用数据集包含12个元素,涵盖与CDC共享的所有COVID-19病例,包括人口统计学、任何暴露史、疾病严重程度指标和结果、任何潜在的医疗状况和风险行为,以及无地理数据。 <h4><b>CDC有三个COVID-19病例监测数据集:</b></h4> <ul><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4">带有地理信息的COVID-19病例监测公共使用数据</a>:公共使用、患者级别的数据集,包含临床数据(包括症状)、人口统计学以及居住的县和州。(19个数据元素)</li><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data/vbim-akqf">COVID-19病例监测公共使用数据</a>:公共使用、患者级别的数据集,包含临床和症状数据以及人口统计学,无地理数据。(12个数据元素)</li><li><a href="https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Restricted-Access-Detai/mbd7-r32t">COVID-19病例监测受限访问详细数据</a>:受限访问、患者级别的数据集,包含临床和症状数据、人口统计学以及居住的州和县。访问需要注册流程和数据使用协议。(33个数据元素)</li></ul> 以下适用于所有三个数据集: <ul><li>数据元素可在位于<a href="https://www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf">www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf</a>的COVID-19病例报告表中找到。</li><li>CDC认为数据是临时性的,并可能在数据与州和领地数据提供者核对和验证之前发生变化。</li><li>某些数据单元格被压制以保护个人隐私。</li><li>数据集将包括每个记录中可用的最早日期的所有病例(CDC接收日期或与疾病/标本采集相关的日期)至少在当前数据集创建前的14天之前。这个14天的滞后期允许病例报告稳定,并确保时间依赖性结果数据被准确捕获。</li><li>数据集每月更新。</li><li>数据集是根据CDC的<a href="https://www.cdc.gov/grants/additional-requirements/ar-25.html">公共卫生研究和非研究数据管理和访问政策</a>创建的,并包括旨在保护个人隐私的保护措施。</li><li>有关数据收集和报告的更多信息,请参阅<a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html">https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html。</a></li><li>有关COVID-19病例监测数据的更多信息,请参阅<a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html">https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html</a></li></ul> <h4><b>概述</b></h4> COVID-19病例监测数据库包括报告给美国各州和自主报告实体的个体级别数据,包括纽约市和哥伦比亚特区(D.C.),以及美国领土和附属机构。2020年4月5日,COVID-19被纳入<a href="https://ndc.services.cdc.gov/search-results-year/">国家报告条件列表</a>,并被州和领地流行病学家委员会(CSTE)临时立场声明归类为“立即报告,紧急(24小时内)”(<a href="https://cdn.ymaws.com/www.cste.org/resource/resmgr/ps/positionstatement2020/Interim-20-ID-01_COVID-19_NO.pdf">Interim-20-ID-01</a>)。CSTE于2020年8月5日更新了立场声明,以阐明病例分类中对抗原检测和血清学检测结果的理解。该声明还建议所有州和领地制定法律,使其辖区内的COVID-19可报告,并建议进行监测的司法管辖区向CDC提交病例通知。COVID-19病例监测数据由司法管辖区收集,并自愿报告给CDC。 有关更多信息: <a href="https://www.cdc.gov/nndss/action/stories/covid-19-response.html">NNDSS支持COVID-19应对 | CDC</a>。 “COVID-19病例监测公共使用数据”中的去标识化数据包括人口统计学特征、任何暴露史、疾病严重程度指标和结果、临床数据、实验室诊断检测结果以及任何潜在的医疗状况和风险行为。所有数据元素均可在位于www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf的COVID-19病例报告表中找到。 <h4><b>COVID-19病例报告</b></h4> COVID-19病例报告常规使用国家标准化病例报告表提交。2020年4月5日,CSTE发布了包含COVID-19国家监测病例定义的临时立场声明。这些病例定义的当前版本可在以下网址找到:https://ndc.services.cdc.gov/case-definitions/coronavirus-disease-2019-2021/。 所有报告的病例(包括或之后)均要求公共卫生部门使用实验室确诊或疑似病例的标准化病例定义与CDC共享。2020年5月5日,标准化病例报告表进行了修订。美国各州和领土正在使用此新表进行病例报告。 <h4><b>数据被视为临时数据</b></h4> <ul><li>COVID-19病例监测数据是动态的;共享COVID-19数据给CDC的司法管辖区可以随时修改病例报告。CDC可能会根据司法管辖区提供的任何更新信息更新之前与CDC共享的病例。例如,随着关于先前报告的病例的新信息的收集,卫生部门向CDC提供更新后的数据。随着更多信息和数据变得可用,分析可能会发现先前报告的时间窗口内的监测数据和趋势发生变化。由于COVID-19病例数量庞大,数据也可能延迟与CDC共享。</li><li>年度最终数据:为了创建年度表格中使用的最终NNDSS数据,CDC与报告司法管辖区精心合作,核对全年接收到的数据,直到每个州或领地流行病学家确认他们区域的数据是正确的。</li><li>访问<a href="https://preparedness.cste.org/wp-content/uploads/2022/04/RaceEthnicityData_FINAL.pdf">州和领地流行病学家委员会关于解决COVID-19公共卫生报告中种族和民族数据差距的报告</a>,以更好地了解完成COVID-19种族和民族数据所面临的挑战以及改进建议。</li></ul> <h3><b>数据局限性</b></h3> 了解更多关于使用病例监测数据局限性的信息,请访问<a href="https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html">COVID-19数据和监测常见问题解答</a>。 <h4><b>数据质量保证程序</b></h4> CDC的病例监测部门定期执行数据质量保证程序(即,持续纠正和逻辑检查以解决数据错误)。迄今为止,已实施了以下数据清理步骤: <ul><li>在病例报告表中未回答的问题(空白)将被重新分类为缺失值,如果适用的话。例如,在“个人是否住院?”这个问题中,可能的答案选项包括“是”、“否”或“未知”,空白值被重新编码为缺失值,因为病例报告表没有对这个问题提供回答。</li><li>对日期数据进行逻辑检查。如果提供了不合理的日期,CDC将与报告司法管辖区审查数据。例如,如果向CDC报告的症状开始日期在将来,则此值将被设置为null,直到报告司法管辖区适当地更新日期。</li><li>正在进行额外的数据质量处理以重新编码自由文本数据。症状、种族和民族、以及医护人员状态的数据已被优先处理。</li></ul> <h4><b>数据压制</b></h4> 为了防止发布可用于识别个人的数据,对低频(<5)记录和间接标识符(例如,首次阳性标本的日期)的数据单元格进行压制。压制包括人口统计学特征(性别、年龄组、种族/民族)的罕见组合。压制值被重新编码为NA答案选项;压制数据的记录永远不会被删除。 <b>如有疑问,请联系Ask SRRG (<a href="mailto:eocevent394@cdc.gov">eocevent394@cdc.gov</a>)。</b> <h4><b>其他COVID-19数据</b></h4> COVID-19数据以摘要或汇总计数文件的形式向公众提供,包括按州和县的总病例和死亡计数。这些和其他COVID-19数据可在多个公共位置获得:<a href="https://www.cdc.gov/covid-data-tracker/">COVID数据追踪器</a>;<a href="https://data.cdc.gov/Case-Surveillance/Weekly-United-States-COVID-19-Cases-and-Deaths-by-/pwn4-m3yp">美国COVID-19病例和死亡按州</a>;<a href="https://www.cdc.gov/vaccines/covid-19/reporting/index.html">COVID-19疫苗接种报告数据系统</a>;以及<a href="https://www.cdc.gov/nchs/nvss/covid-19.htm">COVID-19死亡数据和资源</a>。 <b>备注:</b> <b>2022年3月1日:</b>“COVID-19病例监测公共使用数据”将每月更新。 <b>2022年4月7日:</b>对CDC的COVID-19行级病例通知数据清理算法进行了调整。CDC算法中的一个假设导致未与COVID-19相关的死亡被误分类。该算法已得到修订,此数据集更新反映了迄今为止收集的所有病例的纠正后的死亡状态的个人级别信息。 <b>2024年6月25日:</b>对CDC的COVID-19行级病例通知数据清理算法进行了调整。CDC算法中的一个假设导致纽约州儿童死亡被误分类。该算法已得到修订,此数据集更新反映了迄今为止收集的纽约州所有病例的纠正后的死亡状态的个人级别信息。 <b>2024年6月26日:</b>应谨慎解释导致德克萨斯州居民死亡的报告COVID-19病例。2023年11月,CDC和德克萨斯州州卫生服务部(DSHS)确定了德克萨斯州向CDC报告的COVID-19患者死亡病例人口统计学中的一个未解决的问题。因此,与德克萨斯州DSHS和国家卫生统计中心(NCHS)的报告相比,德克萨斯州报告的COVID-19病例死亡比例不正常。由于这个问题,无法实施调整以纠正误分类。然而,国家卫生统计中心(NCHS)的临时COVID-19死亡计数已更新,并提供了准确的死亡计数。” }
提供机构:
data.cdc.gov
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作