five

CDC Open Data Product

收藏
Databricks2026-03-06 收录
下载链接:
https://marketplace.databricks.com/details/9ad35afb-9b10-4ca7-a26b-b812fbc9a3dc/Dataplex-Consulting-Data-Products_CDC-Open-Data-Product
下载链接
链接失效反馈
官方服务:
资源简介:
## Overview Access **1,300+ CDC health datasets** with **2 billion+ records** — fully transformed, continuously updated, and ready to query in your Databricks workspace. No pipelines to build. No infrastructure to manage. No engineering resources required. > **[Start Free Trial](https://trial.dataplex-consulting.com)** | **[Purchase Full Access](https://checkout.dataplex-consulting.com/b/14A6oAcXFfOw13y9y4bQY01)** Our automated platform monitors the CDC's entire open data catalog daily, detects new and updated datasets, and delivers production-ready tables directly to your Unity Catalog. Data is processed within 12 hours of publication and organized into **56 topic-specific schemas** covering infectious diseases, chronic conditions, vaccinations, environmental health, vital statistics, and more. **Pricing:** $565/month with 14-day free trial --- ## What's Included - **1,300+ datasets** across 56 organized topic schemas - **2 billion+ records** spanning public health surveillance, epidemiology, and population health - **Daily automated updates** — new data appears within 12 hours of CDC publication - **56 topic schemas** organized by subject area (e.g., vaccination data, NCHS statistics, NNDSS surveillance, chronic disease indicators) - **Full historical coverage** from 2015 to present - **Standardized column naming** and data types across all datasets - **Lineage tracking** — every row traces back to its source dataset and batch ### Top Schemas by Coverage - **NNDSS Data** — 295 tables — Nationally Notifiable Disease Surveillance - **NCHS Statistics** — 235 tables — National Center for Health Statistics - **Vaccination Data** — 88 tables — Immunization coverage and trends - **Public Health Surveillance** — 74 tables — Disease monitoring and outbreak data - **CDC Cities (500 Cities/PLACES)** — 64 tables — Local health estimates - **CDC Data Catalog** — 61 tables — Cross-cutting health indicators - **Other Data** — 60 tables — Specialized public health topics --- ## Why Buy vs. Build? Building and maintaining CDC data pipelines in-house means your engineering team is responsible for: - **Monitoring 1,300+ CDC endpoints** for new data, schema changes, and retracted datasets - **Handling rate limits, API failures, and retry logic** across government data sources - **Running daily ETL jobs** on provisioned compute to extract, transform, and load data - **Debugging pipeline failures** — including weekends and holidays when CDC publishes updates - **Managing schema evolution** as the CDC adds, renames, or removes columns - **Building and maintaining quality checks** to catch data anomalies before they reach analysts With this listing, all of that is handled for you. Your team can focus on **analysis, not infrastructure**. **Estimated build cost to replicate:** 2–3 senior data engineers × 6+ months, plus ongoing maintenance. **Cost with this listing:** $565/month — less than a single day of engineering time. --- ## Use Cases ### Population Health Analytics Track disease prevalence, mortality trends, and health disparities across geographies. Combine CDC surveillance data with your clinical or claims datasets to identify at-risk populations and measure intervention outcomes. ### Life Sciences & Pharma Access real-world evidence from CDC surveillance systems including FAERS-adjacent drug safety signals, vaccination coverage by demographics, and chronic disease burden data. Support regulatory submissions, pharmacovigilance, and market research with continuously updated public health data. ### Health Plans & Payers Benchmark population health metrics across service areas using CDC PLACES data (500+ health indicators at county and census tract level). Identify high-cost chronic disease concentrations and design targeted care management programs. ### Government & Policy Research Analyze NNDSS notifiable disease trends, NCHS vital statistics, and environmental health indicators for evidence-based policy development. Access standardized datasets spanning a decade of public health reporting. --- ## Featured Datasets - **ABCs (Active Bacterial Core surveillance)** — Invasive bacterial disease incidence across surveillance sites - **NNDSS Weekly Tables** — Nationally notifiable disease case counts by jurisdiction - **BRFSS (Behavioral Risk Factor Surveillance)** — Health-related risk behaviors and chronic conditions survey - **500 Cities / PLACES** — Local health estimates for counties and census tracts - **NCHS Vital Statistics** — Birth, death, and mortality data from national vital records - **COVID-19 Case Surveillance** — Detailed case-level COVID-19 public use data - **Vaccination Coverage (NIS)** — National Immunization Survey data by demographics - **CDC Wonder Mortality** — Detailed mortality statistics by cause, demographics, geography - **Chronic Disease Indicators** — 200+ indicators across 35+ chronic disease topics - **Environmental Health Tracking** — Environmental hazards and health outcome indicators --- ## Data Quality & Updates - **Update frequency:** Daily automated pipeline - **Processing time:** Within 12 hours of CDC publication - **Historical coverage:** January 2015 – present - **Quality checks:** Row count validation, schema drift detection, null rate monitoring - **Lineage:** Every record includes BATCH_ID linking to source dataset metadata - **Schema management:** Automatic detection and handling of CDC schema changes - **Monitoring:** 24/7 pipeline health monitoring with automated alerting --- ## Getting Started ### Free Trial Request access to start a **14-day free trial** with full access to all 1,300+ datasets. Explore the data, run queries, and validate the product fits your use case — no commitment required. ### Full Access After trial, continue with a monthly subscription at **$565/month**. Includes: - All current and future CDC datasets - Daily automated updates - Full historical backfill - Schema and lineage metadata - Email support from the Dataplex data engineering team --- ## Example Queries - **Count datasets by topic schema:** Query information_schema.tables filtered to cdc_dwv_% schemas grouped by table_schema - **Explore vaccination data:** Select from cdc_dwv_vacc_data schema tables - **Search by topic:** Query cdc_dwv.datasets and filter description by keyword (e.g., mortality, diabetes, vaccination) - **Latest batch data:** Join any data table to cdc_dwv.datasets_batches on batch_id and filter to is_latest_batch = TRUE See the included sample notebook for full working query examples. --- ## Documentation & Support - **Data Catalog:** [docs.dataplex-consulting.com/data-catalog/cdc-open-data-product](https://docs.dataplex-consulting.com/data-catalog/cdc-open-data-product) - **Support:** support@dataplex-consulting.com - **Provider:** [Dataplex Consulting & Data Products](https://www.dataplex-consulting.com) > **[Start Free Trial](https://trial.dataplex-consulting.com)** | **[Purchase Full Access](https://checkout.dataplex-consulting.com/b/14A6oAcXFfOw13y9y4bQY01)**
提供机构:
Dataplex Consulting & Data Products
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作