CDC Open Data Product
Databricks | Indexed 2026-03-06
Download link:
https://marketplace.databricks.com/details/9ad35afb-9b10-4ca7-a26b-b812fbc9a3dc/Dataplex-Consulting-Data-Products_CDC-Open-Data-Product
Resource description:
## Overview
Access **1,300+ CDC health datasets** with **2 billion+ records** — fully transformed, continuously updated, and ready to query in your Databricks workspace. No pipelines to build. No infrastructure to manage. No engineering resources required.
> **[Start Free Trial](https://trial.dataplex-consulting.com)** | **[Purchase Full Access](https://checkout.dataplex-consulting.com/b/14A6oAcXFfOw13y9y4bQY01)**
Our automated platform monitors the CDC's entire open data catalog daily, detects new and updated datasets, and delivers production-ready tables directly to your Unity Catalog. Data is processed within 12 hours of publication and organized into **56 topic-specific schemas** covering infectious diseases, chronic conditions, vaccinations, environmental health, vital statistics, and more.
**Pricing:** $565/month with a 14-day free trial
---
## What's Included
- **1,300+ datasets** across 56 organized topic schemas
- **2 billion+ records** spanning public health surveillance, epidemiology, and population health
- **Daily automated updates** — new data appears within 12 hours of CDC publication
- **56 topic schemas** organized by subject area (e.g., vaccination data, NCHS statistics, NNDSS surveillance, chronic disease indicators)
- **Full historical coverage** from 2015 to present
- **Standardized column naming** and data types across all datasets
- **Lineage tracking** — every row traces back to its source dataset and batch
### Top Schemas by Coverage
- **NNDSS Data** — 295 tables — Nationally Notifiable Disease Surveillance
- **NCHS Statistics** — 235 tables — National Center for Health Statistics
- **Vaccination Data** — 88 tables — Immunization coverage and trends
- **Public Health Surveillance** — 74 tables — Disease monitoring and outbreak data
- **CDC Cities (500 Cities/PLACES)** — 64 tables — Local health estimates
- **CDC Data Catalog** — 61 tables — Cross-cutting health indicators
- **Other Data** — 60 tables — Specialized public health topics
---
## Why Buy vs. Build?
Building and maintaining CDC data pipelines in-house means your engineering team is responsible for:
- **Monitoring 1,300+ CDC endpoints** for new data, schema changes, and retracted datasets
- **Handling rate limits, API failures, and retry logic** across government data sources
- **Running daily ETL jobs** on provisioned compute to extract, transform, and load data
- **Debugging pipeline failures** — including weekends and holidays when CDC publishes updates
- **Managing schema evolution** as the CDC adds, renames, or removes columns
- **Building and maintaining quality checks** to catch data anomalies before they reach analysts
With this listing, all of that is handled for you. Your team can focus on **analysis, not infrastructure**.
**Estimated build cost to replicate:** 2–3 senior data engineers × 6+ months, plus ongoing maintenance.
**Cost with this listing:** $565/month — less than a single day of engineering time.
---
## Use Cases
### Population Health Analytics
Track disease prevalence, mortality trends, and health disparities across geographies. Combine CDC surveillance data with your clinical or claims datasets to identify at-risk populations and measure intervention outcomes.
### Life Sciences & Pharma
Access real-world evidence from CDC surveillance systems including FAERS-adjacent drug safety signals, vaccination coverage by demographics, and chronic disease burden data. Support regulatory submissions, pharmacovigilance, and market research with continuously updated public health data.
### Health Plans & Payers
Benchmark population health metrics across service areas using CDC PLACES data (500+ health indicators at county and census tract level). Identify high-cost chronic disease concentrations and design targeted care management programs.
### Government & Policy Research
Analyze NNDSS notifiable disease trends, NCHS vital statistics, and environmental health indicators for evidence-based policy development. Access standardized datasets spanning a decade of public health reporting.
---
## Featured Datasets
- **ABCs (Active Bacterial Core surveillance)** — Invasive bacterial disease incidence across surveillance sites
- **NNDSS Weekly Tables** — Nationally notifiable disease case counts by jurisdiction
- **BRFSS (Behavioral Risk Factor Surveillance System)** — Survey of health-related risk behaviors and chronic conditions
- **500 Cities / PLACES** — Local health estimates for counties and census tracts
- **NCHS Vital Statistics** — Birth, death, and mortality data from national vital records
- **COVID-19 Case Surveillance** — Detailed case-level COVID-19 public use data
- **Vaccination Coverage (NIS)** — National Immunization Survey data by demographics
- **CDC WONDER Mortality** — Detailed mortality statistics by cause, demographics, and geography
- **Chronic Disease Indicators** — 200+ indicators across 35+ chronic disease topics
- **Environmental Health Tracking** — Environmental hazards and health outcome indicators
---
## Data Quality & Updates
- **Update frequency:** Daily automated pipeline
- **Processing time:** Within 12 hours of CDC publication
- **Historical coverage:** January 2015 – present
- **Quality checks:** Row count validation, schema drift detection, null rate monitoring
- **Lineage:** Every record includes a `BATCH_ID` column linking to source dataset metadata
- **Schema management:** Automatic detection and handling of CDC schema changes
- **Monitoring:** 24/7 pipeline health monitoring with automated alerting
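
To make the three quality checks named above concrete, here is a minimal Python sketch of how row count validation, schema drift detection, and null rate monitoring could work on small in-memory data. It is an illustration only, not the provider's pipeline: the function names, the 10% drop tolerance, and the sample values are all hypothetical.

```python
from typing import Any


def row_count_ok(previous: int, current: int, max_drop: float = 0.10) -> bool:
    """Pass unless the row count fell by more than `max_drop` (a fraction).

    The 10% default is an assumed tolerance, not a documented threshold.
    """
    if previous == 0:
        return True
    return current >= previous * (1 - max_drop)


def schema_drift(old: dict[str, str], new: dict[str, str]) -> dict[str, list[str]]:
    """Compare two {column: type} mappings and report what changed."""
    shared = set(old) & set(new)
    return {
        "added": sorted(set(new) - set(old)),
        "removed": sorted(set(old) - set(new)),
        "retyped": sorted(c for c in shared if old[c] != new[c]),
    }


def null_rates(rows: list[dict[str, Any]]) -> dict[str, float]:
    """Fraction of rows in which each column is missing or None."""
    if not rows:
        return {}
    columns = sorted({c for row in rows for c in row})
    return {c: sum(row.get(c) is None for row in rows) / len(rows) for c in columns}


if __name__ == "__main__":
    # Hypothetical column mappings for two successive batches of one dataset.
    before = {"state": "string", "cases": "int"}
    after = {"state": "string", "cases": "bigint", "week": "int"}
    print(row_count_ok(previous=1000, current=950))
    print(schema_drift(before, after))
    print(null_rates([{"state": "GA", "cases": None}, {"state": None, "cases": 3}]))
```

Each check is a pure function, so the same logic could run against per-batch summaries (row counts, column listings, sampled rows) without depending on any particular compute engine.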
---
## Getting Started
### Free Trial
Request access to start a **14-day free trial** with full access to all 1,300+ datasets. Explore the data, run queries, and validate the product fits your use case — no commitment required.
### Full Access
After the trial, continue with a monthly subscription at **$565/month**. Includes:
- All current and future CDC datasets
- Daily automated updates
- Full historical backfill
- Schema and lineage metadata
- Email support from the Dataplex data engineering team
---
## Example Queries
- **Count datasets by topic schema:** Query `information_schema.tables`, filter to schemas matching `cdc_dwv_%`, and group by `table_schema`
- **Explore vaccination data:** Select from tables in the `cdc_dwv_vacc_data` schema
- **Search by topic:** Query `cdc_dwv.datasets` and filter `description` by keyword (e.g., mortality, diabetes, vaccination)
- **Latest batch data:** Join any data table to `cdc_dwv.datasets_batches` on `batch_id` and filter to `is_latest_batch = TRUE`
See the included sample notebook for full working query examples.
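
The query descriptions above can be sketched as SQL strings built in Python, for example to pass to `spark.sql(...)` in a Databricks notebook. This is a minimal sketch under stated assumptions, not the provider's sample notebook. The helper names are hypothetical, `some_table` is a placeholder rather than a real table name, and it assumes that `is_latest_batch` lives on `cdc_dwv.datasets_batches` and that every data table carries a `batch_id` column. Verify the column names against the data catalog before relying on it.

```python
import re


def count_tables_by_schema_sql(schema_pattern: str = "cdc_dwv_%") -> str:
    """Count tables per topic schema via information_schema.tables."""
    return (
        "SELECT table_schema, COUNT(*) AS table_count\n"
        "FROM information_schema.tables\n"
        f"WHERE table_schema LIKE '{schema_pattern}'\n"
        "GROUP BY table_schema\n"
        "ORDER BY table_count DESC"
    )


def search_datasets_sql(keyword: str) -> str:
    """Find datasets whose description mentions a keyword (case-insensitive)."""
    # Reject anything but simple words so the keyword cannot break out of
    # the SQL string literal it is embedded in.
    if not re.fullmatch(r"[A-Za-z0-9 \-]+", keyword):
        raise ValueError("keyword must contain only letters, digits, spaces, or hyphens")
    return (
        "SELECT *\n"
        "FROM cdc_dwv.datasets\n"
        f"WHERE LOWER(description) LIKE '%{keyword.lower()}%'"
    )


def latest_batch_sql(data_table: str) -> str:
    """Restrict a data table to rows belonging to its latest batch."""
    # Assumption: is_latest_batch is a column of cdc_dwv.datasets_batches.
    return (
        "SELECT d.*\n"
        f"FROM {data_table} AS d\n"
        "JOIN cdc_dwv.datasets_batches AS b\n"
        "  ON d.batch_id = b.batch_id\n"
        "WHERE b.is_latest_batch = TRUE"
    )


if __name__ == "__main__":
    print(count_tables_by_schema_sql())
    print(search_datasets_sql("diabetes"))
    # Placeholder table name; substitute a real table from the schema.
    print(latest_batch_sql("cdc_dwv_vacc_data.some_table"))
```

In a Databricks notebook, where a `spark` session is predefined, each string could be run with `spark.sql(count_tables_by_schema_sql())` and the result displayed as a DataFrame.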
---
## Documentation & Support
- **Data Catalog:** [docs.dataplex-consulting.com/data-catalog/cdc-open-data-product](https://docs.dataplex-consulting.com/data-catalog/cdc-open-data-product)
- **Support:** support@dataplex-consulting.com
- **Provider:** [Dataplex Consulting & Data Products](https://www.dataplex-consulting.com)
> **[Start Free Trial](https://trial.dataplex-consulting.com)** | **[Purchase Full Access](https://checkout.dataplex-consulting.com/b/14A6oAcXFfOw13y9y4bQY01)**
Provider:
Dataplex Consulting & Data Products



