National Health Surveys
Databricks, indexed 2024-05-09
Download link:
https://marketplace.databricks.com/details/bedfc260-7d67-47e4-9363-31e547c1d64c/John-Snow-Labs_National-Health-Surveys
Resource description:
**Overview**
This data package contains the National Health Surveys. The National Health Interview Survey (NHIS), one of the major data collection programs of the National Center for Health Statistics of the Centers for Disease Control and Prevention, compiles information on the amount, distribution, and effects of disease and disability in the US.
**Description**
This data package contains information from the Cardiovascular Health Indicators, Heart Disease Surveillance System, and Notifiable Disease Surveillance datasets.
The National Health Interview Survey (NHIS) has monitored the health of the nation since 1957. NHIS data on a broad range of health topics are collected through personal household interviews. The data in this dataset were computed by personnel in CDC's Division for Heart Disease and Stroke Prevention (DHDSP). It is one of the datasets provided by the National Cardiovascular Disease Surveillance System.
The Behavioral Risk Factor Surveillance System (BRFSS) is a continuous, state-based surveillance system that collects information about modifiable risk factors for chronic diseases and other leading causes of death. Indicators from this data source were computed by personnel in CDC's Division for Heart Disease and Stroke Prevention (DHDSP).
It also contains a summary of the National Health and Nutrition Examination Survey (NHANES), a program of the National Center for Health Statistics of the Centers for Disease Control and Prevention. This survey assesses the health and nutrition of the American population through a combination of interviews and physical examinations. Data are collected across different populations and health topics from a representative sample of around 5,000 people each year, drawn from counties across the country, 15 of which are visited each year.
It also includes Notifiable Disease Surveillance 1 and 2, drawn from the Project Tycho database, whose purpose is to make large-scale public health data available. The database includes digitized historical and current reports of the National Notifiable Disease Surveillance System of the United States from 1888 to 2013, covering cases or deaths from notifiable diseases by disease, location, and epidemiological week.
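
To make the "by disease, location and epidemiological week" layout concrete, here is a minimal Python sketch of how such weekly counts might be totaled. The field names (`disease`, `state`, `epi_week`, `event`, `number`) and all values are hypothetical placeholders chosen for illustration; they are not taken from the actual Notifiable Disease Surveillance tables.

```python
from collections import defaultdict

# Hypothetical records: field names and counts are made up for illustration
# and do not come from the real Project Tycho data.
records = [
    {"disease": "MEASLES", "state": "NY", "epi_week": 192801, "event": "CASES", "number": 120},
    {"disease": "MEASLES", "state": "NY", "epi_week": 192801, "event": "DEATHS", "number": 3},
    {"disease": "MEASLES", "state": "NY", "epi_week": 192802, "event": "CASES", "number": 95},
    {"disease": "MEASLES", "state": "PA", "epi_week": 192801, "event": "CASES", "number": 40},
]


def weekly_totals(rows, event="CASES"):
    """Sum counts per (disease, state, epi_week) for one event type."""
    totals = defaultdict(int)
    for row in rows:
        if row["event"] == event:
            key = (row["disease"], row["state"], row["epi_week"])
            totals[key] += row["number"]
    return dict(totals)


case_totals = weekly_totals(records)
death_totals = weekly_totals(records, event="DEATHS")
```

Keeping cases and deaths as separate event types avoids adding the two together by accident when totaling a week.
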
**Benefits**
- Useful to researchers, public health specialists, and specialized institutions for analyzing the prevalence of diseases and health risk factors.
- Useful for adapting or adjusting current public health interventions and health policies.
- Useful for distributing resources for preventive and curative healthcare services.
**License Information**
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use, please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on the John Snow Labs website. The subscription allows you to use all John Snow Labs datasets and data packages for commercial purposes.
**Included Datasets**
- [Cardiovascular Health Indicators Household Survey](https://www.johnsnowlabs.com/marketplace/cardiovascular-health-indicators-household-survey)
  - This dataset contains a summary of the Cardiovascular Health Indicators Household Survey provided by the National Health Interview Survey (NHIS). The NHIS, one of the major data collection programs of the National Center for Health Statistics of the Centers for Disease Control and Prevention, compiles information on the amount, distribution, and effects of disease and disability in the US.
- [Heart Disease Surveillance System](https://www.johnsnowlabs.com/marketplace/heart-disease-surveillance-system)
  - This dataset shows the heart disease surveillance system survey. The Behavioral Risk Factor Surveillance System (BRFSS) conducts telephone surveys on health-related risk behaviors, chronic diseases, and the use of preventive services in the US. The data cover all 50 states, the District of Columbia, and three US territories, with responses from more than 400,000 adults.
- [Medicare Heart Disease and Stroke Prevention Claims Data](https://www.johnsnowlabs.com/marketplace/medicare-heart-disease-and-stroke-prevention-claims-data)
  - This dataset shows the Medicare Heart Disease and Stroke Prevention Claims Data, based on Centers for Medicare & Medicaid Services (CMS) Medicare claims data from 2004 onward. CMS compiles claims data for Medicare and Medicaid patients across a variety of categories and years. This includes Inpatient and Outpatient claims, Master Beneficiary Summary Files, and many other files.
- [National Health Surveillance System](https://www.johnsnowlabs.com/marketplace/national-health-surveillance-system)
  - This dataset contains a summary of the National Health and Nutrition Examination Survey (NHANES), a program of the National Center for Health Statistics of the Centers for Disease Control and Prevention. This survey assesses the health and nutrition of the American population through a combination of interviews and physical examinations.
- [Notifiable Disease Surveillance 1](https://www.johnsnowlabs.com/marketplace/notifiable-disease-surveillance-1)
  - This dataset contains the Level 1 data of the Project Tycho database. It includes city- and state-level counts of 50 communicable diseases since 1888, including smallpox, polio, measles, mumps, rubella, hepatitis A, whooping cough, and diphtheria. It covers 126 years of disease reports.
- [Notifiable Disease Surveillance 2](https://www.johnsnowlabs.com/marketplace/notifiable-disease-surveillance-2)
  - This dataset contains the Level 2 data of the Project Tycho database. These data include tables of disease counts reported by US states and cities to the Federal Health Authorities every week since 1888. The purpose of the Project Tycho database is to make available large-scale public health data.
**Data Engineering Overview**
**We deliver high-quality data**
- Each dataset goes through 3 levels of quality review
  - 2 manual reviews are done by domain experts
  - Then, an automated set of 60+ validations ensures that every datum matches the metadata and defined constraints
- Data is normalized into one unified type system
  - All dates, units, codes, and currencies look the same
  - All null values are normalized to the same value
  - All dataset and field names are SQL and Hive compliant
- Data and Metadata
  - Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
  - Metadata is provided in the open Frictionless Data standard, and every field in it is normalized & validated
- Data Updates
  - Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
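
As a rough illustration of how a consumer might rely on these guarantees, the following Python sketch reads a CSV sample against a Frictionless-style Table Schema, checks field names, and maps missing values to `None`. The schema contents, the sample rows, the `NAME_PATTERN` rule, and the helper names are all assumptions made for this example; they are not John Snow Labs' actual metadata or validation logic.

```python
import csv
import io
import re

# Assumption: "SQL and Hive compliant" is approximated as a name that starts
# with a letter and contains only letters, digits, and underscores.
NAME_PATTERN = re.compile(r"^[A-Za-z][A-Za-z0-9_]*$")

# Hypothetical schema in the shape of a Frictionless Table Schema
# ("fields" with "name"/"type", plus "missingValues").
schema = {
    "fields": [
        {"name": "State", "type": "string"},
        {"name": "Year", "type": "integer"},
        {"name": "Data_Value", "type": "number"},
    ],
    "missingValues": [""],
}

# Made-up sample data; the second row has a missing Data_Value.
sample_csv = "State,Year,Data_Value\nNY,2015,12.5\nPA,2015,\n"

# Only the three schema types used above are handled in this sketch.
CASTS = {"string": str, "integer": int, "number": float}


def invalid_names(table_schema):
    """Return field names that do not match the assumed naming rule."""
    return [f["name"] for f in table_schema["fields"]
            if not NAME_PATTERN.match(f["name"])]


def load_rows(text, table_schema):
    """Parse CSV text, casting each field and mapping missing values to None."""
    missing = set(table_schema.get("missingValues", [""]))
    casts = {f["name"]: CASTS[f["type"]] for f in table_schema["fields"]}
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        rows.append({
            name: None if raw[name] in missing else cast(raw[name])
            for name, cast in casts.items()
        })
    return rows


rows = load_rows(sample_csv, schema)
bad_names = invalid_names(schema)
```

Because null values are described as normalized to a single value, a single `missingValues` entry is enough here; a source with mixed null markers would need a longer list.
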
**Our data is curated and enriched by domain experts**
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment are supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the datasets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
**Need Help?**
If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
Provider:
John Snow Labs



