Health and Disease Registries
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/40d7cd69-5898-4c12-8d48-2f566094a014/John-Snow-Labs_Health-and-Disease-Registries
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
This data package contains the information for health and disease registries which includes childhood cancer, organs and tissue donor and survival from childhood cancer.
**Description**
This data package shows the the number of registered cases of childhood cancer in the National Registry of Childhood Tumours. This data package also combines meaningful attestations from the Medicare EHR Incentive Program and certified EHR product data from the ONC Certified Health IT Product List (CHPL) to identify the unique vendors, products, and product types of each EHR used to attest to meaningful use. The National Hospital Discharge Survey (NHDS) collects information on patients discharged from hospitals in the United States. The registry is a confidential database that records a person’s consent to organ/tissue donation upon death.
**Benefits**
- The database is used by donation professionals to determine a person donation status at the time of their death.
**License Information**
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.
**Included Datasets**
- [Childhood Cancer Registry](https://www.johnsnowlabs.com/marketplace/childhood-cancer-registry)
- The number of registered cases of childhood cancer in the National Registry of Childhood Tumours (NRCT) is currently available for the five-year periods 1971-75, 1976-80, 1981-85, 1986-1990, 1991-1995, 1996-2000 and 2001-2005, and the two-year period 2006-2007. Within each period the number of cases is given for each of the major groups in the International Classification of Childhood Cancer, Third Edition, by sex and age group at diagnosis. The age groups are 0-4, 5-9 and 10-14 years.
- [Electronic Health Records Products 2013 to 2018](https://www.johnsnowlabs.com/marketplace/electronic-health-records-products-2013-to-2018)
- The Medicare Electronic Health Record Incentive Program provides financial incentives to eligible hospitals and ambulatory providers to adopt and meaningfully use certified electronic health record technology (CEHRT). Electronic Health Record (EHR) technology reported by participants is certified by the Office of the National Coordinator for Health Information Technology ONC. This dataset provides the CEHRT reported by providers when they annually attest to meaningful use.
- [Initial Diagnosis of Disease by Age](https://www.johnsnowlabs.com/marketplace/initial-diagnosis-of-disease-by-age)
- This dataset contains data from the National Hospital Discharge Survey (NHDS), which collects information on patients discharged from hospitals in the US. The data contained is by first-listed diagnosis and age of patients.
- [Medical Marijuana Statistics by County](https://www.johnsnowlabs.com/marketplace/medical-marijuana-statistics-by-county)
- This dataset contains the Medical Marijuana Registry Program Update by County as of January 31, 2014.
- [Organs and Tissue Donor Registry](https://www.johnsnowlabs.com/marketplace/organs-and-tissue-donor-registry)
- This dataset contains information regarding the number of enrollments by population in the New York State Donate Life Registry from 2008 to 2023. The database is used by donation professionals to determine a person donation status at the time of their death.
- [Survival from Childhood Cancer Registry](https://www.johnsnowlabs.com/marketplace/survival-from-childhood-cancer-registry)
- The dataset presents the percentage survival rates at five years after diagnosis for children in the National Registry of Childhood Tumours (NRCT) who were diagnosed in the five-year periods 1971-75, 1976-80, 1981-85, 1986-1990, 1991-1995, 1996-2000 and 2001-05. Data are given for most of the main groups in the International Classification of Childhood Cancer, Third Edition, and for the most important individual subtypes of childhood cancer.
**Data Engineering Overview**
**We deliver high-quality data**
- Each dataset goes through 3 levels of quality review
- 2 Manual reviews are done by domain experts
- Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
- Data is normalized into one unified type system
- All dates, unites, codes, currencies look the same
- All null values are normalized to the same value
- All dataset and field names are SQL and Hive compliant
- Data and Metadata
- Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
- Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
- Data Updates
- Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
**Our data is curated and enriched by domain experts**
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the data sets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
**Need Help?**
If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
提供机构:
John Snow Labs



