International Clinical Trial
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/02e6ecc4-a78a-467c-97ad-c8c0c2468d46/John-Snow-Labs_International-Clinical-Trial
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
This data package contains datasets on clinical trials conducted worldwide. Currently, this data package contains datasets on the International Stroke Trial (IST) conducted on individual patients with acute stroke and respiratory syncytial virus (RSV) disease in early infants.
**Description**
This data package contains the International Stroke Trial database and the replication data for respiratory syncytial virus (RSV) disease. The International Stroke Trial (IST) is one of the largest randomized trials ever conducted on individual patients in acute stroke. The respiratory syncytial virus (RSV) disease database contains replication data for the absence of an association between cord specific antibody levels and severe respiratory syncytial virus (RSV) disease in early infants from a case-control study from coastal Kenya.
**Benefits**
- This data can be useful for further research as a source of primary data and available for public use for the conduct of secondary analyses. this data can also be useful in the planning of future trials and public health strategies.
**License Information**
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.
**Included Datasets**
- [International Stroke Trial Database](https://www.johnsnowlabs.com/marketplace/international-stroke-trial-database)
- The International Stroke Trial (IST) dataset includes data on 19,435 patients and 112 variables. For each randomized patient, data were extracted on the variables assessed at randomization, at the early outcome point, and at 6-months. This dataset provides a source of primary data and is available for public use for the conduct of secondary analyses and in the planning of future trials particularly in older patients and in resource-poor settings given the age distribution of the dataset.
- [Respiratory Syncytial Virus Disease](https://www.johnsnowlabs.com/marketplace/respiratory-syncytial-virus-disease)
- This dataset provides replication data for the absence of the association between cord specific antibody levels and severe respiratory syncytial virus (RSV) disease in early infants from a case-control study from coastal Kenya.
**Data Engineering Overview**
**We deliver high-quality data**
- Each dataset goes through 3 levels of quality review
- 2 Manual reviews are done by domain experts
- Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
- Data is normalized into one unified type system
- All dates, unites, codes, currencies look the same
- All null values are normalized to the same value
- All dataset and field names are SQL and Hive compliant
- Data and Metadata
- Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
- Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
- Data Updates
- Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
**Our data is curated and enriched by domain experts**
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the data sets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
**Need Help?**
If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
提供机构:
John Snow Labs



