five

Drug Targets and Drug Lists

收藏
Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/b19af1ee-ff38-408a-b8bd-e1ef4c867f78/John-Snow-Labs_Drug-Targets-and-Drug-Lists
下载链接
链接失效反馈
官方服务:
资源简介:
**Overview** This data package contains information on approved, researched and proven drug targets and drug lists. **Description** This data package contains datasets on a selection of both approved and research drug targets and a selection of attributed target lists extracted from literature "Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds"; a selection of proven drug target lists extracted from the literature, "Novelty in the target landscape of the pharmaceutical industry" as a supplementary data; and a selection of The Therapeutic Target Database protein IDs for successful targets. This data package also contains the dataset Druggable Genome Comprehensive Drug Targets is a selection of supplementary data from "The Druggable Genome: Evaluation of Drug Targets in Clinical Trials Suggests Major Shifts in Molecular Class and Indication" ; dataset Protein Chemical Structure Comparison from Three Drug Databases is a selection of a 3-way consensus list from the paper "Comparing the Chemical Structure and Protein Content of ChEMBL, DrugBank, Human Metabolome Database and the Therapeutic Target Database". **Benefits** - This data package can be useful for further drug research and for updating both the hypothetical and successful human drug target statistics and research for drug discovery to expand towards new therapeutic areas, new targets, broader cross-screening activities, repurposing, and polypharmacology. **License Information** The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes. **Included Datasets** - [Approved and Researched Drug Targets Human SwissProt Accessions](https://www.johnsnowlabs.com/marketplace/approved-and-researched-drug-targets-human-swissprot-accessions) - This dataset is a supplementary data from "Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds" (2011). In this case the Entrez Gene IDs were mapped to 1651 human Swiss-Prot accessions but this includes both approved and research targets. - [ChEMBL Approved Drug Targets Human Swiss-Prot Accessions](https://www.johnsnowlabs.com/marketplace/chembl-approved-drug-targets-human-swiss-prot-accessions) - ChEMBL dataset, released on 17 August 2013, includes a download option for approved drug targets. This converted to 251 human Swiss-Prot accessions but note this does not encompass additional protein IDs from target groups. - [Druggable Genome Comprehensive Drug Targets](https://www.johnsnowlabs.com/marketplace/druggable-genome-comprehensive-drug-targets) - This dataset Druggable Genome Comprehensive Drug Targets is a selection of supplementary data from "The Druggable Genome: Evaluation of Drug Targets in Clinical Trials Suggests Major Shifts in Molecular Class and Indication" (2013) [PMID:24016212]. The comprehensive list includes 461 targets of approved drugs. - [Protein Chemical Structure Comparison from Three Drug Databases](https://www.johnsnowlabs.com/marketplace/protein-chemical-structure-comparison-from-three-drug-databases) - This dataset Protein Chemical Structure Comparison from Three Drug Databases is a selection of a 3-way consensus list from the paper "Comparing the Chemical Structure and Protein Content of ChEMBL, DrugBank, Human Metabolome Database and the Therapeutic Target Database" (2013) [Abstract]. It includes 352 proteins-in-common between the three drug databases. - [Proven Drug Targets Converted to Human SwissProt Accessions](https://www.johnsnowlabs.com/marketplace/proven-drug-targets-converted-to-human-swissprot-accessions) - This dataset is a supplementary data from "Novelty in the target landscape of the pharmaceutical industry" (2013). The listing of proven drug targets is converted to 248 human Swiss-Prot accessions. - [The Therapeutic Drug Target Database Human SwissProt](https://www.johnsnowlabs.com/marketplace/the-therapeutic-drug-target-database-human-swissprot) - This dataset is a selection of The Therapeutic Target Database (release 4.3.02, 18th Oct 2013) protein IDs for successful targets. The web page states 388 but these reduced to 345 human Swiss-Prot accessions. **Data Engineering Overview** **We deliver high-quality data** - Each dataset goes through 3 levels of quality review - 2 Manual reviews are done by domain experts - Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints - Data is normalized into one unified type system - All dates, unites, codes, currencies look the same - All null values are normalized to the same value - All dataset and field names are SQL and Hive compliant - Data and Metadata - Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters - Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated - Data Updates - Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted **Our data is curated and enriched by domain experts** Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts: - Field names, descriptions, and normalized values are chosen by people who actually understand their meaning - Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset - Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations - The data is always kept up to date – even when the source requires manual effort to get updates - Support for data subscribers is provided directly by the domain experts who curated the data sets - Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution. **Need Help?** If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
提供机构:
John Snow Labs
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集整合了来自多篇文献和数据库的已批准及研究中的药物靶点信息,包括人类Swiss-Prot访问号等关键数据,经过严格的质量审查和专家验证,适用于药物研发和靶点统计分析。数据以CSV和Parquet格式提供,支持商业和研究用途。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作