five

Mapping between Human Phenotype Ontology and phecode terminologies

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.7272%252FQ6H70D20
下载链接
链接失效反馈
官方服务:
资源简介:
This biomedical data repository contains a mapping between the Human Phenotype Ontology (HPO) and phecodes which are curated groupings of International Classification of Diseases (ICD) codes. These data correspond to the mappings published and evaluated in the JAMIA Open manuscript, "Linking rare and common disease vocabularies by mapping between the Human Phenotype Ontology and phecodes." This mapping was created using a variety of data sources and methods, including text matching, the National Library of Medicine's Unified Medical Language System, Wikipedia, SORTA, and PheMap. The mapping includes 38,950 links, and the files allow users to tailor the HPO-phecode links via a variety of filters for diverse applications across the spectrum of monogenic to polygenic diseases. Other intermediate files and the most up-to-date mappings can be found at the "phecode-HPO-map" GitHub repository (https://github.com/emcarthur/phecode-HPO-map/). Methods The map between phecodes and HPO terms is constructed using multiple types of evidence including string or sub-string match, UMLS, SORTA, WikiMedMap and PheMap. The HPO terms used are from the 2022-04-14 release and phecodes are from version 1.2 available in the PheWAS catalog (https://phewascatalog.org/). Mappings are also replicated with Phecode X, which has increased granularity and coverage of terms related to pregnancy, congenital anomalies, and neonatology. Further technical details of the integration process can be found in the manuscript methods. The code used the process, plot, and create the data are in the "phecode-HPO-map" github repository (https://github.com/emcarthur/phecode-HPO-map/).
创建时间:
2023-01-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作