five

Linking diseases and phenotypes for clinical usage

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4726713
下载链接
链接失效反馈
官方服务:
资源简介:
Background: In recent years a large volume of clinical genomics data has become available due to rapid advances in sequencing technologies. Efficient exploitation of genomics data requires linkage to patient phenotype profiles. Current resources,   providing disease-phenotype associations, are not comprehensive, and   they often do not cover all of the disease terminologies, particularly ICD-10,  which is still the primary terminology used in clinical settings.    Methods:   We used a text-mining method that utilizes semantic relations in  phenotype ontologies, and applied statistical methods to extract associations between diseases in ICD-10 and phenotype ontology classes from the literature. In addition, we present a semi-automatic way to curate ICD-10:phenotype associations from existing resources containing known associations.   Results:   We created two datasets linking diseases to their phenotypes based on each strategy. We extensively validated these datasets based on comparison to  gold standard manually created disease--phenotype associations, based on their similarity to disease--phenotype associations found in databases, and based on how well they can be used to recover gene--disease   associations using phenotype similarity. We find that our text mining method can produce phenotypes that are useful but often too   specific or too general. However, the dataset generated from integrating multiple knowledgebases is more granular
创建时间:
2021-04-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作