five

PhenoMiner database

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/records/12493
下载链接
链接失效反馈
官方服务:
资源简介:
Phenotypes play a key role in inferring the complex relationships between genes and human heritable diseases. PhenoMiner is a research project aimed at the capture and encoding of phenotypes in the scientific literature. This should provide insights into the complex processes involved in human diseases as well as enabling semantic interoperability with existing biomedical ontologies such as those that describe human anatomy, genetics and behaviours. The PhenoMiner database contains the results of an FP7 Marie Curie fellowship project on text/data-mining technology - natural language processing, machine learning and conceptual analysis. It builds on insights gained from semantic parsing to extract structured information about phenotypes from whole sentences - in contrast to existing techniques which often apply string matching. The system exploits the wealth of scientific data locked within the scientific literature in databases such as PubMed Central and Europe PMC to extract the semantic vocabulary of phenotypes that scientists use. The system will provide scientists, clinicians and informaticians with the data and tools they need to gain new insights into Mendelian diseases. The database currently contains over 4800 phenotype terms automatically mined from full scientific articles and then associated to Online Mendelian Inheritance of Man (OMIM) disorders. All data is provided without manual filtering. Please contact the author for further information and comments/suggestions. - Nigel Collier (collier@ebi.ac.uk)
创建时间:
2020-01-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作