An annotated dataset for gene-melanoma relation extraction from scientific literature
收藏Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/745bpf597f
下载链接
链接失效反馈官方服务:
资源简介:
Melanoma is the least common but the deadliest of skin cancers. This cancer begins when the genes of a cell suffer damage or fail, and identifying the genes involved in melanoma is crucial for understanding the melanoma tumorigenesis. To date, machine learning for gene-melanoma relation extraction from text has been limited by the lack of annotated resources. To overcome this problem, we have exploited the information of the Melanoma Gene Database (a manually curated database of human melanoma related genes) to build an annotated dataset of binary relations between genes and melanoma entities mentioned in PubMed abstracts. The exploitability of the dataset was tested with both traditional machine learning, and neural network-based models. These models are then used to automatically extract gene-melanoma relations from the biomedical literature. Researchers can use the annotated dataset to develop and compare their own models. Moreover, the relations extracted from the literature can be integrated with existing structured knowledge to facilitate researchers in their data search.
创建时间:
2022-08-30



