five

MHDR: Annotated Corpus for Medicinal Herb-Disease Relationships in Biomedical Articles with a Focus on Traditional Medicine.

收藏
DataCite Commons2026-05-18 更新2025-09-08 收录
下载链接:
https://figshare.com/articles/dataset/MHDR_Annotated_Corpus_for_Medicinal_Herb-Disease_Relationships_in_Biomedical_Articles_with_a_Focus_on_Traditional_Medicine_/29555549/1
下载链接
链接失效反馈
官方服务:
资源简介:
Traditional medicine (TM) has long relied on medicinal herbs (MHs) to prevent and treat various ailments. Recently, efforts to integrate MHs with modern biomedical research have gained momentum, aiming to validate traditional knowledge and identify new therapeutic agents. However, the surge in biomedical literature presents challenges in extracting meaningful relationships, such as those between MHs and diseases. While automated methods like Natural Language Processing (NLP) and machine learning (ML) offer solutions, they often struggle with ambiguity, inconsistent terminology, and implicit meanings in texts. To address these limitations, the development of high-quality, expert-annotated corpora is essential. This study introduces the Medicinal Herb-Disease Relationships (MHDR) corpus, specifically designed to capture MH-disease relationships within the context of TM integrated with modern research. Constructed from 800 PubMed abstracts, the MHDR corpus emphasizes pharmacognostic herb names, which reflect specific plant parts and processing methods—key to accurate and fine-grained entity recognition. The corpus contains 5,119 MH mentions, 6,621 disease mentions, and 1,314 annotated relationships, extracted from 832 key sentences summarizing study findings. To evaluate its utility, baseline performance tests using Transformer-based models were conducted. With its comprehensive coverage and high-quality manual annotations, the MHDR corpus serves as a foundational resource for advancing biomedical NLP, supporting more accurate knowledge extraction, and promoting deeper exploration of MHs’ therapeutic potential through computational approaches.
提供机构:
figshare
创建时间:
2025-07-18
二维码
社区交流群
二维码
科研交流群
商业服务