Manually curated evaluation dataset and HBDB snapshot without full text
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14958796
下载链接
链接失效反馈官方服务:
资源简介:
The HBDB snapshot can be imported directly. However, please note that the sentences table has been removed due to the terms of Elsevier's Text and Data Mining (TDM) service.
The 'eval_dataset' is a manually curated dataset that includes various terms associated with four chemicals. The folder structure is organized into four layers as follows:
Term A: The target volatile organic compound (VOC), such as acetone.
Category of Term B: This could be a classification like chemical or molecular function.
Reference ID in HBDB: Please refer to the HBDB snapshot for the URL (DOI or URL) and PubMed ID (PMID). For example, Reference ID 15878 corresponds to PubMed ID 21871718 or this link.
JSON File Attributes:
term_A: The target VOC.
term_B: Related term.
context: Truncated sentences limited to 200 characters due to the terms of Elsevier's TDM service. Please refer to the original paper for the complete text.
category: Category of term B, matching the second layer.
score: Relationship score.
verified: Indicates manual curation, done twice.
table: The corresponding table in the HBDB database snapshot.
compound_id: Compound ID in HBDB (e.g., 28 for acetone).
reference_id: Reference ID, corresponding to the third layer.
paragraph: Section containing the extracted sentences.
创建时间:
2025-03-03



