five

Synthetic dataset for end-to-end Relation Extraction of relationships between Organisms and Natural-Products with Mixtral-8x7B-Instruct-v0.1

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10849377
下载链接
链接失效反馈
官方服务:
资源简介:
A new synthetic dataset (training/validation) for end-to-end Relation Extraction of relationships between Organisms and Natural-Products. The new dataset was generated using Mixtral-8x7B-Instruct-v0.1. Like the model, the produced synthetic data are also submitted to the License of the model used for generation (apache-2.0). The new dataset was created based on the top-1000 (per biological kingdom) LOTUS literature references extracted with the GME-sampler. The dataset contains 8,913 items in the training set and 344 items in the validation set. The dataset was generated using the same protocol as described in the article.
创建时间:
2024-04-24
二维码
社区交流群
二维码
科研交流群
商业服务