hugging-science/goa_reasoning
Source: Hugging Face. Updated 2025-10-15. Indexed 2025-10-18.
Download link:
https://hf-mirror.com/datasets/hugging-science/goa_reasoning
Description:
This dataset integrates Gene Ontology (GO) annotations, filtered to the experimental evidence code IDA (Inferred from Direct Assay), with bibliographic data from Europe PMC. Each record links a biological entity (gene or protein) to a GO term and to the title and abstract of the supporting scientific publication. The columns cover the source database and gene/protein identifiers, the relationship qualifier, the GO term and its descriptive name, the supporting reference, the evidence code, the ontology branch, entity metadata, the organism taxonomy ID, annotation metadata, and publication information. The dataset is suited to biological reasoning, text mining, and literature-grounded machine learning.
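The record structure described above can be sketched roughly as follows. This is only an illustration: the field names are hypothetical (the listing does not give the actual column names), and the sample values are placeholders, not real annotations. It shows how a record might be filtered by evidence code and turned into a text/label pair for literature-grounded training.

```python
from dataclasses import dataclass


# Hypothetical field names: the real column names are not given in this listing.
@dataclass
class AnnotationRecord:
    db: str             # source database
    db_object_id: str   # gene/protein identifier
    qualifier: str      # relationship qualifier
    go_id: str          # GO term
    go_name: str        # descriptive name of the GO term
    reference: str      # supporting reference
    evidence_code: str  # e.g. "IDA"
    aspect: str         # ontology branch
    taxon_id: str       # organism taxonomy ID
    title: str          # publication title
    abstract: str       # publication abstract


def keep_ida(records):
    """Keep only records whose evidence code is IDA."""
    return [r for r in records if r.evidence_code == "IDA"]


def to_example(rec):
    """Pair the publication text with the annotation it supports."""
    text = f"{rec.title}\n\n{rec.abstract}"
    label = f"{rec.db_object_id} {rec.qualifier} {rec.go_id} ({rec.go_name})"
    return {"text": text, "label": label}


# Placeholder values only, not taken from the dataset.
sample = AnnotationRecord(
    db="EXAMPLE_DB",
    db_object_id="X00000",
    qualifier="enables",
    go_id="GO:0000000",
    go_name="placeholder term",
    reference="PMID:0",
    evidence_code="IDA",
    aspect="F",
    taxon_id="taxon:0",
    title="Placeholder title",
    abstract="Placeholder abstract.",
)

examples = [to_example(r) for r in keep_ida([sample])]
```

Pairing the title and abstract with the annotation keeps each example traceable to the publication that supports it.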
Provider:
hugging-science



