CleverThis/uniprotkb_reviewed_archea_promethearchaeati_1935183_0-v1
收藏Hugging Face2026-01-02 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/CleverThis/uniprotkb_reviewed_archea_promethearchaeati_1935183_0-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个全面的蛋白质知识库,包含功能注释。它包含了从uniprotkb_reviewed_archea_promethearchaeati_1935183_0转换而来的RDF三元组,格式为HuggingFace数据集,便于在机器学习流程中使用。数据集大小为0.392 GB(解压后),包含约9000万个蛋白质条目和约34亿个三元组。原始许可为CC BY 4.0。推荐用于蛋白质研究、分子生物学和功能基因组学。数据集采用标准无损格式表示RDF数据,保留了原始RDF知识图谱的所有语义信息,支持完美往返转换。
This dataset is a comprehensive protein knowledgebase with functional annotations. It contains RDF triples from uniprotkb_reviewed_archea_promethearchaeati_1935183_0 converted to HuggingFace dataset format for easy use in machine learning pipelines. The dataset size is 0.392 GB (extracted), with ~90M protein entries and ~3.4B triples. The original license is CC BY 4.0. It is recommended for protein research, molecular biology, and functional genomics. The dataset uses a standard lossless format for representing RDF data, preserving all semantic information from the original RDF knowledge graph, enabling perfect round-trip conversion between RDF and HuggingFace formats.
提供机构:
CleverThis



