CleverThis/uniprotkb_reviewed_archea_nanobdellati_1783276_0-v1
收藏Hugging Face2025-12-30 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/CleverThis/uniprotkb_reviewed_archea_nanobdellati_1783276_0-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个全面的蛋白质知识库,包含功能注释。数据集中的RDF三元组已转换为HuggingFace数据集格式,便于在机器学习流程中使用。数据集包含约9000万个蛋白质条目和约34亿个三元组,原始格式为RDF,转换后大小为0.392 GB(解压后)。数据集适用于蛋白质研究、分子生物学和功能基因组学。高质量的手工注释适用于Swiss-Prot条目,每8周更新一次。
Comprehensive protein knowledgebase with functional annotations. This dataset contains RDF triples from uniprotkb_reviewed_archea_nanobdellati_1783276_0 converted to HuggingFace dataset format for easy use in machine learning pipelines. It includes ~90M protein entries and ~3.4B triples, originally in RDF format, converted size is 0.392 GB (extracted). Recommended for protein research, molecular biology, functional genomics. High quality with manual curation for Swiss-Prot entries. Updated every 8 weeks.
提供机构:
CleverThis



