CleverThis/uniprotkb_reviewed_archea_environmental_samples_48510_0-v1
收藏Hugging Face2025-12-29 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/CleverThis/uniprotkb_reviewed_archea_environmental_samples_48510_0-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含功能注释的综合蛋白质知识库,来源于UniProt数据库的uniprotkb_reviewed_archea_environmental_samples_48510_0部分。数据集原本为RDF格式,已转换为HuggingFace数据集格式以便于机器学习流程中使用。数据集大小为0.392 GB(解压后),包含约90M蛋白质条目和约3.4B三元组。推荐用于蛋白质研究、分子生物学和功能基因组学。数据集质量高,包含Swiss-Prot条目的手动管理,每8周更新一次。
This dataset is a comprehensive protein knowledgebase with functional annotations, sourced from the uniprotkb_reviewed_archea_environmental_samples_48510_0 portion of the UniProt database. The dataset was originally in RDF format and has been converted to HuggingFace dataset format for easy use in machine learning pipelines. The dataset size is 0.392 GB (extracted), containing approximately 90M protein entries and ~3.4B triples. It is recommended for protein research, molecular biology, and functional genomics. The dataset is of high quality with manual curation for Swiss-Prot entries and is updated every 8 weeks.
提供机构:
CleverThis



