CleverThis/uniprotkb_obsolete_entries_260000000-v1
收藏Hugging Face2025-12-29 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/CleverThis/uniprotkb_obsolete_entries_260000000-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含来自uniprotkb_obsolete_entries_260000000的RDF三元组,已转换为HuggingFace数据集格式,便于在机器学习管道中使用。它是一个全面的蛋白质知识库,带有功能注释。数据集格式最初为RDF,转换为HuggingFace数据集后大小为0.392 GB(解压后),包含约9000万个蛋白质条目和约34亿个三元组。原始许可证为CC BY 4.0。推荐用于蛋白质研究、分子生物学和功能基因组学。数据集采用标准无损格式表示RDF数据,保留了所有语义信息,支持完美往返转换。
This dataset contains RDF triples from uniprotkb_obsolete_entries_260000000 converted to HuggingFace dataset format for easy use in machine learning pipelines. It is a comprehensive protein knowledgebase with functional annotations. The dataset was originally in RDF format, converted to HuggingFace Dataset with a size of 0.392 GB (extracted), containing ~90M protein entries and ~3.4B triples. The original license is CC BY 4.0. Recommended for protein research, molecular biology, and functional genomics. The dataset uses a standard lossless format for representing RDF data, preserving all semantic information and enabling perfect round-trip conversion.
提供机构:
CleverThis



