Synthyra/BIOGRID-MV-5.0.253
Source: Hugging Face | Updated: 2026-01-19 | Indexed: 2026-02-07
Download link:
https://hf-mirror.com/datasets/Synthyra/BIOGRID-MV-5.0.253
Description:
This dataset is generated from BioGRID MV 5.0.253 and is intended for research on protein, genetic, and chemical interactions in the biomedical field. Each row has six fields: A and B (the two interaction partners), SeqA and SeqB (their sequences), and OrgA and OrgB (the organisms they belong to). The processing steps are: download and unzip the BioGRID data, stream the BioGRID tab3 files, normalize and resolve UniProtKB primary IDs and sequences, generate rows, and upload them to the Hugging Face Hub. The dataset is built from the Multi-Validated (MV) version of BioGRID; the selection criteria are briefly described in the README.
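The streaming and row-generation steps described above can be sketched roughly as follows. This is a minimal illustration, not the provider's actual pipeline: the tab3 column headers, the rule of taking the first accession from a "|"-separated field, the `first_accession` and `iter_rows` helpers, and the in-memory `DEMO_SEQUENCES` lookup are all assumptions, and the accessions and sequences in the sample are made-up placeholders.

```python
import csv
import io
from typing import Dict, Iterable, Iterator, Optional

# Assumed tab3 column headers; verify them against the real BioGRID file.
COL_ACC_A = "SWISS-PROT Accessions Interactor A"
COL_ACC_B = "SWISS-PROT Accessions Interactor B"
COL_ORG_A = "Organism Name Interactor A"
COL_ORG_B = "Organism Name Interactor B"


def first_accession(field: str) -> Optional[str]:
    """Return the first usable accession in a '|'-separated field, else None."""
    for token in field.split("|"):
        token = token.strip()
        if token and token != "-":
            return token
    return None


def iter_rows(lines: Iterable[str], sequences: Dict[str, str]) -> Iterator[Dict[str, str]]:
    """Stream tab-separated records and yield one output row per resolvable pair."""
    reader = csv.DictReader(lines, delimiter="\t")
    for record in reader:
        a = first_accession(record.get(COL_ACC_A) or "")
        b = first_accession(record.get(COL_ACC_B) or "")
        # Skip pairs where either partner has no accession or no known sequence.
        if a is None or b is None or a not in sequences or b not in sequences:
            continue
        yield {
            "A": a,
            "B": b,
            "SeqA": sequences[a],
            "SeqB": sequences[b],
            "OrgA": record.get(COL_ORG_A) or "",
            "OrgB": record.get(COL_ORG_B) or "",
        }


# Made-up placeholder data, for illustration only.
DEMO_SEQUENCES = {"X00001": "MKTAYIAK", "X00002": "MSLLTEVE"}
SAMPLE_TAB3 = "\n".join(
    "\t".join(cells)
    for cells in [
        [COL_ACC_A, COL_ACC_B, COL_ORG_A, COL_ORG_B],
        ["X00001|X00009", "X00002", "Organism one", "Organism two"],
        ["-", "X00002", "Organism one", "Organism two"],       # no accession for A
        ["X00003", "X00002", "Organism one", "Organism two"],  # no sequence for A
    ]
) + "\n"

rows = list(iter_rows(io.StringIO(SAMPLE_TAB3), DEMO_SEQUENCES))
for row in rows:
    print(row)
```

Because `iter_rows` is a generator over an iterable of lines, it can be fed an open file handle instead of the in-memory sample, so a large tab3 file need not be loaded at once. In a real pipeline the sequence lookup would come from a UniProtKB resolution step rather than a hard-coded dictionary.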
Provider:
Synthyra



