Protein-FN
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/Protein-FN/Protein-FN
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个精心挑选的蛋白质功能数据集,提供了超过9000个带有意义标签的蛋白质数据,其中包括一维氨基酸序列、三维蛋白质结构以及功能特性。该数据集将蛋白质分为六类:蛋白酶、激酶、受体、碳酸酐酶、磷酸酶和异构酶。规模上,数据集包含了9014个蛋白质数据,分为7211个训练样本和1803个测试样本。其任务包括蛋白质功能预测、基序识别和发现。
This is a carefully curated protein function dataset that provides over 9000 protein entries with meaningful annotations, including 1-dimensional amino acid sequences, 3-dimensional protein structures and functional characteristics. The dataset categorizes proteins into six classes: proteases, kinases, receptors, carbonic anhydrases, phosphatases, and isomerases. In terms of scale, the dataset contains 9014 protein samples, which are split into 7211 training samples and 1803 test samples. The supported tasks include protein function prediction, motif recognition and discovery.
提供机构:
Hugging Face Datasets



