AI4Protein/cloning_clf
收藏Hugging Face2025-11-19 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/AI4Protein/cloning_clf
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- text-classification
tags:
- chemistry
- biology
- medical
---
Dataset Summary
Protein structure determination includes a series of experimental stages to yield stable proteins for X-ray crystallography. Specifically, the proteins are first selected and expressed, then purified for crystal structure determination. Each step corresponds to a "stage tag" to denote whether the protein is stable under a certain stage.
Data Fields
seq: a string containing the protein sequence
label: a float value indicating the $k_cat$ score of the protein sequence.
Original Dataset Name: biomap-research/cloning_clf
Original Author / Organization: Biomap
Original URL: https://huggingface.co/datasets/biomap-research/cloning_clf
Original License: Apache License 2.0
No changes were made to the data except for the column name.
All credit and rights belong to the original authors.
---
许可证:apache-2.0
任务类别:
- 文本分类
标签:
- 化学
- 生物学
- 医学
---
数据集摘要
蛋白质结构测定包含一系列实验阶段,以获得用于X射线晶体学的稳定蛋白质。具体而言,首先选择并表达蛋白质,随后对其进行纯化以用于晶体结构测定。每个步骤对应一个"阶段标签(stage tag)",用于表示蛋白质在特定阶段是否稳定。
数据字段
seq:包含蛋白质序列的字符串
label:表示蛋白质序列$k_cat$评分的浮点数值。
原始数据集名称:biomap-research/cloning_clf
原始作者/机构:Biomap
原始URL:https://huggingface.co/datasets/biomap-research/cloning_clf
原始许可证:Apache License 2.0
除列名外,未对数据进行任何修改。
所有功劳和权利归原始作者所有。
提供机构:
AI4Protein



