tyang816/DeepSoluE_ESMFold
收藏Hugging Face2024-05-10 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/tyang816/DeepSoluE_ESMFold
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- text-classification
tags:
- protein
- downstream task
---
# DeepSoluE Dataset with ESMFold Structural Sequence
- Description: Solubility is a fundamental protein property that has important connotations for therapeutics and use in diagnosis.
- Number of labels: 2
- Problem Type: single_label_classification
- Columns:
- aa_seq: protein amino acid sequence
- foldseek_seq: foldseek 20 3di structural sequence
- ss8_seq: DSSP 8 secondary structure sequence
# Github
Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models
https://github.com/tyang816/SES-Adapter
# Citation
Please cite our work if you use our dataset.
```
@article{tan2024ses-adapter,
title={Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models},
author={Tan, Yang and Li, Mingchen and Zhou, Bingxin and Zhong, Bozitao and Zheng, Lirong and Tan, Pan and Zhou, Ziyi and Yu, Huiqun and Fan, Guisheng and Hong, Liang},
journal={arXiv preprint arXiv:2404.14850},
year={2024}
}
```
提供机构:
tyang816
原始信息汇总
DeepSoluE Dataset with ESMFold Structural Sequence
数据集概述
- 描述: 可溶性是蛋白质的基本属性,对治疗和诊断具有重要意义。
- 标签数量: 2
- 问题类型: 单标签分类
- 数据集列信息:
aa_seq: 蛋白质氨基酸序列foldseek_seq: foldseek 20 3di结构序列ss8_seq: DSSP 8二级结构序列
许可
- 许可证: Apache-2.0
任务类别
- 任务类别: 文本分类
标签
- 标签: 蛋白质, 下游任务



