peptides_soluble
收藏魔搭社区2025-10-03 更新2025-05-31 收录
下载链接:
https://modelscope.cn/datasets/jablonkagroup/peptides_soluble
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Details
### Dataset Description
Solubility was estimated by retrospective analysis of electronic laboratory notebooks.
The notebooks were part of a large effort called the Protein Structure Initiative and consider sequences
linearly through the following stages: Selected, Cloned, Expressed, Soluble, Purified, Crystallized,
HSQC (heteronuclear single quantum coherence), Structure, and deposited in PDB. The peptides were identified
as soluble or insoluble by "Comparing the experimental status at two time points, September 2009 and May 2010,
we were able to derive a set of insoluble proteins defined as those which were not
soluble in September 2009 and still remained in that state 8 months later."
- **Curated by:**
- **License:** CC BY 4.0
### Dataset Sources
- [corresponding publication](https://doi.org/10.1021/acs.jcim.2c01317)
- [data source](https://doi.org/10.1111/j.1742-4658.2012.08603.x)
## Citation
**BibTeX:**
```bibtex
@article{berman2009protein,
title={The protein structure initiative structural genomics knowledgebase},
author={Berman, Helen M and Westbrook, John D and Gabanyi, Margaret J and Tao,
Wendy and Shah, Raship and Kouranov, Andrei and Schwede, Torsten and Arnold,
Konstantin and Kiefer, Florian and Bordoli, Lorenza and others},
journal={Nucleic acids research},
volume={37},
number={suppl1},
pages={D365--D368},
year={2009},
publisher={Oxford University Press}
@article{smialowski2012proso,
title={PROSO II--a new method for protein solubility prediction},
author={Smialowski, Pawel and Doose, Gero and Torkler, Phillipp and Kaufmann,
Stefanie and Frishman, Dmitrij},
journal={The FEBS journal},
volume={279},
number={12},
pages={2192--2200},
year={2012},
publisher={Wiley Online Library}
```
## 数据集详情
### 数据集描述
本数据集通过对电子实验记录本的回顾性分析来估算蛋白质溶解度。这些实验记录隶属于一项名为"蛋白质结构计划(Protein Structure Initiative)"的大型科研项目,其按线性流程对蛋白序列依次经过以下阶段:筛选、克隆、表达、可溶性检测、纯化、结晶、异核单量子相干谱(heteronuclear single quantum coherence,HSQC)检测、结构解析,并最终提交至蛋白质数据库(Protein Data Bank,PDB)。多肽的可溶性与不溶性判定依据为:"通过对比2009年9月与2010年5月两个时间点的实验状态,我们得到了一组不溶性蛋白数据集,其定义为2009年9月时不溶且在8个月后仍维持该状态的蛋白。"
- **数据整理方:**
- **授权协议:** CC BY 4.0
### 数据集来源
- [相关研究论文](https://doi.org/10.1021/acs.jcim.2c01317)
- [数据源](https://doi.org/10.1111/j.1742-4658.2012.08603.x)
## 引用信息
**BibTeX格式引用:**
bibtex
@article{berman2009protein,
title={The protein structure initiative structural genomics knowledgebase},
author={Berman, Helen M and Westbrook, John D and Gabanyi, Margaret J and Tao,
Wendy and Shah, Raship and Kouranov, Andrei and Schwede, Torsten and Arnold,
Konstantin and Kiefer, Florian and Bordoli, Lorenza and others},
journal={Nucleic acids research},
volume={37},
number={suppl1},
pages={D365--D368},
year={2009},
publisher={Oxford University Press}
@article{smialowski2012proso,
title={PROSO II--a new method for protein solubility prediction},
author={Smialowski, Pawel and Doose, Gero and Torkler, Phillipp and Kaufmann,
Stefanie and Frishman, Dmitrij},
journal={The FEBS journal},
volume={279},
number={12},
pages={2192--2200},
year={2012},
publisher={Wiley Online Library}
提供机构:
maas
创建时间:
2025-05-27



