five

FAIRsharing record for: Mutational data for protein solubility

收藏
DataCite Commons2025-12-01 更新2026-05-03 收录
下载链接:
https://fairsharing.org/10.25504/FAIRsharing.0ad681
下载链接
链接失效反馈
官方服务:
资源简介:
This FAIRsharing record describes: SoluProtMutDB is a comprehensive, manually curated database of the protein solubility data. Low protein solubility presents major challenges to industrial applications and is often reported to be behind many human diseases. Understanding how mutations affect protein solubility can therefore help elucidate the mechanisms associated with the development of human diseases and better utilize protein engineering in industrial applications. Multiple factors may play a role here: the presence of a chaperone or co-factor required for correct protein folding, unnatural physiological conditions such as high temperature, pH, or protein concentration, tendency of a protein to aggregate due to aggregation-prone regions, etc. The predictive power of the existing protein engineering tools is often compromised by limited experimental data available for rigorous training and testing of solubility predictions. The published data used for solubility prediction upon mutation are usually scattered in the literature and had to be collected manually. The goal of the SoluProtMut database is to collect the reported evidence of solubility changes upon mutations from published sources to guide future protein engineering effort in producing soluble protein variants. The database currently contains data previously used for training Machine Learning-based predictors, such as PON-Sol, CamSol, AGGRESCAN3D, OptSolMut, as well as recently published datasets. We are providing manually curated and reliable data in the standardized format which are pre-processed for machine learning applications.
提供机构:
FAIRsharing
创建时间:
2025-12-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作