five

Rate-Perturbing Single Amino Acid Mutation for Hydrolases: A Statistical Profiling

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://figshare.com/articles/dataset/Rate-Perturbing_Single_Amino_Acid_Mutation_for_Hydrolases_A_Statistical_Profiling/16624552
下载链接
链接失效反馈
官方服务:
资源简介:
Hydrolases are a critical component for modern chemical, pharmaceutical, and environmental sciences. Identifying mutations that enhance catalytic efficiency presents a roadblock to design and to discover new hydrolases for broad academic and industrial uses. Here, we report the statistical profiling for rate-perturbing mutant hydrolases with a single amino acid substitution. We constructed an integrated structure−kinetics database for hydrolases, IntEnzyDB, which contains 3907 kcats, 4175 KMs, and 2715 Protein Data Bank IDs. IntEnzyDB adopts a relational architecture with a flattened data structure, enabling facile and efficient access to clean and tabulated data for machine learning uses. We conducted statistical analyses on how single amino acids mutations influence the turnover number (i.e., kcat) and efficiency (i.e., kcat/KM), with a particular emphasis on profiling the features for rate-enhancing mutations. The results show that mutation to bulky nonpolar residues with a hydrocarbon chain involves a higher likelihood for rate acceleration than to other types of residues. Linear regression models reveal geometric descriptors of substrate and mutation residues that mediate rate-perturbing outcomes for hydrolases with bulky nonpolar mutations. On the basis of the analyses of the structure−kinetics relationship, we observe that the propensity for rate enhancement is independent of protein sizes. In addition, we observe that distal mutations (i.e., >10 Å from the active site) in hydrolases are significantly more prone to induce efficiency neutrality and avoid efficiency deletion but involve similar propensity for rate enhancement. The studies reveal the statistical features for identifying rate-enhancing mutations in hydrolases, which will potentially guide hydrolase discovery in biocatalysis.

水解酶(Hydrolases)是现代化学、制药与环境科学领域的关键组成部分。筛选可提升催化效率的突变体,是开发并获取可广泛应用于学术与工业领域的新型水解酶的一大阻碍。本研究针对携带单氨基酸替换的速率扰动型水解酶突变体开展了统计特征分析。我们构建了一款整合了水解酶结构-动力学数据的数据库IntEnzyDB,该库包含3907个催化常数(kcat)、4175个米氏常数(KM)以及2715个蛋白质数据库(Protein Data Bank, PDB)编号。IntEnzyDB采用关系型架构与扁平化数据结构,可便捷高效地获取经过整理的结构化数据,适配机器学习应用需求。我们针对单氨基酸突变如何影响周转数(即kcat)与催化效率(即kcat/KM)开展了统计分析,重点梳理了可提升催化速率的突变体的特征。研究结果表明,相较于其他类型的氨基酸残基,将残基突变为带有烃链的大体积非极性残基时,催化速率提升的概率更高。线性回归模型揭示了底物与突变残基的几何描述符,这类描述符可调控携带大体积非极性突变的水解酶的速率扰动效应。基于结构-动力学关联的分析结果,我们发现催化速率提升的倾向性与蛋白质分子量无关。此外,我们发现水解酶中的远端突变(即距活性位点大于10埃(Å)的突变)更易导致催化效率保持中性,且不会引发效率下降,但这类突变同样具备相似的速率提升倾向性。本研究明确了可用于筛选水解酶速率提升突变体的统计特征,有望为生物催化领域的水解酶开发提供指导。
创建时间:
2021-09-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作