OGTFinder: A Curated Growth Temperature Data Set and Its Application To Predict Optimal Growth Temperatures of Bacteria and Archaea
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/OGTFinder_A_Curated_Growth_Temperature_Data_Set_and_Its_Application_To_Predict_Optimal_Growth_Temperatures_of_Bacteria_and_Archaea/31995869
下载链接
链接失效反馈官方服务:
资源简介:
The optimal growth temperature (OGT) of organisms is
valuable in
bioprospecting enzymes that work under extreme conditions. Existing
OGT prediction models achieve high accuracy but mainly capture trends
of overrepresented groups in the training set including organisms
that thrive at moderate temperatures and those from well-described
taxa. In this study, we incorporated weighted scoring and phylogenetic
splits to improve the generalizability of the prediction models. We
first built a new growth temperature data set comprising more than
15,000 species distributed over all three domains of life, with special
attention to include OGT and extreme temperature data. We then trained
machine learning models on the prokaryotic OGT data using proteome-averaged
amino acid descriptors. The best-performing model was the multilayer
perceptron (MLP) with a test RMSE of 5.49 °C and an R2 of 0.84. The most important proteome features were related
to backbone flexibility and charged residues, as well as surface accessibility.
The MLP model is integrated in the command line tool OGTFinder and
available under MIT license at: https://github.com/SC-Git1/OGTFinder.
创建时间:
2026-04-13



