five

Table_3_Automating methods for estimating metabolite volatility.XLSX

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/Table_3_Automating_methods_for_estimating_metabolite_volatility_XLSX/24804189
下载链接
链接失效反馈
官方服务:
资源简介:
The volatility of metabolites can influence their biological roles and inform optimal methods for their detection. Yet, volatility information is not readily available for the large number of described metabolites, limiting the exploration of volatility as a fundamental trait of metabolites. Here, we adapted methods to estimate vapor pressure from the functional group composition of individual molecules (SIMPOL.1) to predict the gas-phase partitioning of compounds in different environments. We implemented these methods in a new open pipeline called volcalc that uses chemoinformatic tools to automate these volatility estimates for all metabolites in an extensive and continuously updated pathway database: the Kyoto Encyclopedia of Genes and Genomes (KEGG) that connects metabolites, organisms, and reactions. We first benchmark the automated pipeline against a manually curated data set and show that the same category of volatility (e.g., nonvolatile, low, moderate, high) is predicted for 93% of compounds. We then demonstrate how volcalc might be used to generate and test hypotheses about the role of volatility in biological systems and organisms. Specifically, we estimate that 3.4 and 26.6% of compounds in KEGG have high volatility depending on the environment (soil vs. clean atmosphere, respectively) and that a core set of volatiles is shared among all domains of life (30%) with the largest proportion of kingdom-specific volatiles identified in bacteria. With volcalc, we lay a foundation for uncovering the role of the volatilome using an approach that is easily integrated with other bioinformatic pipelines and can be continually refined to consider additional dimensions to volatility. The volcalc package is an accessible tool to help design and test hypotheses on volatile metabolites and their unique roles in biological systems.

代谢物的挥发性可影响其生物学功能,并为其检测方法的优化提供依据。然而,现有大量已报道的代谢物尚未有公开的挥发性数据,这限制了将挥发性作为代谢物核心特征开展的相关研究。本研究基于单分子官能团组成(SIMPOL.1)改编了蒸气压估算方法,用于预测不同环境下化合物的气相分配行为。我们将该方法整合至一款名为volcalc的全新开源流程中,该流程借助化学信息学工具,可对涵盖代谢物、生物体与反应的大规模持续更新通路数据库——京都基因与基因组百科全书(Kyoto Encyclopedia of Genes and Genomes, KEGG)中所有代谢物的挥发性进行自动化估算。我们首先以人工整理数据集对该自动化流程开展基准测试,结果显示93%的化合物可被准确归类至相同的挥发性等级(如非挥发性、低挥发性、中挥发性、高挥发性)。随后,我们展示了volcalc如何用于生成并验证关于挥发性在生物系统与生物体中作用的相关假说。具体而言,我们估算得出:依据环境(分别为土壤与洁净大气)的不同,KEGG数据库中有3.4%与26.6%的化合物属于高挥发性类别;同时,所有生物域共享30%的核心挥发性代谢物,其中细菌界所特有的挥发性代谢物占比最高。借助volcalc,我们为揭示挥发组的功能奠定了基础:该方法可便捷地与其他生物信息学流程整合,且可通过纳入更多挥发性相关维度实现持续优化。volcalc工具包是一款易用的辅助工具,可用于设计并验证关于挥发性代谢物及其在生物系统中独特功能的相关假说。
创建时间:
2023-12-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作