five

eRah: A Computational Tool Integrating Spectral Deconvolution and Alignment with Quantification and Identification of Metabolites in GC/MS-Based Metabolomics

收藏
NIAID Data Ecosystem2026-03-09 收录
下载链接:
https://figshare.com/articles/dataset/eRah_A_Computational_Tool_Integrating_Spectral_Deconvolution_and_Alignment_with_Quantification_and_Identification_of_Metabolites_in_GC_MS-Based_Metabolomics/3827751
下载链接
链接失效反馈
官方服务:
资源简介:
Gas chromatography coupled to mass spectrometry (GC/MS) has been a long-standing approach used for identifying small molecules due to the highly reproducible ionization process of electron impact ionization (EI). However, the use of GC-EI MS in untargeted metabolomics produces large and complex data sets characterized by coeluting compounds and extensive fragmentation of molecular ions caused by the hard electron ionization. In order to identify and extract quantitative information on metabolites across multiple biological samples, integrated computational workflows for data processing are needed. Here we introduce eRah, a free computational tool written in the open language R composed of five core functions: (i) noise filtering and baseline removal of GC/MS chromatograms, (ii) an innovative compound deconvolution process using multivariate analysis techniques based on compound match by local covariance (CMLC) and orthogonal signal deconvolution (OSD), (iii) alignment of mass spectra across samples, (iv) missing compound recovery, and (v) identification of metabolites by spectral library matching using publicly available mass spectra. eRah outputs a table with compound names, matching scores and the integrated area of compounds for each sample. The automated capabilities of eRah are demonstrated by the analysis of GC-time-of-flight (TOF) MS data from plasma samples of adolescents with hyperinsulinaemic androgen excess and healthy controls. The quantitative results of eRah are compared to centWave, the peak-picking algorithm implemented in the widely used XCMS package, MetAlign, and ChromaTOF software. Significantly dysregulated metabolites are further validated using pure standards and targeted analysis by GC-triple quadrupole (QqQ) MS, LC-QqQ, and NMR. eRah is freely available at http://CRAN.R-project.org/package=erah.

气相色谱-质谱联用法(Gas Chromatography-Mass Spectrometry, GC/MS)因电子轰击电离(Electron Impact Ionization, EI)的电离过程重复性极佳,长期以来一直是小分子鉴定的经典方法。然而,在非靶向代谢组学研究中使用GC-EI MS会生成体量庞大且结构复杂的数据集,这类数据集以共流出化合物以及硬电子电离导致的分子离子大量碎裂为核心特征。为实现在多份生物样本中鉴定代谢物并提取其定量信息的目标,亟需集成化的计算数据分析工作流。本文介绍eRah——一款基于开源语言R编写的免费计算工具,其包含五大核心功能模块:(1)GC/MS色谱图的噪声过滤与基线校正;(2)基于局部协方差化合物匹配(Compound Match by Local Covariance, CMLC)与正交信号解卷积(Orthogonal Signal Deconvolution, OSD)的多元分析技术,实现创新性的化合物解卷积流程;(3)跨样本质谱谱图对齐;(4)缺失化合物补全;(5)利用公开质谱谱图进行谱库匹配,完成代谢物鉴定。eRah可输出包含各样本中化合物名称、匹配得分以及化合物积分峰面积的标准化结果表格。本研究通过分析高胰岛素血症性雄激素过多症青少年与健康对照者的血浆样本的气相色谱-飞行时间质谱(Gas Chromatography-Time of Flight Mass Spectrometry, GC-TOF MS)数据,验证了eRah的自动化分析性能。研究将eRah的定量分析结果与三款常用工具进行了对比:广泛使用的XCMS软件包中集成的峰拾取算法centWave、MetAlign软件以及ChromaTOF软件。研究进一步通过纯标准品以及气相色谱-三重四极杆质谱(Gas Chromatography-Triple Quadrupole Mass Spectrometry, GC-QqQ MS)、液相色谱-三重四极杆质谱(Liquid Chromatography-Triple Quadrupole Mass Spectrometry, LC-QqQ)与核磁共振波谱法(Nuclear Magnetic Resonance, NMR)的靶向分析,对显著失调的代谢物进行了实验验证。eRah可于http://CRAN.R-project.org/package=erah免费获取。
创建时间:
2016-09-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作