five

Supplementary tables:MetaFetcheR: An R package for complete mapping of small compound data

收藏
Mendeley Data2024-06-27 更新2024-06-27 收录
下载链接:
https://snd.se/catalogue/dataset/2024-341
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset includes a PDF file containing the results and an Excel file with the following tables: Table S1 Results of comparing the performance of MetaFetcheR to MetaboAnalystR using Diamanti et al. Table S2 Results of comparing the performance of MetaFetcheR to MetaboAnalystR for Priolo et al. Table S3 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool using Diamanti et al. Table S4 Results of comparing the performance of MetaFetcheR to MetaboAnalyst 5.0 webtool for Priolo et al. Table S5 Data quality test results for running 100 iterations on HMDB database. Table S6 Data quality test results for running 100 iterations on KEGG database. Table S7 Data quality test results for running 100 iterations on ChEBI database. Table S8 Data quality test results for running 100 iterations on PubChem database. Table S9 Data quality test results for running 100 iterations on LIPID MAPS database. Table S10 The list of metabolites that were not mapped by MetaboAnalystR for Diamanti et al. Table S11 An example of an input matrix for MetaFetcheR. Table S12 Results of comparing the performance of MetaFetcheR to MS_targeted using Diamanti et al. Table S13 Data set from Diamanti et al. Table S14 Data set from Priolo et al. Table S15 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Diamanti et al. Table S16 Results of comparing the performance of MetaFetcheR to CTS using LIPID MAPS identifiers available in Diamanti et al. Table S17 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Priolo et al. Table S18 Results of comparing the performance of MetaFetcheR to CTS using KEGG identifiers available in Priolo et al. (See the "index" tab in the Excel file for more information) Small-compound databases contain a large amount of information for metabolites and metabolic pathways. However, the plethora of such databases and the redundancy of their information lead to major issues with analysis and standardization. Lack of preventive establishment of means of data access at the infant stages of a project might lead to mislabelled compounds, reduced statistical power and large delays in delivery of results. We developed MetaFetcheR, an open-source R package that links metabolite data from several small-compound databases, resolves inconsistencies and covers a variety of use-cases of data fetching. We showed that the performance of MetaFetcheR was superior to existing approaches and databases by benchmarking the performance of the algorithm in three independent case studies based on two published datasets. The dataset was originally published in DiVA and moved to SND in 2024.

本数据集包含一份载有研究结果的PDF文件,以及一份包含如下表格的Excel文件: 表S1 基于Diamanti等人研究的MetaFetcheR与MetaboAnalystR性能对比结果 表S2 针对Priolo等人研究的MetaFetcheR与MetaboAnalystR性能对比结果 表S3 基于Diamanti等人研究的MetaFetcheR与MetaboAnalyst 5.0网页工具性能对比结果 表S4 针对Priolo等人研究的MetaFetcheR与MetaboAnalyst 5.0网页工具性能对比结果 表S5 在人类代谢组数据库(HMDB)上运行100次迭代的数据质量测试结果 表S6 在京都基因与基因组百科全书数据库(KEGG)上运行100次迭代的数据质量测试结果 表S7 在ChEBI数据库(ChEBI)上运行100次迭代的数据质量测试结果 表S8 在PubChem数据库(PubChem)上运行100次迭代的数据质量测试结果 表S9 在LIPID MAPS数据库(LIPID MAPS)上运行100次迭代的数据质量测试结果 表S10 Diamanti等人研究中未被MetaboAnalystR映射的代谢物列表 表S11 MetaFetcheR输入矩阵示例 表S12 基于Diamanti等人研究的MetaFetcheR与MS_targeted性能对比结果 表S13 Diamanti等人研究的数据集 表S14 Priolo等人研究的数据集 表S15 基于Diamanti等人研究中可用的KEGG标识符的MetaFetcheR与CTS性能对比结果 表S16 基于Diamanti等人研究中可用的LIPID MAPS标识符的MetaFetcheR与CTS性能对比结果 表S17 基于Priolo等人研究中可用的KEGG标识符的MetaFetcheR与CTS性能对比结果 表S18 基于Priolo等人研究中可用的KEGG标识符的MetaFetcheR与CTS性能对比结果(详细信息请参阅Excel文件中的"索引"工作表) 小分子数据库拥有海量代谢物与代谢通路信息,但这类数据库数量繁多且信息冗余,给分析与标准化工作带来诸多难题。若在项目初期未预先构建数据访问途径,可能会导致化合物标注错误、统计效力降低以及成果交付大幅延迟。我们开发了开源R包MetaFetcheR,该工具可整合多个小分子数据库的代谢物数据,解决数据不一致问题,并覆盖多种数据获取应用场景。通过基于两篇已发表数据集开展三项独立案例研究对算法性能进行基准测试,我们证实MetaFetcheR的性能优于现有方法与数据库。本数据集最初发布于DiVA平台,并于2024年迁移至SND平台。
创建时间:
2024-06-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作