
On selecting robust approaches for learning predictive biomarkers in metabolomics datasets

NIAID Data Ecosystem, indexed 2026-05-02
Download link:
https://figshare.com/articles/dataset/On_selecting_robust_approaches_for_learning_predictive_biomarkers_in_metabolomics_datasets/29959079
Description:
Code repo (data acquisition): https://github.com/thibgo/metabolightsbinarydatasets
Code repo (experiments): https://github.com/thibgo/metabolightsbinarydatasets
Article: https://www.semanticscholar.org/paper/On-Selecting-Robust-Approaches-for-Learning-in-Data-Godon-Plante/cdac02dd43aa79d5ef5240367ca02dec9ba635e4

Abstract: Metabolomics, the study of small molecules within biological systems, offers insights into metabolic processes and, consequently, holds great promise for advancing health outcomes. Biomarker discovery in metabolomics represents a significant challenge, notably due to the high dimensionality of the data. Recent work has addressed this problem by analyzing the most important variables in machine learning models. Unfortunately, this approach relies on prior hypotheses about the structure of the data and may overlook simple patterns. To assess the true usefulness of machine learning methods, we evaluate them on a collection of 835 metabolomics data sets. This effort provides valuable insights for metabolomics researchers regarding where and when to use machine learning. It also establishes a benchmark for the evaluation of future methods. Nonetheless, the results emphasize the high diversity of data sets in metabolomics and the complexity of finding biologically relevant biomarkers. As a result, we propose a novel approach applicable across all data sets, offering guidance for future analyses. This method involves directly comparing univariate and multivariate models. We demonstrate through selected examples how this approach can guide data analysis across diverse data set structures, representative of the observed variability. Code and data are available for research purposes.
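The abstract does not say which univariate and multivariate models are compared, so the following is only a rough sketch of the general idea, not the authors' code. It assumes a single-feature threshold classifier as the univariate model and a nearest-centroid classifier over all features as the multivariate model. The data are synthetic, and every function name here (make_data, fit_univariate, fit_multivariate, cv_accuracy) is hypothetical.

```python
import random
import statistics


def make_data(n=200, n_features=20, seed=0):
    """Synthetic binary data set (assumption): only feature 0 carries signal."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[0] += 2.0 * label  # shift feature 0 for the positive class
        X.append(row)
        y.append(label)
    return X, y


def class_means(X, y, j):
    """Mean of feature j within class 0 and within class 1."""
    m0 = statistics.mean(r[j] for r, t in zip(X, y) if t == 0)
    m1 = statistics.mean(r[j] for r, t in zip(X, y) if t == 1)
    return m0, m1


def fit_univariate(X, y):
    """Stand-in univariate model: best single feature with a midpoint threshold."""
    best = None
    for j in range(len(X[0])):
        m0, m1 = class_means(X, y, j)
        thr = (m0 + m1) / 2.0
        sign = 1.0 if m1 >= m0 else -1.0
        acc = statistics.mean(
            int((sign * (r[j] - thr) > 0) == bool(t)) for r, t in zip(X, y)
        )
        if best is None or acc > best[0]:
            best = (acc, j, thr, sign)
    _, j, thr, sign = best
    return lambda row: int(sign * (row[j] - thr) > 0)


def fit_multivariate(X, y):
    """Stand-in multivariate model: nearest class centroid over all features."""
    means = [class_means(X, y, j) for j in range(len(X[0]))]
    c0 = [m[0] for m in means]
    c1 = [m[1] for m in means]

    def predict(row):
        d0 = sum((a - b) ** 2 for a, b in zip(row, c0))
        d1 = sum((a - b) ** 2 for a, b in zip(row, c1))
        return int(d1 < d0)

    return predict


def cv_accuracy(fit, X, y, k=5):
    """Mean accuracy over k interleaved folds; the model is refit on each fold."""
    scores = []
    for fold in range(k):
        train = [i for i in range(len(X)) if i % k != fold]
        test = [i for i in range(len(X)) if i % k == fold]
        model = fit([X[i] for i in train], [y[i] for i in train])
        scores.append(statistics.mean(int(model(X[i]) == y[i]) for i in test))
    return statistics.mean(scores)


X, y = make_data()
uni_acc = cv_accuracy(fit_univariate, X, y)
multi_acc = cv_accuracy(fit_multivariate, X, y)
print(f"univariate accuracy:   {uni_acc:.3f}")
print(f"multivariate accuracy: {multi_acc:.3f}")
```

When the two scores are close, as one would expect here since a single feature carries all the signal, a simple single-metabolite pattern may explain the data as well as a model using every feature. A clear gap in favor of the multivariate model would instead suggest that the signal is spread across several variables.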
Created:
2025-08-21