five

A Biologically Informed Machine Learning Pipeline Uncovers Metabolic Features of Intestinal Barrier Dysfunction

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/A_Biologically_Informed_Machine_Learning_Pipeline_Uncovers_Metabolic_Features_of_Intestinal_Barrier_Dysfunction/31812623
下载链接
链接失效反馈
官方服务:
资源简介:
Intestinal barrier dysfunction is a pivotal factor in diverse pathologies, yet clear diagnostic criteria remain elusive, and biomarker discovery is often hindered by the high costs of large-scale clinical cohorts. Here, we developed a biologically informed machine learning (ML) pipeline that integrates function-guided network biology with ensemble feature selection (EFS) to identify key metabolic features associated with this condition. As an integrative framework, the pipeline combines knowledge-based screening and computational prioritization with biological validation, reconciling statistical rigor with biological interpretability. We first constructed a heterogeneous data set from multiphenotypic murine models and derived a continuous intestinal barrier index as a predicted variable. This approach reframes the conventional classification paradigm into a supervised regression task, enabling the discovery of disease-relevant clues without clear diagnostic end points. By leveraging network biology-informed EFS, we identified a functional feature subset of 10 core metabolites. The subset demonstrated remarkable stability across five distinct regression architectures, maintaining consistent robust performance (R2: 0.604–0.654; mean absolute error: 0.319–0.352). Furthermore, independent clinical validation coupled with in vitro biotransformation assays confirmed the applicability of these prioritized metabolic features, highlighting serotonin, N-acetylputrescine, d-phenylalanine, and hippuric acid as functional metabolic features of barrier dysfunction. In this work, ML was repurposed as a tool for feature prioritization, robustness evaluation, and hypothesis generation, reducing reliance on large cohorts. This integrated pipeline facilitates the identification of intestinal barrier dysfunction associated metabolic features and enables high-efficiency evaluation of multicomponent therapeutic responses, offering new insights into the management of complex multifactorial diseases.
创建时间:
2026-03-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作