A machine learning framework to predict cancer metabolomics from gene-expression data
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP591042
下载链接
链接失效反馈官方服务:
资源简介:
Metabolomics provides a direct functional readout of a tumor's physiology. Yet, it is lagging behind other omics technologies in facilitating disease monitoring and prognostication. This stems partly from the scarcity of large-scale metabolomic studies, but also the analytical complexities of detecting diverse metabolites with varying physicochemical properties and concentrations. To address this, we developed a machine learning framework using both tumor tissue and cell line samples across multiple cancer types that allows prediction of metabolomics from gene expression data. Two different model types were selected and trained for tissues and cell lines with their generalization capacity validated on independent cohorts, accurately predicting as high as 70-80% of tested metabolites. This work offers a scalable and efficient machine learning pipeline to determine metabolic from transcriptomic signatures, opening avenues to reconstruct and study the metabolic landscape of samples across novel and existing datasets lacking direct metabolomics measurements. Overall design: RNA-sequencing profile of MCF7 (PIK3CA wild-type (WT) and E545K mutant (MUT)) and MCF10A (PIK3CA wild-type (WT), E545K and H1047R mutant (MUT)) isogenic cell lines.
创建时间:
2025-09-30



