five

Benchmark study of feature selection strategies for multi-omics data

收藏
DataCite Commons2022-06-15 更新2024-07-29 收录
下载链接:
https://figshare.com/articles/dataset/Benchmark_study_of_feature_selection_strategies_for_multi-omics_data/20060201
下载链接
链接失效反馈
官方服务:
资源简介:
These data sets are the pre-processed versions of the multi-omics data sets used in the benchmark study presented in the paper "Benchmark study of feature selection strategies for multi-omics data" by Yingxia Li, Ulrich Mansmann, Shangming Du, and Roman Hornung. The outcome feature is "TP53_mutation" in each data set, where "1" / "0" indicates the presence / absence of a TP53 mutation in the respective patients. The remaining features are clinical and omics features, where the suffix "_clinical" indicates clinical features, the suffix "_cnv" copy number variation features, the suffix "_mirna" miRNA features, the suffix "_mutation" mutation features, and the suffix " _rna" RNA features. Note that while predicting the outcome feature TP53 yes vs. no is not meaningful contextually, TP53 mutations have been found to be associated with poor clinical outcomes in cancer patients [1]. Against this background, TP53 can be used as a surrogate for a phenotypic outcome. Thus, these data sets are meant for testing machine learning or statistical procedures, they may not be useful for biological analysis.
提供机构:
figshare
创建时间:
2022-06-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作