five

Comparative enrichment analysis of biomarker-based patient clusters: A quantitative analysis.

收藏
NIAID Data Ecosystem2026-03-07 收录
下载链接:
https://figshare.com/articles/dataset/_Comparative_enrichment_analysis_of_biomarker_based_patient_clusters_A_quantitative_analysis_/342476
下载链接
链接失效反馈
官方服务:
资源简介:
The number of validated terms found to be enriched in clusters generated with the different pre-processing procedures and clustering algorithms tested in this study. Validated enrichments and validated clusters are defined by the recurrence of statistically significant enrichment in the training and validation datasets. The clusters that were generated from the test dataset using a particular pre-processing clustering combination were subjected to enrichment analysis with 19 health/lifestyle labels (i.e. searching for statistically significant over-representation of patients with the trait in each cluster). An artificial neural network, trained with the cluster assignment of each individual in the training dataset, was used to classify individuals from the validation dataset using the same clinical biomarkers subjected to the same pre-processing algorithm as was the test dataset. The resulting clustering of the validation set was also subjected to enrichment analysis with the same terms as was the training set. An enrichment was deemed to be a validated enrichment if the same label was enriched in the test and validation datasets. A validated cluster was defined as a cluster sharing at least one enriched term between the test and validation sets (i.e. the number of clusters enriched in the training set). The enrichment factor for each pipeline is the average enrichment factor of the three most significant enrichments. K-mns = K-means; NoramTranf = transformation to normal; Z-score = Z-score normalization; AgeAdj = age adjustment followed by Z-score normalization.
创建时间:
2012-03-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作