five

Processed features in support of Liebeskind et al (2018)

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/1406722
下载链接
链接失效反馈
官方服务:
资源简介:
Processed feature matrices used in Liebeskind et al. (2018). Supporting code: https://github.com/marcottelab/plum Datasets 1 - 4 correspond to those used in Figure 4: Dataset 1: No AP-MS, yeast CF-MS, training species: Human Dataset 2: AP-MS, yeast CF-MS, training species: Human Dataset 3: AP-MS, yeast CF-MS, training species: Human, Yeast Dataset 4: AP-MS, no yeast CF-MS, training species: Human, Yeast ".train_labeled.missing_annotated.csv" files are those used for training the model and include only orthogroups for which interactions are known in the training species. These known interactions come either from gold-standard test sets, such as CORUM or EMBL's training portal, or from the fact that at least of the orthogroup pairs is missing in the focal taxon. ".missing_annotated.csv" files were used for prediction, and include the entire feature matrices, plus known missing pairs. Note that there is no dataset 3 file. This is because data sets 2 and 3 differ only in the training species used, so dataset 3 predictions used dataset2_07302018.missing_annotated.csv as a feature matrix. dataset4_prediction_07302018.csv contains the predictions for all pairs on data set 4, the best performing data set that was used for all downstream analyses.
创建时间:
2020-01-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作