Number of cell lines in training, dev and test dataset for perturbed gene expression profile prediction task and gene expression autoencoder training task.
收藏Figshare2022-08-11 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Number_of_cell_lines_in_training_dev_and_test_dataset_for_perturbed_gene_expression_profile_prediction_task_and_gene_expression_autoencoder_training_task_/20476972
下载链接
链接失效反馈官方服务:
资源简介:
We evaluated these two tasks with leave new cells out cross-validation. In each split, we leave a group of cell lines out and then split the rest cell lines to training and dev dataset. The cells in training dataset, dev dataset and test dataset are distinct to each other. There are totally 15 cell lines for the perturbed gene expression profile prediction task, which have both gene expression profile found in CCLE and high-quality data in LINCS L1000 project. 11550 cell lines are used for the gene expression autoencoder training and are from both CCLE and TCGA datasets. (XLSX)
创建时间:
2022-08-11



