five

Number of cell lines in training, dev and test dataset for perturbed gene expression profile prediction task and gene expression autoencoder training task.

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://figshare.com/articles/dataset/Number_of_cell_lines_in_training_dev_and_test_dataset_for_perturbed_gene_expression_profile_prediction_task_and_gene_expression_autoencoder_training_task_/20476972
下载链接
链接失效反馈
官方服务:
资源简介:
We evaluated these two tasks with leave new cells out cross-validation. In each split, we leave a group of cell lines out and then split the rest cell lines to training and dev dataset. The cells in training dataset, dev dataset and test dataset are distinct to each other. There are totally 15 cell lines for the perturbed gene expression profile prediction task, which have both gene expression profile found in CCLE and high-quality data in LINCS L1000 project. 11550 cell lines are used for the gene expression autoencoder training and are from both CCLE and TCGA datasets. (XLSX)
创建时间:
2022-08-11
二维码
社区交流群
二维码
科研交流群
商业服务