Number of cell lines in training, dev and test dataset for perturbed gene expression profile prediction task and gene expression autoencoder training task.

Source: Figshare. Updated: 2022-08-11. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/Number_of_cell_lines_in_training_dev_and_test_dataset_for_perturbed_gene_expression_profile_prediction_task_and_gene_expression_autoencoder_training_task_/20476972
Description:
We evaluated these two tasks with leave-new-cells-out cross-validation. In each split, a group of cell lines is held out as the test set, and the remaining cell lines are divided into training and dev sets. The cell lines in the training, dev, and test sets are mutually distinct. The perturbed gene expression profile prediction task uses 15 cell lines in total, each of which has both a gene expression profile in CCLE and high-quality data in the LINCS L1000 project. The gene expression autoencoder is trained on 11,550 cell lines drawn from the CCLE and TCGA datasets. (XLSX)
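
The leave-new-cells-out scheme described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation. The description does not give the number of splits, the dev fraction, or the random seed, so `n_splits`, `dev_fraction` and `seed` are assumed values. The function name `leave_cells_out_splits` and the cell-line names are hypothetical placeholders.

```python
import random


def leave_cells_out_splits(cell_lines, n_splits=5, dev_fraction=0.2, seed=0):
    """Yield (train, dev, test) lists of cell-line names with no overlap.

    n_splits, dev_fraction and seed are assumptions for illustration;
    the dataset description does not specify them.
    """
    rng = random.Random(seed)
    cells = sorted(set(cell_lines))
    rng.shuffle(cells)
    # Partition the cell lines into n_splits roughly equal held-out groups.
    folds = [cells[i::n_splits] for i in range(n_splits)]
    for k, test in enumerate(folds):
        # Everything outside the held-out group is split into train and dev.
        rest = [c for j, fold in enumerate(folds) if j != k for c in fold]
        rng.shuffle(rest)
        n_dev = max(1, round(dev_fraction * len(rest)))
        dev, train = rest[:n_dev], rest[n_dev:]
        yield train, dev, test


# Hypothetical placeholder names standing in for the 15 cell lines.
cells = [f"cell_{i:02d}" for i in range(15)]
splits = list(leave_cells_out_splits(cells))
for train, dev, test in splits:
    print(len(train), len(dev), len(test))
```

Splitting on cell-line names rather than on individual expression profiles is what keeps every profile from a held-out cell line out of the training and dev sets.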
Created:
2022-08-11