five

Recall of test set regulatory sequence prediction for different cell lines.

收藏
Figshare2020-12-01 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Recall_of_test_set_regulatory_sequence_prediction_for_different_cell_lines_/13313915
下载链接
链接失效反馈
官方服务:
资源简介:
Gkm-SVM models were trained on DHS datasets (positive) and corresponding negative sets of k-mer shuffled sequences (k = 2, k = 7) or genomic background sequences (tGC = 0.02) for A549 or MCF-7 cells; cell lines with two training datasets (A/B) each. Model performance was evaluated based on recall for hold-out sets (chromosome 8). There are seven different hold-out sets derived from different cell lines and we assess model generalization across cell-types. Best performance is observed for models trained on highly shuffled sequences (k = 2), model performance is reduced when trained on genomic background, while the performance of models trained on lightly shuffled sequences (k = 7) is considerably worse. Datasets are named according to S1 Table. Results for the CNN models (2conv2norm and 4conv2pool4norm) are available in S5 and S6 Tables, respectively.
创建时间:
2020-12-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作