
ecDNA machine learning modeling

Source: Mendeley Data. Updated: 2024-05-10. Indexed: 2024-06-30.
Download link:
https://zenodo.org/records/11082530
Description:
The source data table for Supplementary Figure 13 of the accompanying article was found to contain errors caused by improper Excel operation. We have uploaded the corrected data table here.

1. ecDNA_cargo_gene_modeling_data.csv.gz
   Features from 386 TCGA tumors for modeling ecDNA cargo gene prediction. The file was converted from R data format with the code below. NOTE: the columns 'sample' and 'gene_id' are not used for the actual modeling; they serve only for identification and sampling.

       library(data.table)
       data = readRDS("~/../Downloads/ecDNA_cargo_gene_modeling_data.rds")
       colnames(data)[3] = "total_cn"
       data.table::fwrite(data, file = "~/../Downloads/ecDNA_cargo_gene_modeling_data.csv.gz", sep = ",")

2. gcap_pcawg_WGS_result.tar.gz
   GCAP analysis results for PCAWG allele-specific copy number profiles derived from WGS.
3. gcap_tcga_snp6_result.tar.gz
   GCAP analysis results for TCGA allele-specific copy number profiles derived from SNP6 array.
4. gcap_Changkang_WES_result.tar.gz
   GCAP analysis results for SYSUCC Changkang allele-specific copy number profiles derived from tumor-normal paired WES.
5. tcga_overlap_gene_wgs.rds, tcga_overlap_gene_snp.rds and tcga_overlap_gene_wes.rds
   TCGA gene-level copy number results in R data format from overlapping samples (datasets above). Sources: WGS from PCAWG, SNP array, and WES from GDC portal.
6. cellline-batch1.zip & cellline-batch1.zip
   GCAP results of cell line batch 1 and batch 2.
7. AA_cellline_wgs.zip
   AA software results for cell line batch 1.
8. Batch2_AA_summary.xlsx
   AA software results for cell line batch 2.
9. FISH-for-supp-file.zip
   Extended raw FISH images from 12 CRC samples.
10. SNU216.zip
   Extended AA and GCAP analysis on SNU216.
11. aa_ffpe.zip and AA_summary_table_of_6_erbb2_ffpe_samples.xlsx
   Extended AA running files (all results) and result summary data for 6 clinical samples with GCAP-predicted ERBB2 amplification.
12. Source data of Fig. 4.
13. Source data of Supp. Fig. 2 subplots.
14. Source data of Supp. Fig. 15.
15. GCAP result data objects for three ICB cohorts. Both gene-level and sample-level data are included.
16. PDX-P68: processed (AA and CNV) data of P68 from WGS and WES data.
17. Source data of Supp. Fig. 13.
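The note on item 1 says that 'sample' and 'gene_id' identify rows and are not model inputs. A minimal Python sketch of that separation follows, using only the standard library. It assumes that every column other than those two can be treated as a modeling column. The helper names, the demo file, and the 'label' column are hypothetical and do not come from the dataset. Only 'sample', 'gene_id' and 'total_cn' are named in the description.

```python
import csv
import gzip
import os
import tempfile

# From the dataset note: these two columns identify rows and are not model inputs.
ID_COLUMNS = ("sample", "gene_id")


def load_modeling_table(path):
    """Read a gzip-compressed CSV and split identifier columns from the rest."""
    ids, features = [], []
    with gzip.open(path, mode="rt", newline="") as handle:
        reader = csv.DictReader(handle)
        # Assumption: every non-identifier column is a modeling column.
        feature_names = [c for c in reader.fieldnames if c not in ID_COLUMNS]
        for row in reader:
            ids.append({c: row[c] for c in ID_COLUMNS})
            features.append({c: row[c] for c in feature_names})
    return feature_names, ids, features


def write_demo_file(path):
    """Write a tiny made-up table so the sketch runs without the real download."""
    rows = [
        ["sample", "gene_id", "total_cn", "label"],  # 'label' is hypothetical
        ["S1", "G1", "8", "1"],
        ["S1", "G2", "2", "0"],
    ]
    with gzip.open(path, mode="wt", newline="") as handle:
        csv.writer(handle).writerows(rows)


demo_path = os.path.join(tempfile.mkdtemp(), "demo_modeling_data.csv.gz")
write_demo_file(demo_path)
feature_names, ids, features = load_modeling_table(demo_path)
print(feature_names)
```

Keeping the identifier columns in a separate list, rather than discarding them, leaves 'sample' available for sampling decisions such as keeping all genes from one tumor on the same side of a train/test split. Values are read as strings and would need numeric conversion before modeling.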
Created:
2024-05-01