10 Synthetic Genomics Datasets
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10683210
下载链接
链接失效反馈官方服务:
资源简介:
These are 10 synthetic genomics datasets generated with NEAT v3 (based on TP53 gene of Homo Sapiens) for the use case of benchmarking somatic variant callers. To find more about our generating framework please visit synth4bench GitHub repository.
The datasets explore intrinsic NGS data parameters for the use case of observing their effect on tumor-only somatic variant calling algorithms. From the 10 datasets, there are 5 of them with different coverage (while keeping all other parameters fixed) and 5 with varying read length. The reads in all datasets are paired-end .
Name of File
Coverage
Lenght of Reads
300_30_10
300x
150
700_70_10
700x
150
1000_100_10
1000x
150
3000_300_10
3000x
150
5000_500_10
5000x
150
1000_50
1000x
50
1000_100
1000x
100
1000_170
1000x
170
1000_200
1000x
200
1000_300
1000x
300
创建时间:
2024-04-10



