Supplementary Data for "Fast and Accurate Variant Identification Tool for Sequencing-Based Studies"

Source: Figshare. Updated: 2024-03-19. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/Supplementary_Data_for_b_Fast_and_Accurate_Variant_Identification_Tool_for_Sequencing-Based_Studies_b_/25437217
Description:
Benchmark datasets used in this study to evaluate the performance of QuickVariants and bcftools. Instructions for using these benchmark datasets can be found here.

Gut_microbiome_benchmark.tar.gz contains gut microbiome WGS reads, original assembled genomes, and mutated genomes with in silico point mutations and indels. This dataset includes SRX5976902 Akkermansia muciniphila (A. muciniphila), SRX5977424 Bacteroides faecis (BaFa), SRX5976649 Bacteroides fragilis (B. fragilis), SRX5976729 Bacteroides ovatus (B. ovatus), SRX6045315 Bacteroides vulgatus (BaVu), SRX6044844 Bacteroides xylanisolvens (BaXy), SRX6044813 Bifidobacterium adolescentis (BiAd), SRX5991169 Escherichia coli (EsCo), and SRX5992782 Parabacteroides distasonis (PaDi).

COVID_benchmark.tar.gz contains SARS-CoV-2 (NC_045512.2 and SRR10971381) WGS reads, original assembled genomes, and mutated genomes with in silico point mutations and indels.

WGS_simulation_sequencingerror.tar.gz contains Illumina HiSeq 2500 and NextSeq 500 v2 WGS data simulated from a reference genome of B. fragilis NCTC 9343 (NCBI accession GCF_000025985.1) with sequencing error rates varying from 0.1 to 10 times the original error rate (-qs -10, -qs 1, and -qs 10).

MG_simultation_sequencingerror.tar.gz contains an Illumina HiSeq 2500 metagenomic dataset simulated from nine human gut microbiome reference genomes with the original sequencing error rate and 20X read depth.

MGBIG_simultation_sequencingerror.tar.gz contains an Illumina HiSeq 2500 metagenomic dataset simulated from nine human gut microbiome reference genomes with the original sequencing error rate and 100X read depth.

COVID_MGSW.tar.gz contains three longitudinal metagenomes from sewage in Wisconsin, USA, collected in January 2022 (NCBI accessions SRR21019653 and SRR21019687) and March 2023 (NCBI accession SRR23934917), and the SARS-CoV-2 reference genome (NC_045512.2).

ExampleDataFig4-6.zip contains SAM and VCF output files that support the examples shown in Figs. 4-6.
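
Most of the archives described above are gzip-compressed tarballs, so it can help to check what an archive holds before unpacking it. The following is a minimal sketch using only Python's standard `tarfile` module. The function names `list_archive_members` and `extract_archive`, the local file path, and the output directory are illustrative assumptions and are not part of the dataset or its instructions. The internal layout of each archive is not documented here, so the sketch only prints whatever member names it finds.

```python
import tarfile
from pathlib import Path


def list_archive_members(archive_path):
    """Return the member names of a .tar.gz archive without extracting it."""
    with tarfile.open(archive_path, mode="r:gz") as archive:
        return archive.getnames()


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.gz archive into dest_dir and return the destination path."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, mode="r:gz") as archive:
        archive.extractall(dest)
    return dest


if __name__ == "__main__":
    # Assumption: the archive was already downloaded into the working directory.
    local_copy = Path("COVID_benchmark.tar.gz")
    if local_copy.exists():
        for name in list_archive_members(local_copy):
            print(name)
        extract_archive(local_copy, "COVID_benchmark")
    else:
        print(f"{local_copy} not found; download it from the Figshare record first.")
```

ExampleDataFig4-6.zip is a zip file rather than a tarball, so it would need the standard `zipfile` module instead.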
Created:
2024-03-19