five

Hybracter v0.7.0 Benchmarking Output

收藏
Mendeley Data2024-05-10 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/10906937
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains: The subsampled FASTQ files used to benchmarking Hybracter (https://github.com/gbouras13/hybracter). Benchmarking Output for Hybracter v0.7.0 vs Unicycler v0.5.0 vs Dragonflye v1.1.2 on these files. The full benchmarking code and explanation is available https://github.com/gbouras13/hybracter_benchmarking. The `hybracter_benchmarking_fastqs.tar.gz` tarball will contain subsampled FASTQs (gzipped) of the first 20 samples used to benchmarking `hybracter`. These are the JKD6159, Lerminiaux, Chitale and super-accuracy model basecelled simplex ATCC fastqs. The `PRJNA1087001_ATCC_SUP_Duplex_FAST_Simplex_fastqs.tar.gz` tarball will contain subsampled FASTQs (gzipped) of the 10 added samples in v2 of the prepint used to benchmarking `hybracter`. These are the fast model basecelled simplex ATCC fastqs and super-accuracy model basecelled duplex ATCC fastqs. The other 4 tarballs ( `hybracter_benchmarking_results_v0.7.0.tar.gz`, `hybracter_benchmarking_results_fast.tar.gz`, `hybracter_benchmarking_results_duplex.tar.gz` and `hybracter_depth_Lerminiaux_isolateB_benchmarking_results.tar.gz`) contain benchmarking outputs for the first 20 samples, 5 fast model basecelled simplex ATCC samples, 5 super-accuracy model basecelled duplex ATCC and the depth analysis for Lerminiaux isolate B. The when untared, each tarball will contain: `BENCHMARKS` - contains the time etc benchmarking for each run (sample x tool) `DNADIFF` - contains raw chromosome Dnadiff results for each run (sample x tool) `DNADIFF_PARSED_OUTPUT` - contains parsed chromosome Dnadiff results for each sample `DNADIFF_PLASMIDS` - contains plasmid Dnadiff results for each run (sample x tool) `DNADIFF_PARSED_OUTPUT_PLASMID` - contains parsed plasmid Dnadiff results for each sample `REAL` - this contains all the actual output for each assembler. The following 5 directories will contain the all the raw output with subdirectories for each sample: `HYBRACTER_HYBRID_OUTPUT` `HYBRACTER_LONG_OUTPUT` `DRAGONFLYE_HYBRID_OUTPUT` `DRAGONFLYE_LONG_OUTPUT` `UNICYCLER_OUTPUT` Additionally, `hybracter_benchmarking_results_v0.7.0.tar.gz` will have `HYBRACTER_HYBRID_OUTPUT_REAL_BULK` - this contains the output for the 12 Lerminiaux et al isolates assembled using `hybracter hybrid` with modified config file `bulk_assemble_lerminiaux_config.yaml`. It will also contain a number of other subdirectories `_SUMMARIES`, `_PLASMIDS`, `_CHROMOSOMES` with parsed summary outputs and parsed specific plasmids and chromosome assemblies for Unicycler and Dragonflye (this made the assessment a lot easier and automated). To untar e.g. `tar -xzf hybracter_benchmarking_results_v0.7.0.tar.gz`
创建时间:
2024-04-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作