five

AmelHap

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/6983102
下载链接
链接失效反馈
官方服务:
资源简介:
AmelHap 1.1.0.f4 1,328 samples 17,414,346 variants Sequencing reads were aligned to the Amel_HAv3.1 reference genome using BWA-MEM v0.7.17. Reads were sorted with SAMtools v1.9 and duplicates marked (MarkDuplicates) with GATK v4.0.11.0. Variants for each sample were called using GATK’s HaplotypeCaller with the following non-default parameters --ERC GVCF, --sample-ploidy 1 and -A AlleleFraction. Joint variant calling was performed across all samples collated for AmelHap using GATK’s GenomicDBImport and GenotypeGVCFs with --sample-ploidy 1 and a window size of 10 Mb. The AmelHap dataset comprises samples and variants that have passed a series of filters. The first filter excluded variants with quality by depth (QD) less than 20 or greater than 40, or with mapping quality (MQ) less than 50, or with a strand odds ratio (SOR) greater than 3. The second filter set genotypes with depth (DP) greater than 704 or quality (GQ) less than 40 as missing. The third filter removed monomorphic variants. The fourth and final filter retained samples and variants with a minimum call rate of 90%. Unfiltered data for all samples processed is also available in the community repository in the form of individual gVCF files and joint-called raw variants grouped by ENA project accession. Sample metadata is available here: https://doi.org/10.5281/zenodo.7030888
创建时间:
2023-05-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作