five

VCF, pileup, and other files for Betula analyses using ebg & GATK

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/825227
下载链接
链接失效反馈
官方服务:
资源简介:
These are the VCF, pileup, and input files files produced by the Genome Analysis ToolKit, SAMtools, and our own scripts, respectively, used for comparing the genotypes estimated by the GATK UnifiedGenotyper and the new model for allopolyploids that we introduced in our paper. Additional processing of the files was conducting using Python and R for filtering and extracting error values and read counts (scripts available on GitHub: https://github.com/pblischak/polyploid-genotyping). Main files: pendula-ug-filtered30.vcf: VCF file from GATK with all called variants for Betula pendula (diploid). filtered30-pendula.pileup: SAMtools pileup file for variant sites identified by GATK in B. pendula. pubescens-ug-filtered30.vcf: VCF file from GATK with all called variants for Betula pubescens (allotetraploid). filtered30-pubescens.pileup: SAMtools pileup file for variant sites identified by GATK in B. pubescens. Processed files: filtered30-variants.txt: tab delimited file of shared variants between B. pendula and B. pubescens. filtered30-vcf1.vcf: VCF file for B. pendula with variants extracted from filtered30-variants.txt. filtered30-vcf2.vcf: VCF file for B. pubescens with variants extracted from filtered30-variants.txt. Input files: filtered30-pubescens-tot.txt, filtered30-pubescents-alt.txt, filtered30-pubescens-err.txt: input files for running the alloSNP model in ebg. filtered30-pubescens-alloSNP-freqs2.txt, filtered30-pubescens-alloSNP-g1.txt, filtered30-pubescens-alloSNP-g2.txt: output files from the alloSNP model. filtered30-pendula-tot.txt, filtered30-pendula-alt.txt, filtered30-pendula-err.txt: input files for running the hwe model in ebg. filtered30-pendula-hwe-freqs.txt, filtered30-pendula-hwe-genos.txt: output files from the hwe model. The allele frequency estimates here were also used as the reference panel for the alloSNP model.
创建时间:
2020-01-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作