five

High quality variant calls from multiple dog genome project - Run1

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://www.omicsdi.org/dataset/eva/PRJEB24066
下载链接
链接失效反馈
官方服务:
资源简介:
High quality variant discovery using multiple dog breeds - Run1: Paired-end fastq files were aligned to Canis lupus familaris reference genome version 3.1 using bwa mem. Duplicates were marked using Picardtools Markdup. Local realignment around indels was performed using GATK tools RealignerTargetCreator and IndelRealigner. Variant calling was done using GATK Unified Genotyper. Filters were applied based GATK best practice. These filters included remove variants with 2 or more alleles, remove variants never observed on forward or reverse strands, remove variants with overall quality less than phred score 20, remove variants with mapping quality less than phred score 30, remove variants with the same base pair position, remove lower quality indel when indels closer than 10 base pairs, remove lower quality variant where variants closer than 3 base pairs, remove SNP within 5bp of an indel. An intersect set containing those variants concordant between Samtools and GATK predictions was extracted from this union set using GATK SelectVariants to produce the final VCF files. (Software: bwa-0.7.12,GATK-3.6,samtools-1.3+htslib-1.2.1)
创建时间:
2018-01-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作