7bgzf case study dataset for x64 machine
收藏Figshare2019-05-05 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/7bgzf_case_study_dataset_for_x64_machine/8063117
下载链接
链接失效反馈官方服务:
资源简介:
7bgzf case study datasets for the x64 machineThe included files are datasets from the UCSC Genome Browser, the 1000 Genomes Project, and Ensembl, which have been prepared as test corpuses for deflation. .bcf files have been converted from vcf.gz files using the `bcftools view -Ob` with bcftools 1.9. Direct links to the original datasets are below.http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/wgEncodeCaltechRnaSeqGm12878R1x75dAlignsRep1V2.bamhttp://hgdownload.cse.ucsc.edu/goldenPath/mm9/encodeDCC/wgEncodeUwRnaSeq/wgEncodeUwRnaSeqThymusCellPolyaMAdult8wksC57bl6AlnRep1.bamhttp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521/ALL.chr22.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gzhttp://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus.vcf.gzReferences:[1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 Update. Nucleic Acids Research. 2019.[2]1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015.[3]Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018. Nucleic Acids Research. 2018.
适用于x64架构机器的7组bgzf(Blocked GZIP)格式案例研究数据集。本数据集包含源自UCSC基因组浏览器(UCSC Genome Browser)、千人基因组计划(1000 Genomes Project)以及Ensembl数据库(Ensembl)的数据集,上述数据集已被整理为解压(deflation)测试语料库。其中的.bcf文件均通过bcftools 1.9版本执行`bcftools view -Ob`命令,从vcf.gz文件转换得到。原始数据集的直接链接如下:
http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/wgEncodeCaltechRnaSeqGm12878R1x75dAlignsRep1V2.bam
http://hgdownload.cse.ucsc.edu/goldenPath/mm9/encodeDCC/wgEncodeUwRnaSeq/wgEncodeUwRnaSeqThymusCellPolyaMAdult8wksC57bl6AlnRep1.bam
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521/ALL.chr22.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gz
http://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus.vcf.gz
参考文献:
[1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ 等. UCSC基因组浏览器数据库:2019年更新. 《核酸研究》(Nucleic Acids Research). 2019.
[2] 千人基因组项目联盟. 人类遗传变异的全球参考图谱. 《自然》(Nature). 2015.
[3] Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J 等. Ensembl 2018. 《核酸研究》(Nucleic Acids Research). 2018.
创建时间:
2019-05-05



