7bgzf case study dataset for ARM machine
收藏DataCite Commons2020-08-27 更新2024-07-27 收录
下载链接:
https://figshare.com/articles/7bgzf_case_study_dataset_for_ARM_machine/8063108
下载链接
链接失效反馈官方服务:
资源简介:
7bgzf case study datasets for the ARM machine<br>The included files are datasets from the UCSC Genome Browser and Ensembl, which have been prepared as test corpuses for deflation. .bcf files have been converted from vcf.gz files using the `bcftools view -Ob` with bcftools 1.9. Direct links to the original datasets are below.<br>http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeUwRepliSeq/wgEncodeUwRepliSeqBg02esG1bAlnRep1.bam<br>http://ftp.ensembl.org/pub/release-93/variation/vcf/homo_sapiens/homo_sapiens_structural_variations.vcf.gz<br>http://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus_structural_variations.vcf.gz<br>References:[1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 Update. Nucleic Acids Research. 2019.[2] Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018. Nucleic Acids Research. 2018.<br>
适用于ARM架构机器的7组BGZF(Blocked GZIP)案例研究数据集
本数据集包含源自UCSC基因组浏览器(UCSC Genome Browser)与Ensembl数据库(Ensembl)的数据集,已被制备为解压测试专用的测试语料库。其中的BCF(Binary Variant Call Format)文件均通过BCFtools 1.9版本的`bcftools view -Ob`命令,从vcf.gz文件转换得到。
原始数据集的直接下载链接如下:
http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeUwRepliSeq/wgEncodeUwRepliSeqBg02esG1bAlnRep1.bam
http://ftp.ensembl.org/pub/release-93/variation/vcf/homo_sapiens/homo_sapiens_structural_variations.vcf.gz
http://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus_structural_variations.vcf.gz
参考文献:
[1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 Update. Nucleic Acids Research. 2019.
[2] Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018. Nucleic Acids Research. 2018.
提供机构:
figshare
创建时间:
2019-05-01



