five

7bgzf case study dataset for x64 machine

收藏
Figshare2019-05-05 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/7bgzf_case_study_dataset_for_x64_machine/8063117
下载链接
链接失效反馈
官方服务:
资源简介:
7bgzf case study datasets for the x64 machineThe included files are datasets from the UCSC Genome Browser, the 1000 Genomes Project, and Ensembl, which have been prepared as test corpuses for deflation. .bcf files have been converted from vcf.gz files using the `bcftools view -Ob` with bcftools 1.9. Direct links to the original datasets are below.http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/wgEncodeCaltechRnaSeqGm12878R1x75dAlignsRep1V2.bamhttp://hgdownload.cse.ucsc.edu/goldenPath/mm9/encodeDCC/wgEncodeUwRnaSeq/wgEncodeUwRnaSeqThymusCellPolyaMAdult8wksC57bl6AlnRep1.bamhttp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521/ALL.chr22.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gzhttp://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus.vcf.gzReferences:[1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 Update. Nucleic Acids Research. 2019.[2]1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015.[3]Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018. Nucleic Acids Research. 2018.

适用于x64架构机器的7组bgzf(Blocked GZIP)格式案例研究数据集。本数据集包含源自UCSC基因组浏览器(UCSC Genome Browser)、千人基因组计划(1000 Genomes Project)以及Ensembl数据库(Ensembl)的数据集,上述数据集已被整理为解压(deflation)测试语料库。其中的.bcf文件均通过bcftools 1.9版本执行`bcftools view -Ob`命令,从vcf.gz文件转换得到。原始数据集的直接链接如下: http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/wgEncodeCaltechRnaSeqGm12878R1x75dAlignsRep1V2.bam http://hgdownload.cse.ucsc.edu/goldenPath/mm9/encodeDCC/wgEncodeUwRnaSeq/wgEncodeUwRnaSeqThymusCellPolyaMAdult8wksC57bl6AlnRep1.bam http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521/ALL.chr22.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gz http://ftp.ensembl.org/pub/release-93/variation/vcf/mus_musculus/mus_musculus.vcf.gz 参考文献: [1] Haesussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ 等. UCSC基因组浏览器数据库:2019年更新. 《核酸研究》(Nucleic Acids Research). 2019. [2] 千人基因组项目联盟. 人类遗传变异的全球参考图谱. 《自然》(Nature). 2015. [3] Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J 等. Ensembl 2018. 《核酸研究》(Nucleic Acids Research). 2018.
创建时间:
2019-05-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作