five

Cardinality of selected sets of minimal absent words in four human genome assemblies.

收藏
Figshare2015-12-02 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/_Cardinality_of_selected_sets_of_minimal_absent_words_in_four_human_genome_assemblies_/368977
下载链接
链接失效反馈
官方服务:
资源简介:
GRCh37 is the reference human genome assembly build 37.1, HuRef is the genome of Craig Venter, NA12878 is the human genome assembly from cell line GM12878, and YH is the genome of a Han Chinese individual. For each human genome assembly, set contains all minimal absent words (MAWs) of length 11 bp, set contains all MAWs of length 50 bp, set contains all MAWs of length 100 bp, set contains all MAWs of length 300 bp, and set contains all MAWs of length 1,000 bp. The noRC columns display results without considering the reversed complement and the withRC columns display results considering the reversed complement.

GRCh37即人类参考基因组组装版本37.1;HuRef为克雷格·文特尔(Craig Venter)的个人基因组;NA12878是源自细胞系GM12878的人类基因组组装体;YH则为一名汉族个体的基因组。针对每个人类基因组组装体,本数据集包含五组最小缺失词(minimal absent words,MAWs)集合,分别涵盖长度为11 bp、50 bp、100 bp、300 bp以及1000 bp的全部MAWs。其中noRC列展示未考虑反向互补链的分析结果,withRC列则展示考虑反向互补链的分析结果。
创建时间:
2015-12-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作