Cardinality of selected sets of minimal absent words in four human genome assemblies.
收藏Figshare2015-12-02 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/_Cardinality_of_selected_sets_of_minimal_absent_words_in_four_human_genome_assemblies_/368977
下载链接
链接失效反馈官方服务:
资源简介:
GRCh37 is the reference human genome assembly build 37.1, HuRef is the genome of Craig Venter, NA12878 is the human genome assembly from cell line GM12878, and YH is the genome of a Han Chinese individual. For each human genome assembly, set contains all minimal absent words (MAWs) of length 11 bp, set contains all MAWs of length 50 bp, set contains all MAWs of length 100 bp, set contains all MAWs of length 300 bp, and set contains all MAWs of length 1,000 bp. The noRC columns display results without considering the reversed complement and the withRC columns display results considering the reversed complement.
GRCh37即人类参考基因组组装版本37.1;HuRef为克雷格·文特尔(Craig Venter)的个人基因组;NA12878是源自细胞系GM12878的人类基因组组装体;YH则为一名汉族个体的基因组。针对每个人类基因组组装体,本数据集包含五组最小缺失词(minimal absent words,MAWs)集合,分别涵盖长度为11 bp、50 bp、100 bp、300 bp以及1000 bp的全部MAWs。其中noRC列展示未考虑反向互补链的分析结果,withRC列则展示考虑反向互补链的分析结果。
创建时间:
2015-12-02



