five

Data supporting 'Convergent somatic mutations in metabolism genes in chronic liver disease'

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/283gy325fk
下载链接
链接失效反馈
官方服务:
资源简介:
This is an RData file containing a data-frame of all somatic mutations (base substitutions and small indels) from the samples described in the paper: Stanley WK Ng et al, 'Convergent somatic mutations in metabolism genes in chronic liver disease'. Nature 2021; 598(7881): 473-478. ( http://dx.doi.org/10.1038/s41586-021-03974-6 ). The columns of the data-frame follow the usual VCF conventions for genome position numbering and describing the ref and alt entries. The 'id' column encodes the sample ID, followed by underscore, followed by the specific branch on the phylogenetic tree the mutation was assigned to (in order to avoid double-counting of mutations). the column 'vaf' contains the variant allele fraction of the mutation. The genome build for the position co-ordinates is GRCh37.

本数据集为RData格式文件,包含Stanley WK Ng等发表于《Nature》2021年第598卷第7881期473-478页的论文《慢性肝病代谢基因的趋同性体细胞突变》(DOI链接:http://dx.doi.org/10.1038/s41586-021-03974-6)中所述样本的全部体细胞突变(碱基替换与小型插入缺失)数据框。 该数据框的列遵循常规的变异识别格式(Variant Call Format,VCF)规范,用于基因组位置编号以及参考碱基(ref)与变异碱基(alt)的描述。其中'id'列编码样本ID,其后以下划线分隔该突变被分配至的系统发育树特定分支,以避免突变重复计数。'vaf'列包含该突变的变异等位基因频率(variant allele fraction,VAF)。坐标对应的基因组组装版本为GRCh37。
创建时间:
2023-01-24
二维码
社区交流群
二维码
科研交流群
商业服务