
Table dterm_go

Source: DataCite Commons. Updated 2020-08-19. Indexed 2025-04-15.
Download link:
http://f1000research.com/articles/4-47/v1#DS1
Description:
Table dterm_go stores relationship information for pairs of MeSH D terms and Gene Ontology (GO) annotations. Each row corresponds to one pair: a MeSH term of category D, which defines a chemical (column dterm), and a GO annotation given as a 10-character GO identifier (column goterm). The attributes of each relationship are: the number of genes annotated with goterm in the NCBI gene2go dataset (column gogenes); the number of genes that are associated with articles in the NCBI gene2pubmed dataset and annotated by dterm (column genetot); the number of genes carrying both the dterm and goterm annotations (column genenum); and a comma-separated list of the Entrez identifiers of the genes counted in genenum. Column id is a unique row identifier, and column dtid is a key linking to the table mesh_terms. The table has 14,225,540 rows and 9 tab-separated columns. The uncompressed table is 1.31 GB; compressed, it takes 379 MB. The information in this table is current as of September 2013.
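
As a rough illustration of the row layout described above, the following Python sketch splits one tab-separated line into named fields. Several things here are assumptions rather than facts from the description: the column order, the name `genes` for the gene-list column, the absence of a header row, and the sample values, which are invented for the example. The description names only eight of the nine columns, so any unnamed trailing field is kept under a hypothetical `extra` key.

```python
import io

# ASSUMPTION: column names and order are guessed from the description;
# check them against the real file before relying on this.
ASSUMED_COLUMNS = ["id", "dtid", "dterm", "goterm",
                   "gogenes", "genetot", "genenum", "genes"]


def parse_row(line, columns=ASSUMED_COLUMNS):
    """Split one tab-separated line and map its fields onto column names."""
    fields = line.rstrip("\n").split("\t")
    row = dict(zip(columns, fields))
    # Any field beyond the named columns (e.g. the unnamed ninth one) is kept as-is.
    row["extra"] = fields[len(columns):]
    # The three count columns are integers.
    for key in ("gogenes", "genetot", "genenum"):
        row[key] = int(row[key])
    # The gene list is a comma-separated string of Entrez identifiers.
    row["genes"] = [int(g) for g in row["genes"].split(",") if g]
    return row


def iter_rows(handle, columns=ASSUMED_COLUMNS):
    """Stream rows one at a time, so the 1.31 GB file never sits in memory."""
    for line in handle:
        if line.strip():
            yield parse_row(line, columns)


# Invented sample row, for illustration only (not taken from the dataset).
sample = "1\t42\tAspirin\tGO:0006954\t350\t120\t3\t5742,5743,4790\n"
rows = list(iter_rows(io.StringIO(sample)))
```

With a real file, `iter_rows(open(path, encoding="utf-8"))` would replace the in-memory sample. A simple sanity check is that `genenum` equals the length of the parsed gene list in each row.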

Provider:
F1000Research
Created:
2015-02-19