five

DBLP Publications Records and ACM Metadata for SIGWEB Conferences

收藏
Mendeley Data2024-03-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/dn5d8fbkb9
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset ”DBLP-SIGWEB.zip” is derived from September 17, 2015 snapshot of dblp bibliography database. It contains all publications and authors records (available in dblp data and ACM metadata) of 7 ACM SIGWEB conferences (HT, DL, DocEng, WebSci, CIKM, WSDM, UMAP) dblp-sigweb.sql file creates 15 tables in mysql. Followings are the list and description of all attributes and tables used in the dataset. Same attributes used in different tables are listed only once. 1. Table- papers dblp_key- unique id of each publication in dblp database crossref- unique id of each conference in dblp database doi- unique doi url to publisher page paper_id- unique id of each article in acm digital library (DL) cite_count- number of citations for each article calculated for the papers published in acm DL pages- number of pages for each article in conference proceedings conf_id- unique id of each conference in acm DL funding- funding source information of article. NULL- if no funcding source available 2. Table- paper_authors author_id- unique id of an author in acm DL affiliation- affiliation information of author for associated article 3. Table- concepts concept- concepts in an article- tagged by ACM 4. Table- author_tags author_tag- Keywords/tags provided by authors 5. cited_by paper_id- acm DL id of article A to be cited cite_id- unique id of article that has cited article A 6. paper_references refer_id- unique id of the articles (published in sigweb conferences) cited in article A. 7. Table- conferences dblp_key- unique id of each conference in dblp database year- year of the conference publisher- publisher name of each conference (ACM, Springer, IEEE etc.) title- full name of the conference proceeding doi- unique doi url to the conference publisher page 8. Table- general_chairs, program_chairs, editors author_id- unique id of author affiliation- affiliation of author 9. authors_affiliation_history, colleagues author_id- unique id of author A in ACM DL position- index of affiliation- starts from 0 affiliation- lists all affiliations of an author colleague_id- lists acm IDs of all authors publishing papers in ACM co-authored with A. 11. authors_info author_name- full name of author acquired from ACM publisher page year_first- year of first article publication in ACM year_last- year of recent article publication in ACM pub_count- total number of publciations in ACM DL cite_count- total number of citations mentioned in ACM publciations avg_cite- average number of citation in ACM publications 12. affiliations_info affiliation- name of the affiliation affiliation_type- type of affiliatioin (Industry, Academic Institution) city, state, country- geographical location of affiliation lat, lng- geocodes of affiliation 13. Table- acceptance rate conf_id- acm id of conference dblp_key- dblp id of a conference submitted- #submission received in conf X in year Y accepted- #accepted papers in conf X in year Y rate- acceptance rate of conf X in year Y.

数据集"DBLP-SIGWEB.zip"源自2015年9月17日快照版dblp文献数据库。该数据集涵盖7个ACM SIGWEB会议(HT、DL、DocEng、WebSci、CIKM、WSDM、UMAP)的全部出版物与作者记录,数据取自dblp数据库及ACM元数据。文件dblp-sigweb.sql可在MySQL中创建15张数据表。以下为本数据集所用全部数据表及属性的列表与说明,不同数据表中重复出现的属性仅列出一次。 1. 表papers: dblp_key:dblp数据库中每篇出版物的唯一标识符 crossref:dblp数据库中每个会议的唯一标识符 doi:指向出版商页面的唯一DOI网址 paper_id:ACM数字图书馆(Digital Library, DL)中每篇文章的唯一标识符 cite_count:针对ACM DL发表论文计算得到的单篇文章被引次数 pages:会议论文集中单篇文章的总页数 conf_id:ACM DL中每个会议的唯一标识符 funding:文章的资助来源信息,若无资助来源则为NULL值 2. 表paper_authors: author_id:ACM DL中作者的唯一标识符 affiliation:对应文章作者的所属机构信息 3. 表concepts: concept:由ACM标注的文章主题概念 4. 表author_tags: author_tag:作者提供的关键词/标签 5. 表cited_by: paper_id:待被引文章A的ACM DL标识符 cite_id:引用文章A的文章的唯一标识符 6. 表paper_references: refer_id:文章A所引用的、发表于SIGWEB会议的文章的唯一标识符 7. 表conferences: dblp_key:dblp数据库中每个会议的唯一标识符 year:会议举办年份 publisher:各会议的出版商名称(如ACM、Springer、IEEE等) title:会议论文集的完整名称 doi:指向会议出版商页面的唯一DOI网址 8. 表general_chairs、program_chairs、editors: author_id:作者的唯一标识符 affiliation:作者的所属机构信息 9. 表authors_affiliation_history、colleagues: author_id:ACM DL中作者A的唯一标识符 position:所属机构的索引,起始值为0 affiliation:作者的全部所属机构列表 colleague_id:所有与作者A在ACM平台合作发表过论文的作者的ACM标识符 11. 表authors_info: author_name:从ACM出版商页面获取的作者全名 year_first:作者在ACM平台发表的第一篇文章的年份 year_last:作者在ACM平台发表的最新文章的年份 pub_count:作者在ACM DL发表的论文总数量 cite_count:ACM平台发表文章的总被引次数 avg_cite:ACM平台发表文章的平均被引次数 12. 表affiliations_info: affiliation:所属机构的名称 affiliation_type:所属机构类型(企业、学术机构) city、state、country:所属机构的地理位置信息 lat、lng:所属机构的地理编码坐标 13. 表acceptance_rate: conf_id:会议的ACM标识符 dblp_key:会议的dblp标识符 submitted:会议X在年份Y收到的投稿总数 accepted:会议X在年份Y收录的论文总数 rate:会议X在年份Y的录用率
创建时间:
2024-01-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作