Artificial Intelligence and the Fight Against COVID-19
Source: DataCite Commons. Updated: 2020-08-25. Indexed: 2024-07-28.
Download link:
https://figshare.com/articles/Artificial_Intelligence_and_the_Fight_Against_COVID-19/12479570/4
Description:
Datasets analysed in a paper mapping AI research activity against COVID-19. Includes:
- rxiv metadata: A dataset with metadata about 1.8 million papers from arXiv, bioRxiv and medRxiv as of the end of May 2020, enriched with dummy variables indicating whether each paper is related to AI and/or COVID-19 research (updated 22/06/2020 to fix some ids).
- rxi_geo: A dataset with geographical metadata for papers, based on the institutional affiliations of their authors after matching with the GRID database.
- covid_semantic: A dataset with topic information about COVID-19 papers based on a semantic analysis of their abstracts, including the clusters into which papers have been classified and their topic mixes (updated 22/06/2020 to fix some ids).
- citation_metadata: Two JSON objects. One contains a lookup between COVID-19-related papers in the rXiv corpus and the papers they cite. The other contains metadata about the cited papers, including their fields of study.
- mag_fos: A dataset with the Microsoft Academic Graph field of study hierarchy we use in our analysis (added 22 June 2020).
Each zipped folder includes a data dictionary.
For information about data processing and analysis, see: https://github.com/nestauk/ai_covid_19
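The citation_metadata item describes two JSON objects: a lookup from COVID-19 papers to the papers they cite, and metadata about the cited papers including their fields of study. A minimal sketch of how the two might be combined to count fields of study across cited papers is given here. The key names (`fields_of_study`), the paper ids, and the dict-of-lists layout are all assumptions made for illustration; the actual structure is defined in the data dictionary shipped with the zipped folder.

```python
from collections import Counter

# Hypothetical stand-ins for the two JSON objects. The real key names and
# layout are documented in the data dictionary and may differ.
citation_lookup = {
    "paper_a": ["ref_1", "ref_2"],
    "paper_b": ["ref_2", "ref_3"],
}
cited_metadata = {
    "ref_1": {"fields_of_study": ["Medicine"]},
    "ref_2": {"fields_of_study": ["Computer science", "Medicine"]},
    # "ref_3" is deliberately absent to show how missing metadata is skipped.
}


def count_cited_fields(lookup, metadata):
    """Count how often each field of study appears among cited papers.

    A cited paper is counted once per citing paper, so a reference cited by
    two COVID-19 papers contributes its fields twice.
    """
    counts = Counter()
    for cited_ids in lookup.values():
        for cited_id in cited_ids:
            record = metadata.get(cited_id)
            if record is None:
                continue  # no metadata available for this cited paper
            counts.update(record.get("fields_of_study", []))
    return counts


field_counts = count_cited_fields(citation_lookup, cited_metadata)
print(field_counts.most_common())
```

With the real files, the two dictionaries would instead be read from the unzipped JSON files with `json.load`, after checking the data dictionary for the actual field names.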
Provider:
figshare
Created:
2020-06-22



