five

Impact of the COVID-19 pandemic on academic productivity

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6615683
下载链接
链接失效反馈
官方服务:
资源简介:
This data set was used to analyse changes in academic productivity due to the COVID-19 pandemic. Here we describe how those data were collected. We downloaded the arXiv dataset hosted by Kaggle on 2022 June 6. This dataset is a JSON file (named `arxiv-metadata-oai-snapshot.json`) compressed into a zip file. We extracted this file and then executed `process.py` to create the following three files: - `metadata.json`: titles, abstract, and affiliations of authors - `authors.json`: author names - `records.csv`: arXiv identifier, categories, number of authors and affiliations, abstract length   We excluded pre-prints from 2022 June when processing files. The `process.py` script also adds a 'dummy' line to the `records.csv` to make Python load arXiv identifiers as strings. These three files (`metadata.json`, `authors.json`, and `records.csv`) were used for this analysis.
创建时间:
2022-06-22
二维码
社区交流群
二维码
科研交流群
商业服务