MongoDB database dump for the analysis of the current sustainability state of research software
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/4260760
下载链接
链接失效反馈官方服务:
资源简介:
This data set is the MongoDB dump (bson files) of the data created and analyzed with the rsps framework. In the first step, a research subject is assigned to the research software repositories. Afterwards, the current sustainability state is evaluated. The data set comprises the following six bson files:
repositories: metadata, received from the GitHub REST API, for repositories containing the search terms "doi+10" or "doi+10+in:readme", additional information are the request date, the contained search term, and the repository hosting service, in this case for all repositories "github". For repositories the Readme files are available.
publications: metadata of publications, published on arXiv and ACM, that contain the search term "github.com".
rs_repositories: research software candidates containing a DOI or that are referenced by the publications contained in the publications data set.
rs_artifacts: research software artifacts that are referenced in the harvested GitHub repositories by a DOI and the harvested publications.
publication_subjects: All Science Journal Classification (ASJC) of Scopus combined with the Scopus source list and Scopus book title list (https://www.scopus.com/home.uri)
arxiv_subjects: arXiv taxonomy complemented with the ASJC research subject.
创建时间:
2024-07-19



