five

基于开放评审的数据集(ORB)

收藏
arXiv2023-11-30 更新2024-06-21 收录
下载链接:
https://gitlab.cern.ch/irrad/orb-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
基于开放评审的数据集(ORB)是由欧洲核子研究组织和巴黎矿业大学等机构合作创建的,旨在支持高能物理领域的科学论文和实验提案的自动评估。该数据集包含超过36,000篇科学论文及其超过89,000次评审和最终决策,数据来源于OpenReview.net和SciPost.org。ORB数据集的设计考虑了未来可能的扩展性,提供了Python代码和ETL过程以支持数据的结构化和自动更新。该数据集不仅适用于高能物理领域,也适用于研究开放科学和评审过程的影响,旨在通过提供大规模的开放评审数据,促进更客观的文档评估过程和减少潜在的偏见。

The Open Review-based Dataset (ORB) was co-created by institutions including the European Organization for Nuclear Research (CERN) and Mines Paris University, with the goal of supporting automated evaluation of scientific papers and experimental proposals in the high-energy physics domain. This dataset encompasses over 36,000 scientific papers, paired with more than 89,000 reviews and final decisions, with data sourced from OpenReview.net and SciPost.org. Designed with future scalability in mind, the ORB dataset provides Python code and ETL (Extract, Transform, Load) workflows to facilitate data structuring and automated updates. Beyond the high-energy physics field, this dataset is also applicable to research on the impacts of open science and peer review processes, and aims to promote more objective document evaluation procedures and reduce potential biases by providing large-scale open review data.
提供机构:
欧洲核子研究组织
创建时间:
2023-11-30
二维码
社区交流群
二维码
科研交流群
商业服务