ROSCOE
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/facebookresearch/parlai/tree/main/projects/roscoe
下载链接
链接失效反馈官方服务:
资源简介:
该数据集旨在评估逻辑和数学推理任务,其优势在于拥有详尽的评分细则信息,这有助于进行有效的评估。该数据集的任务是对逻辑推理能力进行评价。
This dataset is intended to evaluate logical and mathematical reasoning tasks. Its core strength lies in the provision of detailed scoring rubrics, which supports effective evaluation. The tasks of this dataset aim to assess logical reasoning capabilities.



