ROPES
收藏arXiv2025-09-30 收录
下载链接:
https://allenai.org/data/ropes
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为ROPES,它是一个需要多步推理并涉及定性关系的评估数据集。在针对不可记忆子集的额外训练之后,ROPES在性能上提高了25.7%。该数据集的规模属于评估数据集,其任务类型为问答。
This dataset, named ROPES, is an evaluation dataset that requires multi-step reasoning and involves qualitative relationships. Following additional training on the non-memorizable subset, ROPES achieved a 25.7% improvement in performance. This dataset falls into the category of evaluation datasets, with its task type being question answering.



