RLAIF/Value-v1-NUMINA-V1-Blocks-Merged
Source: Hugging Face. Updated: 2024-11-22. Indexed: 2025-04-08.
Download link:
https://hf-mirror.com/datasets/RLAIF/Value-v1-NUMINA-V1-Blocks-Merged
Description:
The dataset contains the following fields: problem, solution, solution steps, rtgs, undiscounted_rtgs, a correctness flag, target answer, and solution count. The listing glosses rtgs as "immediate reward rates"; the name more plausibly abbreviates returns-to-go, with undiscounted_rtgs as the undiscounted variant, although the listing does not define either field. The data is split into a training set of 63,455 examples and a test set of 557 examples.
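The listing does not say how rtgs and undiscounted_rtgs were computed. If they are read as returns-to-go over the solution steps (an assumption, not something the listing confirms), the difference between a discounted and an undiscounted variant can be sketched as below. The function name returns_to_go, the per-step rewards, and the discount factor gamma are all hypothetical and chosen only for illustration.

```python
from typing import List


def returns_to_go(rewards: List[float], gamma: float = 1.0) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for every step t.

    Hypothetical helper: the listing does not document how the
    dataset's rtgs fields were actually produced.
    """
    rtgs = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        rtgs[t] = running
    return rtgs


# Hypothetical three-step solution rewarded only at the final step.
step_rewards = [0.0, 0.0, 1.0]

undiscounted = returns_to_go(step_rewards, gamma=1.0)  # [1.0, 1.0, 1.0]
discounted = returns_to_go(step_rewards, gamma=0.5)    # [0.25, 0.5, 1.0]

print(undiscounted)
print(discounted)
```

With gamma = 1.0 every step inherits the full final reward, while gamma < 1 shrinks the value assigned to earlier steps. Whether the dataset uses a sparse final reward or this recursion at all is not stated in the listing.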
Provider:
RLAIF



