hahayhe/prm800k-step-reward
收藏Hugging Face2025-10-26 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/hahayhe/prm800k-step-reward
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:提示(prompt)、完成(completions)、标签(labels)和索引(index)。提示是字符串类型,完成是一个字符串列表,标签是一个布尔值列表,索引是整型。数据集分为训练集和测试集,训练集有400906个示例,测试集有10779个示例。
The dataset includes four fields: prompt, completions, labels, and index. Prompt is a string type, completions are a list of strings, labels are a list of booleans, and index is an integer. The dataset is split into a training set and a test set, with the training set having 400906 examples and the test set having 10779 examples.
提供机构:
hahayhe



