PRIME-RL/EurusPRM-Stage1-Data
收藏Hugging Face2025-02-19 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/PRIME-RL/EurusPRM-Stage1-Data
下载链接
链接失效反馈官方服务:
资源简介:
EurusPRM-Stage1-Data是一个包含多个子数据集的综合数据集,这些子数据集使用不同的生成模型生成,如Llama-3.1和Qwen2.5等。每个子数据集包含的实例数量和响应实例数不同,且数据分为响应级别。数据集的字段包括整个对话内容(包括指令和模型输出)、选择标签、所属指令数据集名称以及使用的生成模型名称。
EurusPRM-Stage1-Data is a comprehensive dataset consisting of multiple sub-datasets generated by different generator models such as Llama-3.1 and Qwen2.5. Each sub-dataset contains a different number of instances and response instances, and the data is categorized at the response level. The fields of the dataset include the entire conversation content (including instructions and model outputs), selection label, the name of the instruction dataset it belongs to, and the name of the generator model used.
提供机构:
PRIME-RL



