MoeReward/combined_rlhf_dataset_grpo_metamath_main
收藏Hugging Face2025-04-01 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/MoeReward/combined_rlhf_dataset_grpo_metamath_main
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个字段:提示(prompt)和回答(answer),均为文本类型。数据集分为训练集,共有3999个示例,总大小约为2335.64KB。数据集下载大小约为1446.21KB。
The dataset includes two fields: prompt and answer, both of which are text types. The dataset is split into a training set with a total of 3999 examples, with a total size of approximately 2335.64KB. The download size of the dataset is about 1446.21KB.
提供机构:
MoeReward



