mytestdpo/morecorrmodel_gen_aug_gsm8k_generation
收藏Hugging Face2025-01-05 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/morecorrmodel_gen_aug_gsm8k_generation
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一个索引字段、提示文本、答案序列、正确答案字段,以及两轮奖励字段。数据集被划分为训练集,共有30448个示例,文件大小为428365253字节。
The dataset includes an index field, prompt text, answer sequence, ground truth field, and two rounds of reward fields. The dataset is split into a training set with a total of 30448 examples, and the file size is 428365253 bytes.
提供机构:
mytestdpo



