1231czx/self_corr_first_wrong_qwenbase_prompt1_gen1
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/1231czx/self_corr_first_wrong_qwenbase_prompt1_gen1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个字段:索引(idx),提示文本(prompt),答案序列(answers),正确答案(gt)和真实奖励(true_reward)。数据集被划分为训练集,共有1752个示例,占用字节数为93985726。数据集提供了一个默认配置,其中指定了训练集的数据文件路径。
The dataset includes five fields: index (idx), prompt text (prompt), answer sequence (answers), correct answer (gt), and true reward (true_reward). The dataset is split into a training set with a total of 1752 examples, occupying 93985726 bytes. The dataset provides a default configuration, which specifies the path to the data files for the training set.
提供机构:
1231czx



