dsrtrain/qwq2ep_raft_iter1_gen_with_rewards
收藏Hugging Face2025-02-12 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/dsrtrain/qwq2ep_raft_iter1_gen_with_rewards
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一个索引字段、提示字段、答案序列字段、包含真实答案和风格的字段以及一个表示奖励的布尔字段。数据集被划分为训练集,共有20000个示例,文件大小为1010939991字节。
The dataset includes an index field, a prompt field, an answer sequence field, a field containing the ground truth and style, and a boolean field indicating rewards. The dataset is split into a training set with a total of 20,000 examples and a file size of 1,010,939,991 bytes.
提供机构:
dsrtrain



