qwselfcorr/qwen2_it_math_test_tmp07_external_orm
收藏Hugging Face2025-02-03 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/qwselfcorr/qwen2_it_math_test_tmp07_external_orm
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个字段:索引(idx),提示信息(my_prompt),首次奖励(first_rewards),代理标签(proxy_label)和真实标签(gt)。数据集分为训练集,共有20000个样本,文件大小为74066364字节。
The dataset includes five fields: index (idx), prompt information (my_prompt), first reward (first_rewards), proxy label (proxy_label), and ground truth label (gt). The dataset is split into a training set with a total of 20,000 samples, with a file size of 74,066,364 bytes.
提供机构:
qwselfcorr



