selfcorrexp2/orm-less-corr-scaling-all-yes
收藏Hugging Face2025-01-09 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/orm-less-corr-scaling-all-yes
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如代理奖励(proxy_reward)、索引(idx)、提示(prompt)、答案(answers)、真实标签(gt)、二次奖励(second_rewards)、二次预测(second_preds)、一次奖励(first_rewards)、一次预测(first_preds)和奖励(rewards)。数据集被分为训练集,共有735,000个示例,文件大小为1,925,777,573字节。数据集还提供了一个默认配置,指定了训练集的数据文件路径。
The dataset contains multiple fields such as proxy_reward, idx, prompt, answers, ground truth (gt), second_rewards, second_preds, first_rewards, first_preds, and rewards. The dataset is split into a training set with a total of 735,000 examples and a file size of 1,925,777,573 bytes. The dataset also provides a default configuration specifying the data file path for the training set.
提供机构:
selfcorrexp2



