selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_gold_rewards
Source: Hugging Face. Updated: 2025-01-07. Indexed: 2025-04-26.
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_gold_rewards
Description:
The dataset contains the following fields: index, prompt, answer sequence, correct answer, proxy label, and second-round rewards. It provides a training split of 5,000 examples, with a dataset size of 13,477,532 bytes and a download size of 4,663,294 bytes.
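To put the reported sizes in perspective, the short sketch below derives the average size per example (roughly 2.7 KB) and the download-to-dataset size ratio (about 35%) from the figures in the description. It also includes a hypothetical helper, `load_train_split`, which assumes the Hugging Face `datasets` library is installed and that the repository is still available under the ID shown in the download link. The listing describes the fields but does not give exact column names, so the helper makes no assumption about them.

```python
# Figures copied from the dataset description above.
NUM_EXAMPLES = 5000
DATASET_SIZE_BYTES = 13_477_532
DOWNLOAD_SIZE_BYTES = 4_663_294

# Repository ID taken from the download link.
REPO_ID = "selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_gold_rewards"


def bytes_per_example(total_bytes: int, num_examples: int) -> float:
    """Average number of bytes per example."""
    return total_bytes / num_examples


def download_ratio(download_bytes: int, dataset_bytes: int) -> float:
    """Download size as a fraction of the full dataset size."""
    return download_bytes / dataset_bytes


def load_train_split():
    """Hypothetical helper: load the training split from the Hugging Face Hub.

    Assumes the `datasets` library is installed and the repository is
    reachable. The import is done lazily so the arithmetic above works
    without the library.
    """
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")


if __name__ == "__main__":
    avg = bytes_per_example(DATASET_SIZE_BYTES, NUM_EXAMPLES)
    ratio = download_ratio(DOWNLOAD_SIZE_BYTES, DATASET_SIZE_BYTES)
    print(f"average bytes per example: {avg:.1f}")
    print(f"download / dataset size:   {ratio:.3f}")
```

After calling `load_train_split()`, inspecting `column_names` on the returned object is a reasonable way to confirm the actual field names before relying on them.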
Provider:
selfcorrexp2



