Asap7772/ogmath5_onpolicy_multiturn_seprew_prefix0.2_roll4_maxrev100
收藏Hugging Face2024-09-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Asap7772/ogmath5_onpolicy_multiturn_seprew_prefix0.2_roll4_maxrev100
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如输入、输出前缀、输出、下一个输出前缀、下一个输出、终端、完成、轨迹、步骤、标签、奖励、是否被选择、rtg、未折扣的rtg、文本等。数据集分为训练集和测试集,训练集包含815,965个样本,测试集包含824,161个样本。数据集的下载大小为402,951,758字节,总大小为15,094,329,094字节。
The dataset includes multiple features such as input, output prefix, output, next output prefix, next output, terminal, completion, trajectory, step, label, reward, is chosen, rtg, undiscounted rtg, text, etc. The dataset is divided into a training set and a test set, with the training set containing 815,965 samples and the test set containing 824,161 samples. The download size of the dataset is 402,951,758 bytes, and the total size is 15,094,329,094 bytes.
提供机构:
Asap7772



