YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair16
Source: Hugging Face. Updated 2025-09-10; indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_ebs32_lr1e-06_epoch1.0_42_DPO_pair16
Resource description:
The dataset has three fields, prompt, chosen, and rejected, all of string type. It is split into a training set of 97,584 samples and a test set of 5,120 samples. The README does not state the dataset's purpose or background.
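As a rough illustration of the schema described above, the sketch below checks that a record carries the three string fields and shows one way the dataset might be loaded. The helper names `is_valid_pair` and `load_pairs` are hypothetical, and the split names "train" and "test" are an assumption based on the description, not something the README confirms. Loading requires the Hugging Face `datasets` package and network access, so the import is deferred until `load_pairs` is called.

```python
from typing import Mapping

# Field names and split sizes as stated in the description above.
EXPECTED_FIELDS = ("prompt", "chosen", "rejected")
# Assumption: the splits are named "train" and "test".
EXPECTED_SPLIT_SIZES = {"train": 97584, "test": 5120}

REPO_ID = (
    "YuchenLi01/MATH_mathgenpair2_1.5Bpretrained_sft_lm1_"
    "ebs32_lr1e-06_epoch1.0_42_DPO_pair16"
)


def is_valid_pair(record: Mapping[str, object]) -> bool:
    """Return True if the record has exactly the three expected string fields."""
    if set(record) != set(EXPECTED_FIELDS):
        return False
    return all(isinstance(record[name], str) for name in EXPECTED_FIELDS)


def load_pairs(repo_id: str = REPO_ID):
    """Hypothetical helper: download the dataset from the Hugging Face Hub."""
    # Imported lazily so the schema check above works without the package.
    from datasets import load_dataset

    return load_dataset(repo_id)
```

Calling `load_pairs()` should return an object keyed by split name, after which the split lengths can be compared against `EXPECTED_SPLIT_SIZES` and individual rows passed to `is_valid_pair`.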
Provider:
YuchenLi01



