anthj/dpo_mw_2
Hugging Face | Updated 2024-07-08 | Indexed 2024-07-22
Download link:
https://hf-mirror.com/datasets/anthj/dpo_mw_2
Description:
This dataset contains questions, each paired with two candidate responses: the preferred response (chosen) and the dispreferred response (rejected). It is divided into a training set of 1,157,600 samples and an evaluation set of 289,400 samples. Each sample consists of one question and its two responses.
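The sample layout described above can be sketched as a small record type. This is a minimal illustration only: the field names `question`, `chosen`, and `rejected` are assumptions taken from the wording of the description, not a verified schema, and `PreferencePair`, `split_fractions`, and `is_valid` are hypothetical helpers. The split sizes are the ones stated in the description; they sum to 1,447,000 samples, which works out to an 80/20 train/evaluation split.

```python
from dataclasses import dataclass

# Split sizes as stated in the dataset description.
TRAIN_SIZE = 1_157_600
EVAL_SIZE = 289_400


@dataclass(frozen=True)
class PreferencePair:
    """One sample: a question with a preferred and a dispreferred response.

    Field names are assumed from the description, not from a verified schema.
    """
    question: str
    chosen: str
    rejected: str


def split_fractions(train: int, evaluation: int) -> tuple[float, float]:
    """Return the (train, evaluation) share of the total sample count."""
    total = train + evaluation
    return train / total, evaluation / total


def is_valid(pair: PreferencePair) -> bool:
    """Sanity check: all fields are non-empty and the two responses differ."""
    fields = (pair.question, pair.chosen, pair.rejected)
    return all(f.strip() for f in fields) and pair.chosen != pair.rejected


if __name__ == "__main__":
    train_frac, eval_frac = split_fractions(TRAIN_SIZE, EVAL_SIZE)
    print(f"total={TRAIN_SIZE + EVAL_SIZE} train={train_frac:.2f} eval={eval_frac:.2f}")
```

A check like `is_valid` is one way to filter out degenerate pairs (for example, identical chosen and rejected responses) before using the samples for preference training.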
Provider:
anthj



