anthj/dpo_mw
收藏Hugging Face2024-07-06 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/anthj/dpo_mw
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含问题和两个回答选项(一个被选中的回答和一个被拒绝的回答),适用于训练和评估问答系统或偏好学习模型。数据集分为训练集和评估集,分别包含1157和290个样本。
This dataset includes questions and two answer options (a chosen answer and a rejected answer), suitable for training and evaluating question-answering systems or preference learning models. The dataset is divided into a training set and an evaluation set, containing 1157 and 290 samples respectively.
提供机构:
anthj



