preference-700K
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/hendrydong/preference_700K
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多元化的开源偏好数据集集合,其中包括了如HH-RLHF、斯坦福人类偏好数据集以及HelpSteer等数据集。该数据集的主要任务是用于奖励模型的训练。
This is a diverse collection of open-source preference datasets, including HH-RLHF, Stanford Human Preference Dataset, HelpSteer, and other comparable datasets. The core application of this dataset collection is to support the training of reward models.
提供机构:
Hugging Face



