webgpt-comparisons (WebGPT)
OpenXLab, indexed 2026-04-18
Download link:
https://openxlab.org.cn/datasets/OpenDataLab/webgpt-comparisons
Description:
In the WebGPT paper, the authors trained a reward model from human feedback and used it to train a long-form question-answering model to align with human preferences. This dataset contains all comparisons that had been marked as suitable for reward modeling by the end of the WebGPT project, 19,578 comparisons in total.
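To illustrate how a single comparison can drive reward-model training, here is a minimal sketch of a pairwise preference loss. It assumes a Bradley-Terry style objective, which is a common choice for comparison data. The listing does not state which loss the WebGPT authors used. The function name `pairwise_loss` and the scalar scores are hypothetical and do not correspond to fields in the dataset.

```python
import math


def pairwise_loss(reward_preferred: float, reward_rejected: float) -> float:
    """Negative log-likelihood that the preferred answer wins.

    Assumes a Bradley-Terry model: P(preferred wins) = sigmoid(r_p - r_r).
    The loss is -log(sigmoid(diff)) = log(1 + exp(-diff)).
    """
    diff = reward_preferred - reward_rejected
    # Numerically stable form of log(1 + exp(-diff)).
    return max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))


# Hypothetical reward-model scores for the two answers in one comparison.
loss_agrees = pairwise_loss(2.0, 0.0)     # model ranks the preferred answer higher
loss_disagrees = pairwise_loss(0.0, 2.0)  # model ranks the rejected answer higher
print(loss_agrees < loss_disagrees)
```

The loss is small when the model scores the human-preferred answer higher and grows as the ranking is reversed, so minimizing it over many comparisons pushes the reward model toward the human judgments.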
Provider:
OpenDataLab
Created:
2023-12-14



