HH-RLHF dataset
Source: arXiv; indexed 2025-09-30
Download link:
https://github.com/anthropics/hh-rlhf
Resource description:
This dataset contains 9,662 controversial social issues and is designed to evaluate the social consistency of language models. It is also used to measure how well language models handle socially relevant questions. This task is referred to as social consistency evaluation.
Provider:
Anthropic



