HelpSteer2
收藏arXiv2024-06-13 更新2024-06-21 收录
下载链接:
https://huggingface.co/datasets/nvidia/HelpSteer2
下载链接
链接失效反馈官方服务:
资源简介:
HelpSteer2是由NVIDIA创建的一个开源数据集,旨在训练高性能的奖励模型,以指导大型语言模型生成符合人类偏好的高质量响应。该数据集包含10,681对响应,远少于现有数据集,但效率极高。数据集主要来源于ShareGPT平台,涵盖了多样化的实际应用场景。创建过程中,通过BERTopic和Nemotron-2-43B评估复杂度,确保数据质量。HelpSteer2的应用领域广泛,主要用于优化语言模型的对齐技术,解决模型响应与人类偏好不一致的问题。
HelpSteer2 is an open-source dataset developed by NVIDIA, designed to train high-performance reward models that guide large language models (LLMs) to generate high-quality responses aligned with human preferences. This dataset contains 10,681 response pairs, which is far fewer than existing datasets but boasts extremely high efficiency. It is primarily sourced from the ShareGPT platform and covers diverse real-world application scenarios. During its creation, BERTopic and Nemotron-2-43B were used to evaluate complexity to ensure data quality. HelpSteer2 has a wide range of application scenarios, mainly used to optimize the alignment techniques of language models and address the mismatch between model responses and human preferences.
提供机构:
NVIDIA
创建时间:
2024-06-13



