Helpfulness Reward Model

Indexed from arXiv, 2025-09-30

Download link:

https://huggingface.co/Ray2333/gpt2-large-helpful-reward_model

Resource overview:

This resource is intended for evaluating the helpfulness of generated text and includes a model for reward modeling. The model is used within a reinforcement learning framework, with a particular focus on multiple objectives.
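As a rough illustration of how a reward model like this is typically queried, the sketch below loads the checkpoint from the download link with the Hugging Face transformers library and returns one scalar score for a question/answer pair. Several details are assumptions rather than facts stated on this page: the Human/Assistant prompt layout, the single-logit output head (`num_labels=1`), and the helper names `format_dialogue` and `score_response`, which are hypothetical.

```python
def format_dialogue(question: str) -> str:
    """Wrap a question in a Human/Assistant turn layout (assumed prompt format)."""
    return f"\n\nHuman: {question}\n\nAssistant:"


def score_response(
    question: str,
    answer: str,
    model_id: str = "Ray2333/gpt2-large-helpful-reward_model",
) -> float:
    """Return a scalar helpfulness score for `answer` given `question`.

    Assumes the checkpoint loads as a sequence classifier with one output logit.
    """
    # Imported lazily so format_dialogue works without torch/transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=1)
    model.eval()

    # Encode the prompt and the candidate answer as a single text pair.
    inputs = tokenizer(
        format_dialogue(question), answer, return_tensors="pt", truncation=True
    )
    with torch.no_grad():
        logits = model(**inputs).logits  # shape (1, 1) under the assumption above
    return logits[0, 0].item()
```

Under these assumptions, scoring two candidate answers to the same question with `score_response` and comparing the results gives a relative helpfulness ranking. The absolute values are not calibrated and should only be compared against each other.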
