weepcat/compute_rewards_summarization_partial_reward_model_random_length-2
收藏Hugging Face2025-01-21 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/weepcat/compute_rewards_summarization_partial_reward_model_random_length-2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的对话或交互内容,每个交互分为选中的(chosen)和拒绝的(rejected)两部分,每部分都有内容(content)和角色(role)信息。此外,还为不同的模型提供了选中或拒绝内容时的奖励值。数据集仅包含训练集部分。
The dataset consists of a series of conversational or interactive content, divided into chosen and rejected parts, each with content and role information. In addition, reward values for different models are provided for chosen or rejected content. The dataset includes only the training set.
提供机构:
weepcat



