Ashley37sky/cs329x_reward_pair_data
收藏Hugging Face2024-12-04 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Ashley37sky/cs329x_reward_pair_data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含用户与模型交互的数据,具体特征包括用户ID、用户提示、选择的模型响应、被拒绝的模型响应、选择的模型响应得分、被拒绝的模型响应得分、用户信息、选择的模型响应内容及角色、被拒绝的模型响应内容及角色等。数据集分为训练集和测试集,分别包含36107和9080个样本。
This dataset contains data on user interactions with models, including features such as user ID, user prompt, chosen model response, rejected model response, score for the chosen response, score for the rejected response, user information, content and role of the chosen response, and content and role of the rejected response. The dataset is divided into training and test sets, containing 36,107 and 9,080 samples respectively.
提供机构:
Ashley37sky



