saepark/hh-rlhf-single-turn-RM-train-furthersplit-RM-train-actual
收藏Hugging Face2025-10-29 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/saepark/hh-rlhf-single-turn-RM-train-furthersplit-RM-train-actual
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了对话提示、提示ID、选中与未选中的内容及其角色、对话消息、选中内容的得分、未选中内容的得分以及额外信息(如来源)。数据集分为训练集,提供了训练集的字节大小和示例数量。
The dataset includes conversation prompts, prompt IDs, chosen and rejected content with their roles, conversation messages, scores for chosen content, scores for rejected content, and additional information such as source. The dataset is split into a training set, with the byte size and number of examples provided for the training set.
提供机构:
saepark



