CodeDPO/rlhf_dataset_20250126_openrlhf_format_hard_r1
收藏Hugging Face2025-01-26 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/CodeDPO/rlhf_dataset_20250126_openrlhf_format_hard_r1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了问题、测试、推理和上下文消息等字段。上下文消息进一步分为内容和角色两个子字段。数据集分为训练集,共有21262个示例,数据集大小为347920642字节。
The dataset includes fields for questions, tests, inferences, and context messages. Context messages are further divided into sub-fields of content and role. The dataset is split into a training set with a total of 21262 examples, and the dataset size is 347920642 bytes.
提供机构:
CodeDPO



