CodeDPO/rlhf_dataset_20250126_openrlhf_format
Source: Hugging Face | Updated: 2025-01-26 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/CodeDPO/rlhf_dataset_20250126_openrlhf_format
Description:
The dataset includes fields such as questions, test cases, inferences, and context messages. It was generated by filtering with Qwen Coder 32B Instruct for test cases and accuracies. It consists of a single training split with 84,924 examples, totaling 1,136,916,895 bytes (about 1.14 GB).
Provider:
CodeDPO
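
The description above lists the fields only loosely, so the exact column names are not known from this page. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the training split is exposed under the name `train` (an assumption, not confirmed by the listing). It streams a single record to reveal the actual column names rather than guessing them, and derives the average record size from the figures stated above. The helper names `average_example_bytes` and `peek_column_names` are hypothetical and exist only for illustration.

```python
# Figures copied from the dataset description above.
DATASET_ID = "CodeDPO/rlhf_dataset_20250126_openrlhf_format"
NUM_EXAMPLES = 84_924
SIZE_BYTES = 1_136_916_895


def average_example_bytes(total_bytes: int, num_examples: int) -> float:
    """Return the mean size of one example in bytes."""
    if num_examples <= 0:
        raise ValueError("num_examples must be positive")
    return total_bytes / num_examples


def peek_column_names(dataset_id: str = DATASET_ID, split: str = "train") -> list:
    """Stream one record and return its column names, sorted.

    Assumption: the split is named "train". Streaming avoids downloading
    the full dataset just to inspect its schema.
    """
    # Imported lazily so the arithmetic helper works without the library.
    from datasets import load_dataset

    stream = load_dataset(dataset_id, split=split, streaming=True)
    first_record = next(iter(stream))
    return sorted(first_record.keys())
```

Calling `average_example_bytes(SIZE_BYTES, NUM_EXAMPLES)` gives roughly 13 KB per example, which suggests that individual records carry fairly long test cases or message contexts. Calling `peek_column_names()` requires network access to the Hugging Face Hub or a mirror.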



