cchoi1/humaneval_qwen7b_sol_best_of_200_relabeled_dpo_30000
收藏Hugging Face2025-04-09 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/cchoi1/humaneval_qwen7b_sol_best_of_200_relabeled_dpo_30000
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了四个字段:prompt、chosen、rejected和task_id,均为文本类型。它被划分为训练集和测试集,训练集有30000个示例,测试集有1200个示例。数据集的总大小为32248437.02字节,下载大小为3014757字节。
The dataset consists of four fields: prompt, chosen, rejected, and task_id, all of which are string types. It is divided into a training set and a test set, with the training set containing 30,000 examples and the test set containing 1,200 examples. The total size of the dataset is 32,248,437.02 bytes, and the download size is 3,014,757 bytes.
提供机构:
cchoi1



