cchoi1/humaneval_qwen7b_att_iter0_ppo_att50_sol50_relabeled_dpo_5000
Source: Hugging Face. Updated 2025-04-09; indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/cchoi1/humaneval_qwen7b_att_iter0_ppo_att50_sol50_relabeled_dpo_5000
Description:
The dataset has four fields: prompt, chosen, rejected, and task_id, all of type string. It is split into a training set of 5000 examples and a test set of 1000 examples. The total dataset size is listed as 9985239.1491808 bytes, and the download size as 289687 bytes.
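To make the record layout concrete, here is a minimal sketch of a check that a row carries the four documented string fields. The helper name `is_valid_record` and the sample row are hypothetical, invented for illustration; only the field names and their string type come from the description above.

```python
from typing import Any, Mapping

# Field names taken from the dataset description; all are documented as strings.
EXPECTED_FIELDS = ("prompt", "chosen", "rejected", "task_id")


def is_valid_record(record: Mapping[str, Any]) -> bool:
    """Return True if the record has exactly the four documented string fields."""
    return set(record) == set(EXPECTED_FIELDS) and all(
        isinstance(record[name], str) for name in EXPECTED_FIELDS
    )


# Hypothetical row, made up for illustration; not taken from the dataset.
example = {
    "prompt": "def add(a, b):\n    ...",
    "chosen": "    return a + b",
    "rejected": "    return a - b",
    "task_id": "HumanEval/0",
}

print(is_valid_record(example))
```

Rows could presumably be obtained with the Hugging Face `datasets` library via `load_dataset("cchoi1/humaneval_qwen7b_att_iter0_ppo_att50_sol50_relabeled_dpo_5000")`; the exact split names are an assumption and should be checked against the dataset card.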
Provider:
cchoi1



