cchoi1/humaneval_qwen7b_sol_iter0_ppo_att20_sol50_relabeled_dpo_1000
收藏Hugging Face2025-03-31 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/cchoi1/humaneval_qwen7b_sol_iter0_ppo_att20_sol50_relabeled_dpo_1000
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:提示(prompt)、选择(chosen)、拒绝(rejected)和任务ID(task_id),所有字段均为字符串类型。数据集分为训练集和测试集,其中训练集有1000个示例,测试集有200个示例。数据集的总大小为2270958.0407859203字节,下载大小为44335字节。
The dataset includes four fields: prompt, chosen, rejected, and task_id, all of which are of string type. The dataset is divided into a training set and a test set, with the training set containing 1000 examples and the test set containing 200 examples. The total size of the dataset is 2270958.0407859203 bytes, with a download size of 44335 bytes.
提供机构:
cchoi1



