cchoi1/humaneval_qwen32b_att_iter0_ppo_att20_sol10_v2_dpo_1000
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/cchoi1/humaneval_qwen32b_att_iter0_ppo_att20_sol10_v2_dpo_1000
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:prompt、chosen、rejected和task_id,均为文本类型。数据集分为训练集和测试集,训练集包含1000个样本,测试集包含200个样本。数据集总大小约为2.23MB。
The dataset includes four fields: prompt, chosen, rejected, and task_id, all of which are string types. The dataset is divided into training and test sets, with the training set containing 1000 samples and the test set containing 200 samples. The total size of the dataset is approximately 2.23MB.
提供机构:
cchoi1



