Bmingg/DPO_2a_llama_3600_constrained_MBR_bleu
收藏Hugging Face2025-10-12 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Bmingg/DPO_2a_llama_3600_constrained_MBR_bleu
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段:prompt(提示)、chosen(选择的回答)和rejected(被拒绝的回答)。它包含一个训练集split,共有3600个示例,数据集大小为1704882字节。
The dataset includes three fields: prompt (prompt), chosen (selected response), and rejected (rejected response). It contains a training set split with 3600 examples and the dataset size is 1704882 bytes.
提供机构:
Bmingg



