hssarah/mil_qwen_dpo_1112_304
收藏Hugging Face2025-11-12 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/hssarah/mil_qwen_dpo_1112_304
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:问题(question)、两个可能的回答(response_j和response_k)以及带提示的提示语(prompt_with_hint)。数据集仅包含训练集,共有304个示例。
The dataset includes four fields: question, two possible responses (response_j and response_k), and a prompt with a hint. The dataset consists only of a training set with a total of 304 examples.
提供机构:
hssarah



