hssarah/mil_qwen_dpo_train_258
收藏Hugging Face2025-11-12 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/hssarah/mil_qwen_dpo_train_258
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:问题(question)、两个可能的回答(response_j 和 response_k)以及带提示的问题(prompt_with_hint)。数据集主要用于训练,包含258个示例。
The dataset includes four fields: question, two possible answers (response_j and response_k), and a question with a hint (prompt_with_hint). It is primarily for training and contains 258 examples.
提供机构:
hssarah



