violetxi/qwen4b-thinking-omni-l1_4-dpo-pairs
收藏Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/violetxi/qwen4b-thinking-omni-l1_4-dpo-pairs
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了三个主要字段:prompt、chosen和rejected。prompt字段由content和role两个子字段组成,可能表示某种对话或交互的上下文和角色信息。chosen和rejected字段可能是根据prompt字段选择的响应或选项。数据集仅包含训练集划分,共有3052个示例,总大小约为252MB。根据这些信息,可以推断这是一个用于训练某种选择或分类模型的数据集。
The dataset includes three main fields: prompt, chosen, and rejected. The prompt field consists of two sub-fields, content and role, which may represent the context and role information of some kind of dialogue or interaction. The chosen and rejected fields might be the responses or options selected based on the prompt field. The dataset contains only the training split, with a total of 3052 examples and a size of approximately 252MB. Based on this information, it can be inferred that this is a dataset for training some type of selection or classification model.
提供机构:
violetxi



