scottgeng00/olmo-3-preference-mix-deltas_reasoning-chosen_qwen8b-chat35-DECON

Name: scottgeng00/olmo-3-preference-mix-deltas_reasoning-chosen_qwen8b-chat35-DECON
Creator: scottgeng00
Published: 2025-09-24 21:14:08
License: 暂无描述

Hugging Face2025-09-24 更新2025-10-25 收录

下载链接：

https://hf-mirror.com/datasets/scottgeng00/olmo-3-preference-mix-deltas_reasoning-chosen_qwen8b-chat35-DECON

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含了提示(prompt)、选中(chosen)和拒绝(rejected)的内容及角色信息，以及选中模型(chosen_model)和拒绝模型(rejected_model)的相关信息。数据集被划分为训练集(train)，共有220,840个示例，大小为约4.84GB。此外，提供了默认配置(default)，指定了训练数据文件的路径。

The dataset includes prompt, chosen, and rejected content and role information, as well as information about the chosen model (chosen_model) and the rejected model (rejected_model). The dataset is split into a training set (train) with a total of 220,840 examples and a size of approximately 4.84GB. In addition, a default configuration (default) is provided, specifying the path to the training data files.

提供机构：

scottgeng00

5,000+

优质数据集

54 个

任务类型

进入经典数据集