scottgeng00/olmo-3-rlvr-nonreasoner_integration_mix-deltas

Name: scottgeng00/olmo-3-rlvr-nonreasoner_integration_mix-deltas
Creator: scottgeng00
Published: 2025-09-16 00:43:49
License: 暂无描述

Hugging Face2025-09-16 更新2025-10-25 收录

下载链接：

https://hf-mirror.com/datasets/scottgeng00/olmo-3-rlvr-nonreasoner_integration_mix-deltas

下载链接

链接失效反馈

官方服务：

资源简介：

这是一个包含prompt、选择(chosen)和拒绝(rejected)内容及其角色的数据集，用于训练和评估模型。数据集分为训练集，共有190000个示例，每个示例包含了prompt文本、选择内容、拒绝内容、选择模型、拒绝模型、数据集来源和真实标签。

This dataset includes prompt, chosen and rejected content along with their roles, used for training and evaluating models. The dataset is split into a training set with a total of 190,000 examples, each containing prompt text, chosen content, rejected content, chosen model, rejected model, dataset source, and ground truth label.

提供机构：

scottgeng00

5,000+

优质数据集

54 个

任务类型

进入经典数据集