saurabh5/code_rlvr_mixture_dpo
收藏Hugging Face2025-11-11 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/saurabh5/code_rlvr_mixture_dpo
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含消息内容及其相关特征,如发送者角色和真实标签。它还包括示例的唯一标识符、数据来源和可能的输出。数据集分为训练集,其大小为约2.45GB,包含21291个示例。整个数据集的大小约为2.45GB,下载大小为302MB。
The dataset includes message content and related features such as sender role and ground truth labels. It also contains unique identifiers for examples, the source of the data, and potential outputs. The dataset is split into a training set, which is approximately 2.45GB in size and contains 21,291 examples. The total size of the dataset is approximately 2.45GB, with a download size of 302MB.
提供机构:
saurabh5



