team-suzuki/GRPO_from_SFT_004
收藏Hugging Face2025-08-20 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/team-suzuki/GRPO_from_SFT_004
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段:prompt、chosen和rejected,均为字符串类型。数据集分为训练集和测试集,训练集有10843个样本,测试集有2711个样本。数据集的下载大小为3.04MB,总大小为5.15MB。
The dataset includes three fields: prompt, chosen, and rejected, all of which are string types. It is divided into a training set and a test set, with the training set containing 10,843 samples and the test set containing 2,711 samples. The download size of the dataset is 3.04MB, and the total size is 5.15MB.
提供机构:
team-suzuki



