MoeReward/combined_rlhf_dataset_grpo_imdb_main
收藏Hugging Face2025-04-01 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/MoeReward/combined_rlhf_dataset_grpo_imdb_main
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含prompt和answer字符串对的数据集,用于训练模型进行问答或其他相关任务。数据集包含一个训练集,共有3999个样本,总大小约为2.33MB。
This dataset contains pairs of prompt and answer strings, which can be used to train models for question answering or other related tasks. The dataset includes a training set with 3999 samples, totaling approximately 2.33MB in size.
提供机构:
MoeReward



