RLHFlow/numia_prompt_ppo
Source: Hugging Face | Updated: 2025-02-13 | Indexed: 2025-02-15
Download link:
https://hf-mirror.com/datasets/RLHFlow/numia_prompt_ppo
Description:
The dataset includes the following fields: data source, ability, reward model (including ground truth and style), and problem. It is split into a training set of 400,000 examples and a test set of 4,000 examples. The total download size is 56,062,366 bytes (about 56 MB), and the total storage size is 109902972.2642616 bytes (about 110 MB).
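The field layout described above can be sketched as a simple record check. This is a minimal illustration, not the dataset's official schema: the column names `data_source`, `ability`, `reward_model`, `ground_truth`, `style`, and `problem` are assumptions inferred from the description, and the sample record is invented.

```python
from typing import Any

# Field names below are assumptions inferred from the description;
# check the dataset card for the actual column names.
EXPECTED_FIELDS = {"data_source", "ability", "reward_model", "problem"}
EXPECTED_REWARD_KEYS = {"ground_truth", "style"}

# Split sizes as stated in the description.
SPLIT_SIZES = {"train": 400_000, "test": 4_000}


def is_valid_record(record: dict[str, Any]) -> bool:
    """Return True if the record has exactly the expected top-level and nested keys."""
    if set(record) != EXPECTED_FIELDS:
        return False
    reward = record["reward_model"]
    return isinstance(reward, dict) and set(reward) == EXPECTED_REWARD_KEYS


# A made-up record for illustration only; it is not taken from the dataset.
example = {
    "data_source": "hypothetical_source",
    "ability": "math",
    "reward_model": {"ground_truth": "4", "style": "rule"},
    "problem": "What is 2 + 2?",
}

# Share of all examples that fall in the test split.
test_fraction = SPLIT_SIZES["test"] / sum(SPLIT_SIZES.values())
```

With the Hugging Face `datasets` library installed, real records could presumably be obtained via `load_dataset("RLHFlow/numia_prompt_ppo")` and passed through the same check, adjusting the assumed field names if they differ.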
Provider:
RLHFlow



