pvduy/ppo_verl_math
收藏Hugging Face2024-12-31 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/pvduy/ppo_verl_math
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含问题、解决方案、最终答案、数据源和其他额外信息(如索引、分割信息、奖励模型、提示和数据来源)的结构化数据集。数据集分为训练集,共有116812个示例,数据大小为180796352字节。
This is a structured dataset containing problem, solution, final answer, data source, and additional information such as index, split, reward model, prompt, and data source. The dataset is split into a training set with a total of 116812 examples, with a data size of 180796352 bytes.
提供机构:
pvduy



