RLHFlow/self_rewarding_rl_prompt
收藏Hugging Face2025-03-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/RLHFlow/self_rewarding_rl_prompt
下载链接
链接失效反馈官方服务:
资源简介:
该数据集来自AI-MO/NuminaMath-7B-CoT,由Prime团队处理,但具体内容描述未提供。
The dataset is from AI-MO/NuminaMath-7B-CoT and was processed by the Prime team, but no specific description of the dataset content is provided.
提供机构:
RLHFlow



