cx-cmu/repro-rl-data
收藏Hugging Face2025-10-18 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/cx-cmu/repro-rl-data
下载链接
链接失效反馈官方服务:
资源简介:
这是为repro-rephraser-4B模型准备的强化学习训练数据集,每个数据条目包括dataman_score评分和text文本两部分。
This is the reinforcement learning training data for the repro-rephraser-4B model, with each data entry including a dataman_score and a text section.
提供机构:
cx-cmu



