rasdani/simplerl_qwen_level1to4
收藏Hugging Face2025-03-29 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/rasdani/simplerl_qwen_level1to4
下载链接
链接失效反馈官方服务:
资源简介:
SimpleRL-Zoo-Data数据集是一个用于简单强化学习任务的数据集,从文件名simplelr_qwen_level1to4可以推测,这个数据集可能包含了不同难度级别(level1到level4)的中文问答对,用于训练和评估模型在简单强化学习场景下的表现。
SimpleRL-Zoo-Data is a dataset for simple reinforcement learning tasks, which may include Chinese question-answer pairs at different difficulty levels (level1 to level4) for training and evaluating models performance in simple reinforcement learning scenarios.
提供机构:
rasdani



