Intelligent-Internet/II-Thought-RL-v0
Source: Hugging Face | Updated: 2025-03-28 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/Intelligent-Internet/II-Thought-RL-v0
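The link above points at a Hugging Face mirror. As a minimal sketch, assuming the `datasets` Python package is installed and that the mirror honours the `HF_ENDPOINT` environment variable read by `huggingface_hub`, the dataset could be fetched roughly as follows. The helper names `dataset_page_url` and `load_from_mirror` are hypothetical, introduced only for illustration. Split and column names are not stated in this listing, so none are assumed here.

```python
import os

REPO_ID = "Intelligent-Internet/II-Thought-RL-v0"
MIRROR = "https://hf-mirror.com"  # host taken from the download link in this listing


def dataset_page_url(repo_id: str, endpoint: str = MIRROR) -> str:
    """Build the dataset page URL for a Hub-compatible endpoint (hypothetical helper)."""
    return f"{endpoint.rstrip('/')}/datasets/{repo_id}"


def load_from_mirror(repo_id: str = REPO_ID, endpoint: str = MIRROR):
    """Load the dataset through the mirror (hypothetical helper).

    Needs network access and the `datasets` package. HF_ENDPOINT is read when
    huggingface_hub is first imported, so this should be called before anything
    else in the process imports it.
    """
    os.environ["HF_ENDPOINT"] = endpoint
    from datasets import load_dataset  # imported lazily, after the endpoint is set

    # Assumption: the repository loads without an explicit configuration name.
    return load_dataset(repo_id)


if __name__ == "__main__":
    print(dataset_page_url(REPO_ID))
```

If the repository defines several configurations, `load_dataset` will ask for one by name; the listing does not say which exist, so check the dataset page first.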
Overview:
II-Thought RL v0 is a large-scale, multi-task reinforcement learning dataset of high-quality question-answer pairs that have passed rigorous multi-step filtering. It covers mathematics, coding, science, and other domains, and is intended to provide diverse training material for reinforcement learning models.
Provider:
Intelligent-Internet



