sungyub/guru-logic-verl
收藏Hugging Face2025-11-06 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/sungyub/guru-logic-verl
下载链接
链接失效反馈官方服务:
资源简介:
GURU Logic VERL数据集是一个包含逻辑推理问题的样本集合,适用于逻辑推理任务的强化学习应用。数据集包含了四种类型的任务:ordering_puzzle、zebra_puzzle、arcagi1和graph_logical,每个任务类型都有对应的样本。数据集遵循VERL格式规范,提供了结构化的奖励信号和标准化的地面真相,以支持多任务训练。
The GURU Logic VERL Dataset is a collection of logic reasoning problem samples designed for reinforcement learning applications in logic reasoning tasks. The dataset includes four types of tasks: ordering_puzzle, zebra_puzzle, arcagi1, and graph_logical, each with corresponding samples. The dataset follows the VERL format specifications, providing structured reward signals and standardized ground truth to support multi-task training.
提供机构:
sungyub



