yuruny/agentic-sokoban-qwen2.5-3B-eval_results

Name: yuruny/agentic-sokoban-qwen2.5-3B-eval_results
Creator: yuruny
Published: 2025-12-18 22:52:41
License: 暂无描述

Hugging Face2025-12-18 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/yuruny/agentic-sokoban-qwen2.5-3B-eval_results

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含一系列步骤，每个步骤具有多个属性，包括动作、聊天完成（包含内容和角色）、完成状态、mc_return、模型响应、观察和奖励。此外，还有一个顶层的奖励特征。数据集包含一个训练分割，共有12800个示例，总大小为45273408字节。数据集用于训练目的，数据文件路径指向训练数据。

The dataset consists of a series of steps, each with multiple attributes including action, chat_completions (which contains content and role), done, mc_return, model_response, observation, and reward. Additionally, there is a top-level reward feature. The dataset includes a train split with 12,800 examples and a total size of 45,273,408 bytes. The dataset is intended for training purposes, with the data file path pointing to the training data.

提供机构：

yuruny

5,000+

优质数据集

54 个

任务类型

进入经典数据集