yuruny/agentic-sudoku-Markov_qwen2.5-3B-it-1e-6_9x9_6-6_gt-SFT_ans1-markovian-eval_results

Name: yuruny/agentic-sudoku-Markov_qwen2.5-3B-it-1e-6_9x9_6-6_gt-SFT_ans1-markovian-eval_results
Creator: yuruny
Published: 2025-12-11 14:32:52
License: 暂无描述

Hugging Face2025-12-11 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/yuruny/agentic-sudoku-Markov_qwen2.5-3B-it-1e-6_9x9_6-6_gt-SFT_ans1-markovian-eval_results

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个步骤，每个步骤包含动作、聊天完成（包括内容和角色）、完成状态、mc_return、模型响应、观察和奖励等子特征。此外，还有一个顶层的奖励特征。数据集有一个名为train的训练分割，包含100个示例，总大小为223,689字节，下载大小为37,248字节。数据集的结构表明其用于训练目的。

The dataset contains multiple steps, each with sub-features such as action, chat_completions (including content and role), done, mc_return, model_response, observation, and reward. Additionally, there is a top-level reward feature. The dataset has a single split named train for training purposes, consisting of 100 examples, with a total size of 223,689 bytes and a download size of 37,248 bytes.

提供机构：

yuruny