raca-workspace-v1/ttt-discover-circle_packing_26-qwen3-8b-v1
收藏Hugging Face2026-04-07 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/raca-workspace-v1/ttt-discover-circle_packing_26-qwen3-8b-v1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: metrics
features:
- name: step
dtype: int64
- name: avg_reward
dtype: float64
- name: best_reward
dtype: float64
- name: nonzero_frac
dtype: float64
- name: advantage_mean
dtype: float64
- name: advantage_std
dtype: float64
- name: num_unique_states
dtype: int64
splits:
- name: train
num_bytes: 56
num_examples: 1
download_size: 3565
dataset_size: 56
- config_name: tree
features:
- name: node_id
dtype: string
- name: parent_id
dtype: 'null'
- name: visit_count
dtype: int64
- name: best_value
dtype: float64
- name: reward
dtype: float64
- name: puct_score
dtype: float64
- name: code
dtype: string
- name: depth
dtype: int64
- name: was_selected
dtype: bool
- name: step
dtype: int64
splits:
- name: train
num_bytes: 69
num_examples: 1
download_size: 4448
dataset_size: 69
configs:
- config_name: metrics
data_files:
- split: train
path: metrics/train-*
- config_name: tree
data_files:
- split: train
path: tree/train-*
---
数据集信息:
- 配置名称:指标(metrics)
特征字段:
- 字段名:训练步数(step),数据类型:64位整数(int64)
- 字段名:平均奖励(avg_reward),数据类型:64位浮点数(float64)
- 字段名:最优奖励(best_reward),数据类型:64位浮点数(float64)
- 字段名:非零占比(nonzero_frac),数据类型:64位浮点数(float64)
- 字段名:优势均值(advantage_mean),数据类型:64位浮点数(float64)
- 字段名:优势标准差(advantage_std),数据类型:64位浮点数(float64)
- 字段名:唯一状态数(num_unique_states),数据类型:64位整数(int64)
划分集:
- 名称:训练集(train),字节占用量:56,样本数量:1
下载大小:3565,数据集存储大小:56
- 配置名称:树结构(tree)
特征字段:
- 字段名:节点ID(node_id),数据类型:字符串(string)
- 字段名:父节点ID(parent_id),数据类型:空类型(null)
- 字段名:访问次数(visit_count),数据类型:64位整数(int64)
- 字段名:最优价值(best_value),数据类型:64位浮点数(float64)
- 字段名:奖励(reward),数据类型:64位浮点数(float64)
- 字段名:PUCT评分(puct_score),数据类型:64位浮点数(float64)
- 字段名:代码(code),数据类型:字符串(string)
- 字段名:深度(depth),数据类型:64位整数(int64)
- 字段名:是否被选中(was_selected),数据类型:布尔型(bool)
- 字段名:训练步数(step),数据类型:64位整数(int64)
划分集:
- 名称:训练集(train),字节占用量:69,样本数量:1
下载大小:4448,数据集存储大小:69
配置项:
- 配置名称:指标(metrics),数据文件:
- 划分:训练集(train),路径:metrics/train-*
- 配置名称:树结构(tree),数据文件:
- 划分:训练集(train),路径:tree/train-*
提供机构:
raca-workspace-v1



