novastar111/swanlake_hard
收藏Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/novastar111/swanlake_hard
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: Swanlake Hard
task_categories:
- reinforcement-learning
- other
tags:
- sokoban
- planning
- synthetic-data
- visgym
size_categories:
- 100K<n<1M
configs:
- config_name: default
data_files:
- split: test
path: trajectories/sokoban_hard/test/*.jsonl
- split: train
path: trajectories/sokoban_hard/train/*.jsonl
---
# swanlake_hard
Synthetic Sokoban hard dataset generated from the local VisGym Sokoban environment.
## Contents
- `trajectories/sokoban_hard/test/*.jsonl`
- `trajectories/sokoban_hard/train/*.jsonl`
- `manifests/`
- `metadata/`
## Generation Summary
- Task: `sokoban_hard`
- Raw test generated: `1200`
- Final test after dedupe: `1200`
- Raw train generated: `120000`
- Final train after dedupe: `112000`
- Train samples removed by test-hash filter: `0`
- Train samples removed by train self-dedupe: `8000`
- Raw train chunk count: `12`
## Leakage / Hash Logic
- Hash basis: `init_state` only
- Test split:
- raw generation first
- dedupe within test
- write `metadata/test_init_state_hashes.json`
- Train split:
- raw generation only during chunked production
- final dedupe order:
1. remove records whose `init_state` hash appears in test
2. remove duplicate `init_state` hashes within train
- Final leakage report:
- `clean = True`
- `overlap_total = 0`
## Notes
- This dataset intentionally keeps the visual assets and prompt/action semantics from the integrated Sokoban environment while using the repo's current generation pipeline.
- Review artifacts are not included in the upload path by default.
数据集名称:天鹅湖难题(Swanlake Hard)
任务类别:
- 强化学习
- 其他
数据集标签:
- 推箱子(Sokoban)
- 规划
- 合成数据
- VisGym
样本规模区间:
- 10万<n<100万
配置项:
- 配置名称:默认
数据文件:
- 数据拆分:测试集,文件路径:trajectories/sokoban_hard/test/*.jsonl
- 数据拆分:训练集,文件路径:trajectories/sokoban_hard/train/*.jsonl
# 天鹅湖难题(swanlake_hard)
本数据集为基于本地VisGym推箱子(Sokoban)环境生成的合成式推箱子难题数据集。
## 内容说明
- `trajectories/sokoban_hard/test/*.jsonl`
- `trajectories/sokoban_hard/train/*.jsonl`
- `manifests/`
- `metadata/`
## 生成概况
- 任务标识:`sokoban_hard`
- 原始测试集生成量:`1200`
- 去重后最终测试集规模:`1200`
- 原始训练集生成量:`120000`
- 去重后最终训练集规模:`112000`
- 经测试集哈希过滤移除的训练集样本数:`0`
- 经训练集自身去重移除的训练集样本数:`8000`
- 原始训练集分块数量:`12`
## 数据泄露与哈希逻辑
- 哈希基准:仅以初始状态(`init_state`)为依据
- 测试集拆分流程:
1. 先生成原始测试集
2. 对测试集内部进行去重处理
3. 写入元数据文件`metadata/test_init_state_hashes.json`
- 训练集拆分流程:
1. 仅在分块生产阶段生成原始训练集
2. 最终去重顺序:
1. 移除初始状态哈希值出现在测试集中的训练样本
2. 移除训练集内部重复的初始状态哈希值样本
- 最终数据泄露检测报告:
- 数据无泄露(`clean = True`)
- 总重叠样本数:`0`(`overlap_total = 0`)
## 补充说明
- 本数据集在使用当前仓库生成流水线的同时,保留了集成推箱子环境的视觉资源与提示/动作语义。
- 默认情况下,审核工件不会包含在上传路径中。
提供机构:
novastar111



