five

novastar111/sokoban_medium

收藏
Hugging Face2026-04-26 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/novastar111/sokoban_medium
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Sokoban Medium task_categories: - reinforcement-learning - other tags: - sokoban - planning - synthetic-data - visgym size_categories: - 100K<n<1M configs: - config_name: default data_files: - split: test path: trajectories/sokoban_medium/test/*.jsonl - split: train path: trajectories/sokoban_medium/train/*.jsonl --- # sokoban_medium Synthetic Sokoban dataset generated from the local VisGym Sokoban environment. ## Contents - `trajectories/sokoban_medium/test/*.jsonl` - `trajectories/sokoban_medium/train/*.jsonl` - `manifests/` - `metadata/` ## Generation Summary - Task: `sokoban_medium` - Raw test generated: `1000` - Final test after dedupe: `1000` - Raw train generated: `200000` - Final train after dedupe: `199437` - Train samples removed by test-hash filter: `500` - Train samples removed by train self-dedupe: `63` - Raw train chunk count: `20` ## Leakage / Hash Logic - Hash basis: `init_state` only - Test split: - raw generation first - dedupe within test - write `metadata/test_init_state_hashes.json` - Train split: - raw generation only during chunked production - final dedupe order: 1. remove records whose `init_state` hash appears in test 2. remove duplicate `init_state` hashes within train - Final leakage report: - `clean = True` - `overlap_total = 0` ## Notes - This dataset intentionally keeps the visual assets and prompt/action semantics from the integrated Sokoban environment while using the repo's current generation pipeline. - Review artifacts are not included in the upload path by default.
提供机构:
novastar111
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作