five

yuruny/agentic-sudoku-NonMarkov_qwen2.5-3B-it-1e-6_9x9_6-6_gt-SFT_ans1-non_markovian-eval_results_dev

收藏
Hugging Face2025-12-11 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/yuruny/agentic-sudoku-NonMarkov_qwen2.5-3B-it-1e-6_9x9_6-6_gt-SFT_ans1-non_markovian-eval_results_dev
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: steps list: - name: action dtype: string - name: chat_completions list: - name: content dtype: string - name: role dtype: string - name: done dtype: bool - name: mc_return dtype: float64 - name: model_response dtype: string - name: observation dtype: string - name: reward dtype: float64 - name: reward dtype: float64 splits: - name: train num_bytes: 203233 num_examples: 100 download_size: 36456 dataset_size: 203233 configs: - config_name: default data_files: - split: train path: data/train-* ---
提供机构:
yuruny
二维码
社区交流群
二维码
科研交流群
商业服务