
raca-workspace-v1/autotrainer-v0

Hugging Face | Updated 2026-04-08 | Indexed 2026-04-12
Download link:
https://hf-mirror.com/datasets/raca-workspace-v1/autotrainer-v0
Resource description:
---
license: mit
tags:
- autotrainer
- rlvr
- countdown
- agent-controlled-training
configs:
- config_name: eval_traces
  data_files:
  - split: train
    path: eval_traces/train-*
- config_name: rounds
  data_files:
  - split: train
    path: rounds/train-*
- config_name: state
  data_files:
  - split: train
    path: state/train-*
- config_name: train_traces
  data_files:
  - split: train
    path: train_traces/train-*
dataset_info:
- config_name: eval_traces
  features:
  - name: question_id
    dtype: string
  - name: input
    dtype: string
  - name: target
    dtype: int64
  - name: numbers
    dtype: string
  - name: expected_answer
    dtype: string
  - name: model_response
    dtype: string
  - name: correct
    dtype: bool
  - name: score
    dtype: float64
  - name: error_type
    dtype: string
  - name: num_tokens
    dtype: int64
  - name: round_id
    dtype: string
  splits:
  - name: train
    num_bytes: 2511714
    num_examples: 668
  download_size: 433635
  dataset_size: 2511714
- config_name: rounds
  features:
  - name: round_id
    dtype: string
  - name: config
    dtype: string
  - name: reward_function_code
    dtype: string
  - name: prompt_template
    dtype: string
  - name: agent_reasoning
    dtype: string
  - name: agent_trace
    dtype: string
  - name: training_metrics_per_step
    dtype: string
  - name: eval_accuracy
    dtype: float64
  - name: eval_error_breakdown
    dtype: string
  - name: crash_log
    dtype: string
  - name: status
    dtype: string
  splits:
  - name: train
    num_bytes: 34564
    num_examples: 5
  download_size: 24843
  dataset_size: 34564
- config_name: state
  features:
  - name: state_json
    dtype: string
  splits:
  - name: train
    num_bytes: 2441
    num_examples: 1
  download_size: 11693
  dataset_size: 2441
- config_name: train_traces
  features:
  - name: step
    dtype: int64
  - name: question_id
    dtype: string
  - name: input
    dtype: string
  - name: model_response
    dtype: string
  - name: reward
    dtype: float64
  - name: target
    dtype: int64
  - name: numbers
    dtype: string
  - name: round_id
    dtype: string
  splits:
  - name: train
    num_bytes: 3880910
    num_examples: 1280
  download_size: 1564967
  dataset_size: 3880910
---

# autotrainer-v0
AutoTrainer-v0: LLM-agent-controlled GRPO training on Countdown

## Dataset Info

The figures below describe the `state` config; the `eval_traces`, `rounds`, and `train_traces` configs are sized in the metadata header.

- **Rows**: 1
- **Columns**: 1

## Columns

| Column | Type | Description |
|--------|------|-------------|
| state_json | Value('string') | Full autotrainer state as JSON string |

## Generation Parameters

```json
{
  "script_name": "run_round.py",
  "model": "Qwen/Qwen2.5-1.5B-Instruct",
  "description": "AutoTrainer-v0: LLM-agent-controlled GRPO training on Countdown",
  "experiment_id": "autotrainer-v0",
  "hyperparameters": {},
  "input_datasets": []
}
```

## Usage

```python
from datasets import load_dataset

# The repository defines four configs (eval_traces, rounds, state, train_traces)
# and no default, so a config name must be passed. "state" is the single-row
# config described in the Columns table above.
dataset = load_dataset("raca-workspace-v1/autotrainer-v0", "state", split="train")
print(f"Loaded {len(dataset)} rows")
```

---

*Uploaded via [RACA](https://github.com/Zayne-sprague/Dr-Claude-Code) hf_utility.*
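
## Working with the other configs

The metadata lists `round_id` and `correct` columns for `eval_traces`, and `state_json` is described as a JSON string. The sketch below shows one possible way to summarise accuracy per round and to decode a JSON-string column. The helper names (`accuracy_by_round`, `decode_json_column`, `load_config`) are made up for illustration and are not part of the dataset or of RACA. Whether other string columns such as `config` or `training_metrics_per_step` also hold JSON is an assumption, which is why the decoder returns `None` rather than raising an error.

```python
import json
from collections import defaultdict


def accuracy_by_round(rows):
    """Return {round_id: fraction of rows whose `correct` flag is true}.

    `rows` is any iterable of dict-like records with `round_id` and
    `correct` keys, e.g. the `eval_traces` split.
    """
    totals = defaultdict(int)
    hits = defaultdict(int)
    for row in rows:
        totals[row["round_id"]] += 1
        hits[row["round_id"]] += int(bool(row["correct"]))
    return {rid: hits[rid] / totals[rid] for rid in totals}


def decode_json_column(value):
    """Parse a string that is assumed to hold JSON; return None if it does not."""
    try:
        return json.loads(value)
    except (TypeError, ValueError):
        return None


def load_config(name):
    """Load one config of the dataset from the Hub (needs network access)."""
    # Imported lazily so the pure helpers above work without `datasets` installed.
    from datasets import load_dataset

    return load_dataset("raca-workspace-v1/autotrainer-v0", name, split="train")
```

With network access, `accuracy_by_round(load_config("eval_traces"))` would give one accuracy value per round, and `decode_json_column(load_config("state")[0]["state_json"])` would return the parsed state object. These values are computed from the traces and need not match the `eval_accuracy` column stored in `rounds`.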
Provided by:
raca-workspace-v1