harithoppil/ml-swe-prompts

Name: harithoppil/ml-swe-prompts
Creator: harithoppil
Published: 2026-04-28 10:45:18
License: 暂无描述

Hugging Face2026-04-28 更新2026-05-03 收录

下载链接：

https://hf-mirror.com/datasets/harithoppil/ml-swe-prompts

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 pretty_name: ML SWE Prompts tags: - code - swe-bench - terminal-bench - software-engineering - ml - pytorch task_categories: - text-generation language: - en size_categories: - 1K<n<10K configs: - config_name: all data_files: - split: train path: data/all.jsonl - config_name: swe_bench_ml data_files: - split: train path: data/swe_bench_ml.jsonl - config_name: swe_dev_sft data_files: - split: train path: data/swe_dev_sft.jsonl - config_name: swe_dev_rft data_files: - split: train path: data/swe_dev_rft.jsonl - config_name: terminal_bench_verified data_files: - split: train path: data/terminal_bench_verified.jsonl - config_name: terminal_bench_trajectories data_files: - split: train path: data/terminal_bench_trajectories.jsonl dataset_info: features: - name: source dtype: string - name: prompt dtype: string - name: instance_id dtype: string - name: repo dtype: string - name: task_name dtype: string - name: model dtype: string - name: agent dtype: string - name: reward dtype: float64 - name: version dtype: string splits: - name: train num_examples: 6220 --- # ML SWE Prompts Unified collection of ML/training-related software engineering prompts for OPD distillation training. All prompts are in English. Filtered to core ML repos: huggingface (1,058), numpy (937), Lightning-AI (377), ray-project (342). Excludes pandas-dev, qiskit, open-mmlab, scipy, tensorflow, spaCy. ## Splits | Config | Source | Rows | Description | |--------|--------|------|-------------| | `all` | Combined | 6,220 | All prompts combined | | `swe_bench_ml` | SWE-bench train | 2,714 | Problem statements from core ML repos (HF, numpy, Lightning, Ray) + keyword-matched from SWE-Dev | | `swe_dev_sft` | SWE-Dev-train | 3,054 | ML-related agent conversations (SFT format) | | `swe_dev_rft` | SWE-Dev-train | 345 | ML-related agent conversations (RFT format, with rewards) | | `terminal_bench_verified` | Terminal-Bench 2 Verified | 89 | Task instructions from TB2 verified tasks | | `terminal_bench_trajectories` | TB2 Leaderboard | 18 | Unique ML task prompts from agent trajectories (deduplicated) | ## Schema All rows have at minimum `source` and `prompt`. Additional fields vary by source: - **swe_bench_ml**: `instance_id`, `repo`, `version` - **swe_dev_sft/rft**: `instance_id`, `repo`, `reward` - **terminal_bench_verified**: `task_name` - **terminal_bench_trajectories**: `task_name`, `model`, `agent`, `reward` ## Citation ```bibtex @article{merrill2026terminal, title={Terminal-bench: Benchmarking agents on hard, realistic tasks in command line interfaces}, author={Merrill, Mike A and Shaw, Alexander G and Carlini, Nicholas and others}, journal={arXiv preprint arXiv:2601.11868}, year={2026} } @inproceedings{princeton2023swebench, title={SWE-bench: Can Language Models Resolve Real-World GitHub Issues?}, author={Princeton NLP}, year={2023} } ```

提供机构：

harithoppil

5,000+

优质数据集

54 个

任务类型

进入经典数据集