five

LocoreMind/agent-sft-example

收藏
Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/LocoreMind/agent-sft-example
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 language: - en tags: - agent - tool-use - sft - ms-swift - copaw size_categories: - n<1K --- # Agent SFT Example Dataset (CoPaw Format) This is an example dataset for training agent models with tool calling capabilities using MS-SWIFT, formatted for CoPaw-Flash-9B compatibility. ## Format The dataset uses the standard **messages format** with only three roles: `system`, `user`, `assistant`. - **tool_call**: Placed in `assistant` content using XML tags with **snake_case** tool names - **tool_response**: Placed in `user` content using `<tool_response></tool_response>` tags ## Tool Call Format (CoPaw-Flash-9B Compatible) ```xml <tool_call> <function=bash> <parameter=command> ls -la </parameter> </function> </tool_call> ``` **Important**: Tool names use **snake_case** format (e.g., `bash`, `read`, `write`, `file_tree`) to match CoPaw-Flash-9B's chat_template.jinja. ## Example Structure ### Single Tool Call ``` system: System prompt with tool definitions user: User question assistant: [optional explanation] + <tool_call><function=bash>...</function></tool_call> user: <tool_response>tool output</tool_response> assistant: Final answer ``` ### Multi-turn Tool Calls ``` system: System prompt user: User question assistant: Explanation + <tool_call><function=bash>...</function></tool_call> user: <tool_response>first result</tool_response> assistant: Explanation + <tool_call><function=read>...</function></tool_call> user: <tool_response>second result</tool_response> assistant: Final summary ``` ### Regular Conversation (no tools) ``` user: Regular question assistant: Direct answer ``` ## Available Tools (snake_case) | Tool Name | Description | |-----------|-------------| | `bash` | Execute shell commands | | `read` | Read file contents | | `write` | Write content to file | | `edit` | Edit file contents | | `glob` | Find files matching pattern | | `grep` | Search for patterns in files | | `file_tree` | View directory structure | | `file_list` | List files in directory | ## Usage with MS-SWIFT ```bash swift sft \ --model Qwen/Qwen3-4B-Instruct \ --dataset LocoreMind/agent-sft-example \ --tuner_type lora \ --output_dir output ``` ## Compatible Models This format is compatible with: - CoPaw-Flash-9B - CoPaw-Flash-9B-Agent-Merged - Qwen3 series (with appropriate chat template) - Other models using CoPaw-style XML tool format ## Related Datasets - [LocoreMind/agent-trajectories-msswift](https://huggingface.co/datasets/LocoreMind/agent-trajectories-msswift) - Full training dataset
提供机构:
LocoreMind
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作