tuandunghcmut/toolbench-v1

Name: tuandunghcmut/toolbench-v1
Creator: tuandunghcmut
Published: 2025-11-21 03:55:09
License: apache-2.0

Source: Hugging Face. Updated 2025-11-21. Indexed 2025-12-20.

Download link:

https://hf-mirror.com/datasets/tuandunghcmut/toolbench-v1

Description:

---
license: apache-2.0
task_categories:
- question-answering
- text-generation
language:
- en
tags:
- tool-use
- function-calling
- toolbench
size_categories:
- 100K<n<1M
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
- config_name: benchmark
  data_files:
  - split: g1_instruction
    path: benchmark/g1_instruction-*
  - split: g1_category
    path: benchmark/g1_category-*
  - split: g1_tool
    path: benchmark/g1_tool-*
  - split: g2_instruction
    path: benchmark/g2_instruction-*
  - split: g2_category
    path: benchmark/g2_category-*
  - split: g3_instruction
    path: benchmark/g3_instruction-*
---

# ToolBench Dataset

## Dataset Description

**ToolBench** is an open-source, large-scale, high-quality instruction tuning SFT dataset designed to facilitate the construction of powerful LLMs with general tool-use capability. It was constructed automatically using ChatGPT (gpt-3.5-turbo-16k) upgraded with enhanced function call capabilities. This dataset corresponds to the training data used for **ToolLLaMA**.

- **Repository:** [OpenBMB/ToolBench](https://github.com/OpenBMB/ToolBench)
- **Paper:** [ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs](https://arxiv.org/abs/2307.16789)

## Dataset Structure

The dataset contains conversations between a user and an assistant (AutoGPT/ToolLLaMA) involving tool calls.

### Data Fields

- `id`: Unique identifier for the conversation.
- `conversations`: A list of messages between the "user", "system", and "assistant" (or "function").

### Example

```json
{
  "id": "12345",
  "conversations": [
    { "from": "system", "value": "You are AutoGPT..." },
    { "from": "user", "value": "I need to find..." },
    ...
  ]
}
```

## Dataset Configuration

The dataset is available in two configurations:

### 1. Default (`default`)

Contains the conversation data for training and validation.

- **Splits**: `train`, `validation`
- **Schema**: `id`, `conversations`

```python
from datasets import load_dataset

ds = load_dataset("tuandunghcmut/toolbench-v1", "default")
```

### 2. Benchmark (`benchmark`)

Contains the test instructions for evaluation.

- **Splits**: `g1_instruction`, `g1_category`, `g1_tool`, `g2_instruction`, `g2_category`, `g3_instruction`
- **Schema**: `query_id`, `query`, `api_list` (JSON string), `relevant_apis` (JSON string)

```python
from datasets import load_dataset

ds = load_dataset("tuandunghcmut/toolbench-v1", "benchmark")
```

## Citation

```bibtex
@misc{qin2023toolllm,
  title={ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs},
  author={Yujia Qin and Shihao Liang and Yining Ye and Kunlun Zhu and Lan Yan and Yaxi Lu and Yankai Lin and Xin Cong and Xiangru Tang and Bill Qian and Sihan Zhao and Runchu Tian and Ruobing Xie and Jie Zhou and Mark Gerstein and Dahai Li and Zhiyuan Liu and Maosong Sun},
  year={2023},
  eprint={2307.16789},
  archivePrefix={arXiv},
  primaryClass={cs.AI}
}
```
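The card notes that `api_list` and `relevant_apis` in the `benchmark` configuration are stored as JSON strings, and that each `conversations` entry carries a `from` role and a `value`. The sketch below shows one way such rows might be handled once loaded. It works on plain dictionaries so it runs without downloading anything. The helper names `decode_benchmark_row` and `count_roles`, the sample rows, and the inner keys `tool_name` and `api_name` are illustrative assumptions, not part of the dataset's documented schema.

```python
import json
from collections import Counter


def decode_benchmark_row(row):
    """Return a copy of a benchmark row with its JSON-string fields parsed.

    Hypothetical helper: only the field names come from the dataset card.
    """
    decoded = dict(row)
    for field in ("api_list", "relevant_apis"):
        value = decoded.get(field)
        if isinstance(value, str):
            decoded[field] = json.loads(value)
    return decoded


def count_roles(example):
    """Count messages per "from" role in one conversation record."""
    return Counter(message["from"] for message in example["conversations"])


# Made-up rows for illustration; the inner structure of the JSON strings
# (tool_name / api_name) is an assumption, not taken from the card.
sample_benchmark_row = {
    "query_id": 1,
    "query": "example query",
    "api_list": '[{"tool_name": "demo_tool", "api_name": "demo_api"}]',
    "relevant_apis": '[["demo_tool", "demo_api"]]',
}
sample_conversation = {
    "id": "12345",
    "conversations": [
        {"from": "system", "value": "You are AutoGPT..."},
        {"from": "user", "value": "I need to find..."},
    ],
}

decoded = decode_benchmark_row(sample_benchmark_row)
role_counts = count_roles(sample_conversation)
print(decoded["api_list"][0]["api_name"])
print(dict(role_counts))
```

With the real data, the same helpers could be applied to a row obtained from `load_dataset(...)`, since each row is returned as a dictionary. It is worth inspecting one decoded row first to confirm the actual keys inside `api_list`.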


