AgentInstruct

Name: AgentInstruct
Creator: maas
Published: 2025-07-23 15:34:06
License: 暂无描述

魔搭社区2025-07-23 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/ZhipuAI/AgentInstruct

下载链接

链接失效反馈

官方服务：

资源简介：

# AgentInstruct Dataset 🤖 [Models] • 💻 [Github Repo] • 📌 [Project Page] • 📃 [Paper] **AgentInstruct** is a meticulously curated dataset featuring **1,866** high-quality interactions, designed to enhance AI agents across six diverse real-world tasks, leveraging innovative methods like **Task Derivation** and **Self-Instruct**. - 🔍 **CoT** - Harness the power of [ReAct](https://react-lm.github.io/), offering detailed thought explanations for each action, ensuring an intricate understanding of the model's decision-making journey. - 🌍 **Diversity** - Spanning 6 real-world scenarios, from Daily Household Routines to Database Operations, and their average turns range from 5 to 35. - 🎯 **Precision** - Not all trajectories of GPT-4 are effective! Ours are rigorously filtered using strict rewards to ensure top-notch quality. - ✅ **Assurance** - Rigorous checks to avoid data leakage, ensuring pristine dataset quality. ## Task Overview | Task | # Filt. Traj. | Avg # Filt. Traj. Turns | |---|---|---| |ALFWorld|336|13.52| |WebShop|351|3.68| |Mind2Web|122|1.00| |Knowledge Graph|324|6.04| |Operating System|195|3.85| |Database|538|2.06| |**AgentInstruct**|1866|5.24| AgentInstruct includes 1,866 trajectories from 6 agents tasks. "Traj." stands for interaction trajectory. "Filt. Traj." stands for filtered trajectories. ## Models **AgentLM** models are produced by mixed training on AgentInstruct dataset and ShareGPT dataset from Llama-2-chat models. The models follow the conversation format of [Llama-2-chat](https://huggingface.co/blog/llama2#how-to-prompt-llama-2), with system prompt fixed as ``` You are a helpful, respectful and honest assistant. ``` 7B, 13B, and 70B models are available on ModelScope model hub. |Model|ModelScope Repo| |---|---| |AgentLM-7B| [ModelScope Repo](https://modelscope.cn/models/ZhipuAI/agentlm-7b) | |AgentLM-13B| [ModelScope Repo](https://modelscope.cn/models/ZhipuAI/agentlm-13b) | |AgentLM-70B| [ModelScope Repo](https://modelscope.cn/models/ZhipuAI/agentlm-70b) | Check our [[Github Repo]](https://github.com/THUDM/AgentTuning) for details about **AgentTuning**. ## Citation If you find our work useful, please consider citing AgentTuning: ``` @misc{zeng2023agenttuning, title={AgentTuning: Enabling Generalized Agent Abilities for LLMs}, author={Aohan Zeng and Mingdao Liu and Rui Lu and Bowen Wang and Xiao Liu and Yuxiao Dong and Jie Tang}, year={2023}, eprint={2310.12823}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```

# AgentInstruct 数据集 🤖 [模型] • 💻 [GitHub仓库] • 📌 [项目主页] • 📃 [论文] **AgentInstruct** 是一套经过精心整理的数据集，包含**1866**条高质量交互样本，旨在通过任务推导（Task Derivation）与自指令（Self-Instruct）等创新方法，提升AI智能体（AI Agent）在6种多样化真实场景任务中的表现。 - 🔍 **思维链（CoT）**：采用[ReAct](https://react-lm.github.io/)框架，为每一步操作提供详细的思维解释，助力深入理解模型的决策过程。 - 🌍 **多样性**：覆盖6类真实应用场景，涵盖日常家庭事务至数据库操作等领域，单条轨迹的平均交互轮次介于5至35之间。 - 🎯 **精准性**：并非所有GPT-4生成的交互轨迹都具备有效性！本数据集通过严格的奖励机制进行多轮筛选，以确保数据质量达到顶尖水准。 - ✅ **可靠性**：经过严格的合规检查以规避数据泄露问题，保障数据集的纯净性与高质量。 ## 任务概览 | 任务名称 | 筛选后交互轨迹数 | 单条轨迹平均交互轮次 | |---|---|---| | ALFWorld | 336 | 13.52 | | WebShop | 351 | 3.68 | | Mind2Web | 122 | 1.00 | | 知识图谱（Knowledge Graph） | 324 | 6.04 | | 操作系统（Operating System） | 195 | 3.85 | | 数据库（Database） | 538 | 2.06 | | **AgentInstruct** | 1866 | 5.24 | AgentInstruct 共包含来自6个智能体任务的1866条交互轨迹。其中“Traj.”为交互轨迹（interaction trajectory）的缩写，“Filt. Traj.”为筛选后交互轨迹的缩写。 ## 模型 **AgentLM** 模型是以Llama-2-chat模型为基础，在AgentInstruct数据集与ShareGPT数据集上进行混合训练得到的。该模型遵循[Llama-2-chat](https://huggingface.co/blog/llama2#how-to-prompt-llama-2)的对话格式，系统提示词固定为： You are a helpful, respectful and honest assistant. 7B、13B与70B参数版本的模型已在ModelScope模型 Hub 上线。 | 模型名称 | ModelScope 仓库地址 | |---|---| | AgentLM-7B | [ModelScope 仓库](https://modelscope.cn/models/ZhipuAI/agentlm-7b) | | AgentLM-13B | [ModelScope 仓库](https://modelscope.cn/models/ZhipuAI/agentlm-13b) | | AgentLM-70B | [ModelScope 仓库](https://modelscope.cn/models/ZhipuAI/agentlm-70b) | 如需了解**AgentTuning**的详细信息，请访问我们的[GitHub仓库](https://github.com/THUDM/AgentTuning)。 ## 引用若您的研究工作用到了本数据集，请引用AgentTuning： @misc{zeng2023agenttuning, title={AgentTuning: Enabling Generalized Agent Abilities for LLMs}, author={Aohan Zeng and Mingdao Liu and Rui Lu and Bowen Wang and Xiao Liu and Yuxiao Dong and Jie Tang}, year={2023}, eprint={2310.12823}, archivePrefix={arXiv}, primaryClass={cs.CL} }

提供机构：

maas

创建时间：

2023-10-21

5,000+

优质数据集

54 个

任务类型

进入经典数据集