ginipick/Toucan-1.5M
Hugging Face | Updated 2025-11-02 | Indexed 2025-11-15
Download link:
https://hf-mirror.com/datasets/ginipick/Toucan-1.5M
Description:
Toucan-1.5M is the largest fully synthetic tool-agent dataset to date, designed to advance tool use in agentic LLMs. It comprises over 1.5 million trajectories synthesized from 495 real-world Model Context Protocol (MCP) servers spanning more than 2,000 tools. By leveraging authentic MCP environments, Toucan-1.5M generates diverse, realistic, and challenging tasks that require multiple tools, with trajectories that involve real tool executions across multi-round, multi-turn, sequential, and parallel tool calls.
Provided by:
ginipick



