fantaxy/Toucan-1.5M
收藏Hugging Face2025-11-02 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/fantaxy/Toucan-1.5M
下载链接
链接失效反馈官方服务:
资源简介:
Toucan-1.5M是一个旨在提高大型语言模型中工具使用的数据集,包含了超过1.5百万个轨迹,这些轨迹由495个真实世界的模型上下文协议(MCPs)合成。数据集包含多个配置,如Kimi-K2、OSS、Qwen3和SFT,每个配置都有不同的特征和训练集统计信息。数据集适用于多种工具的使用,涵盖了多轮次、顺序和并行的工具调用。
Toucan-1.5M is a dataset designed to advance tool use in large language models, consisting of over 1.5 million trajectories synthesized from 495 real-world Model Context Protocols (MCPs). The dataset includes multiple configurations such as Kimi-K2, OSS, Qwen3, and SFT, each with different features and training set statistics. It covers the use of multiple tools, including multi-turn, sequential, and parallel tool calls.
提供机构:
fantaxy



