agent-distillation/Qwen2.5-32B-Instruct_agent_trajectories_2k_prefix

Name: agent-distillation/Qwen2.5-32B-Instruct_agent_trajectories_2k_prefix
Creator: agent-distillation
Published: 2025-06-04 02:14:05
License: 暂无描述

Hugging Face2025-06-04 更新2025-11-01 收录

下载链接：

https://hf-mirror.com/datasets/agent-distillation/Qwen2.5-32B-Instruct_agent_trajectories_2k_prefix

下载链接

链接失效反馈

官方服务：

资源简介：

本数据集包含2000个由Qwen2.5-32B-Instruct模型使用smolagents库作为框架生成的智能体轨迹。这些轨迹采用“first-thought prefix”方法收集，每个轨迹以模型的初始推理步骤作为前缀，这些步骤来源于Chain-of-Thought（CoT）提示。数据集的更多细节可以在提供的仓库和论文中找到。

This dataset consists of 2k agent trajectories generated by the Qwen2.5-32B-Instruct model using the smolagents library as the agent framework. The trajectories are collected using the first-thought prefix method, where each trajectory is prefixed by the models initial reasoning steps derived from Chain-of-Thought (CoT) prompting. More details about the dataset can be found in the provided repository and paper.

提供机构：

agent-distillation

5,000+

优质数据集

54 个

任务类型

进入经典数据集