agent-distillation/Qwen2.5-32B-Instruct_agent_trajectories_2k_prefix
收藏Hugging Face2025-06-04 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/agent-distillation/Qwen2.5-32B-Instruct_agent_trajectories_2k_prefix
下载链接
链接失效反馈官方服务:
资源简介:
本数据集包含2000个由Qwen2.5-32B-Instruct模型使用smolagents库作为框架生成的智能体轨迹。这些轨迹采用“first-thought prefix”方法收集,每个轨迹以模型的初始推理步骤作为前缀,这些步骤来源于Chain-of-Thought(CoT)提示。数据集的更多细节可以在提供的仓库和论文中找到。
This dataset consists of 2k agent trajectories generated by the Qwen2.5-32B-Instruct model using the smolagents library as the agent framework. The trajectories are collected using the first-thought prefix method, where each trajectory is prefixed by the models initial reasoning steps derived from Chain-of-Thought (CoT) prompting. More details about the dataset can be found in the provided repository and paper.
提供机构:
agent-distillation



