
kshitijthakkar/agent-eval-results-20251021_135003

Hugging Face · Updated 2025-10-21 · Indexed 2025-10-25
Download link:
https://hf-mirror.com/datasets/kshitijthakkar/agent-eval-results-20251021_135003
Resource description:
The dataset's fields cover model information; the test date and ID; agent type; task difficulty; the prompt; flags for whether the task succeeded, whether a tool was called, whether the correct tool was called, and whether the final answer was called; whether the response was correct (possibly not provided); the list of tools used in the task; the number of steps in the task; the response content; error information; and additional trace information. The dataset has a single training split containing 15 examples. Its download size is 21,408 bytes and its actual size is 1,693,494 bytes.
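As a rough illustration of how per-task flags like those described above might be aggregated, here is a minimal sketch. The listing does not give the actual column names, so the keys `success`, `tool_called`, `correct_tool_used`, and `steps` are hypothetical stand-ins, and the sample rows are made up for the example rather than taken from the dataset.

```python
from typing import Any, Iterable, Mapping


def summarize(records: Iterable[Mapping[str, Any]]) -> dict:
    """Aggregate boolean flags and step counts over evaluation records.

    The key names used here are assumptions, not the dataset's confirmed schema.
    """
    rows = list(records)
    n = len(rows)
    if n == 0:
        return {"count": 0}

    def rate(key: str) -> float:
        # Fraction of rows where the flag is truthy; missing or None counts as False.
        return sum(1 for r in rows if r.get(key)) / n

    return {
        "count": n,
        "success_rate": rate("success"),
        "tool_called_rate": rate("tool_called"),
        "correct_tool_rate": rate("correct_tool_used"),
        "mean_steps": sum(r.get("steps", 0) for r in rows) / n,
    }


# Made-up rows for illustration only; not taken from the dataset.
sample = [
    {"success": True, "tool_called": True, "correct_tool_used": True, "steps": 2},
    {"success": False, "tool_called": True, "correct_tool_used": False, "steps": 4},
]
summary = summarize(sample)
print(summary)
```

With the Hugging Face `datasets` library installed, the real rows could presumably be obtained with `load_dataset("kshitijthakkar/agent-eval-results-20251021_135003", split="train")` and passed to `summarize` after adjusting the keys to the actual column names.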
Provided by:
kshitijthakkar