kshitijthakkar/agent-eval-results-20251021_140246
Source: Hugging Face | Updated: 2025-10-21 | Indexed: 2025-10-25
Download link:
https://hf-mirror.com/datasets/kshitijthakkar/agent-eval-results-20251021_140246
Description:
The dataset contains the following fields: model name, evaluation date, test ID, agent type, difficulty, prompt, success flag, whether a tool was called, whether the correct tool was used, whether the final answer was called, response correctness, list of tools used, number of steps, response content, error message, and enhanced trace information. It has a single train split of 15 examples, 1,368,201 bytes in size. The README does not describe the dataset's purpose or give further detail.
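Since the listing names the fields only in prose, the following is a minimal sketch of how the results might be summarized once loaded. The column names `difficulty` and `success` are assumptions (hypothetical, not confirmed by the README), and the repository ID is taken from the download link above. The loader relies on the Hugging Face `datasets` library and needs network access.

```python
from collections import defaultdict


# Hypothetical column names: the listing describes the fields in prose only,
# so "difficulty" and "success" are assumptions, not the confirmed schema.
def success_rate_by_difficulty(rows, difficulty_key="difficulty", success_key="success"):
    """Return {difficulty: fraction of rows whose success flag is truthy}."""
    totals = defaultdict(int)
    passed = defaultdict(int)
    for row in rows:
        level = row[difficulty_key]
        totals[level] += 1
        if row[success_key]:
            passed[level] += 1
    return {level: passed[level] / totals[level] for level in totals}


def load_rows():
    """Load the train split from the Hugging Face Hub (needs network and `datasets`)."""
    # Imported lazily so the helper above has no third-party dependency.
    from datasets import load_dataset
    return load_dataset(
        "kshitijthakkar/agent-eval-results-20251021_140246", split="train"
    )


if __name__ == "__main__":
    rows = load_rows()
    # Check the real column names before relying on the assumed defaults.
    print(rows.column_names)
    print(success_rate_by_difficulty(rows))
```

If the actual column names differ, pass them through `difficulty_key` and `success_key` rather than editing the function.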
Provider:
kshitijthakkar



