kshitijthakkar/agent-eval-results-20251021_140601

Name: kshitijthakkar/agent-eval-results-20251021_140601
Creator: kshitijthakkar
Published: 2025-10-21 08:38:18
License: No description available

Source: Hugging Face. Updated 2025-10-21. Indexed 2025-10-25.

Download link:

https://hf-mirror.com/datasets/kshitijthakkar/agent-eval-results-20251021_140601


Description:

The dataset records the testing process and results for a model. Its fields include the test date, test ID, difficulty, prompt, whether the test succeeded, whether a tool was called, whether the correct tool was used, whether the final answer was called, and whether the response was correct, among others. It has a single training split containing 15 examples.
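
The fields described above lend themselves to simple per-difficulty summaries. The following is a minimal sketch, not the dataset author's own tooling. The column names (`test_id`, `difficulty`, `success`, `tool_called`, `correct_tool`) are assumptions inferred from the description, and the sample rows are invented for illustration rather than taken from the dataset.

```python
from collections import defaultdict

# Hypothetical rows: the field names are assumptions based on the description,
# and the values are made up for illustration (not real dataset content).
SAMPLE_ROWS = [
    {"test_id": "t1", "difficulty": "easy", "success": True,
     "tool_called": True, "correct_tool": True},
    {"test_id": "t2", "difficulty": "easy", "success": False,
     "tool_called": True, "correct_tool": False},
    {"test_id": "t3", "difficulty": "hard", "success": True,
     "tool_called": True, "correct_tool": True},
    {"test_id": "t4", "difficulty": "hard", "success": False,
     "tool_called": False, "correct_tool": False},
]


def success_rate_by_difficulty(rows):
    """Return {difficulty: fraction of rows whose `success` flag is true}."""
    totals = defaultdict(int)
    passed = defaultdict(int)
    for row in rows:
        totals[row["difficulty"]] += 1
        if row["success"]:
            passed[row["difficulty"]] += 1
    return {level: passed[level] / totals[level] for level in totals}


rates = success_rate_by_difficulty(SAMPLE_ROWS)
for level, rate in sorted(rates.items()):
    print(f"{level}: {rate:.0%} success")
```

To run the same summary on the real data, the rows could presumably be loaded with `datasets.load_dataset("kshitijthakkar/agent-eval-results-20251021_140601", split="train")` from the Hugging Face `datasets` library, after checking the actual column names against the assumed ones.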

Provided by:

kshitijthakkar
