kshitijthakkar/agent-eval-results-20251021_140601
收藏Hugging Face2025-10-21 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kshitijthakkar/agent-eval-results-20251021_140601
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列字段,用于描述某个模型的测试过程和结果,包括测试日期、测试ID、测试难度、提示信息、是否成功、是否调用工具、工具是否正确、是否调用最终答案、响应是否正确等信息。数据集分为训练集部分,共有15个示例。
The dataset includes a series of fields that describe the testing process and results of a model, such as test date, test ID, difficulty, prompt, success or failure, tool invocation, correctness of the tool, invocation of the final answer, correctness of the response, etc. The dataset is split into a training set with a total of 15 examples.
提供机构:
kshitijthakkar



