kshitijthakkar/smoalagent-results-21102025
Source: Hugging Face | Updated: 2025-10-21 | Indexed: 2025-10-25
Download link:
https://hf-mirror.com/datasets/kshitijthakkar/smoalagent-results-21102025
Description:
This dataset logs a model's performance across a range of tests. Each record includes the model name, evaluation date, and test ID, along with whether the model succeeded and whether it used tools correctly. Each test case also carries a detailed step-by-step trace, including the agent type, prompt, test difficulty, test ID, events, and status.
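As a rough illustration of how per-test records like these might be summarized, the sketch below groups results by difficulty and computes a success rate. The field names (`model`, `test_id`, `difficulty`, `success`, `tool_called`) and the sample rows are assumptions made up for this example; they are not taken from the dataset's actual schema, which should be checked before adapting the code.

```python
from collections import defaultdict

# Hypothetical records: the field names and values are invented for
# illustration and are NOT the dataset's confirmed schema or real results.
SAMPLE_RESULTS = [
    {"model": "model-a", "test_id": "t1", "difficulty": "easy", "success": True, "tool_called": True},
    {"model": "model-a", "test_id": "t2", "difficulty": "hard", "success": False, "tool_called": True},
    {"model": "model-a", "test_id": "t3", "difficulty": "easy", "success": True, "tool_called": False},
    {"model": "model-a", "test_id": "t4", "difficulty": "hard", "success": False, "tool_called": False},
]


def success_rate_by_difficulty(records):
    """Return a dict mapping each difficulty level to its fraction of successful tests."""
    totals = defaultdict(int)
    passed = defaultdict(int)
    for rec in records:
        totals[rec["difficulty"]] += 1
        if rec["success"]:
            passed[rec["difficulty"]] += 1
    return {level: passed[level] / totals[level] for level in totals}


rates = success_rate_by_difficulty(SAMPLE_RESULTS)
for level, rate in sorted(rates.items()):
    print(f"{level}: {rate:.0%} success")
```

The same grouping pattern would apply to any boolean column, such as a tool-usage flag, once the real column names are known.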
Provider:
kshitijthakkar



