kshitijthakkar/smoltrace-results-20260424_105907
Source: Hugging Face. Updated: 2026-04-24. Indexed: 2026-04-26.
Download link:
https://hf-mirror.com/datasets/kshitijthakkar/smoltrace-results-20260424_105907
Description:
This dataset contains evaluation results from a SMOLTRACE benchmark run, with detailed model performance metrics and execution data. Each record includes the model identifier, evaluation date, task ID, agent type, difficulty level, test prompt or question, success status, whether a tool was called, whether the correct tool was used, whether the final answer was called, whether the response was correct, the list of tools used, the number of steps, the agent's final response, any error message, the trace ID, execution time, total tokens, API cost, and detailed trace data.
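As a rough illustration of how records with the fields described above might be summarized, the sketch below groups rows by difficulty level and computes a success rate and an average step count. The column names `difficulty`, `success`, and `steps` are assumptions inferred from the description, not confirmed names from the dataset, and the sample rows are made up for illustration.

```python
from collections import defaultdict


def summarize_by_difficulty(rows):
    """Group result rows by difficulty; return run count, success rate, mean steps."""
    groups = defaultdict(list)
    for row in rows:
        # "difficulty" is an assumed column name, not confirmed by the source.
        groups[row["difficulty"]].append(row)

    summary = {}
    for difficulty, items in groups.items():
        n = len(items)
        summary[difficulty] = {
            "runs": n,
            # "success" and "steps" are likewise assumed column names.
            "success_rate": sum(1 for r in items if r["success"]) / n,
            "mean_steps": sum(r["steps"] for r in items) / n,
        }
    return summary


# Made-up rows for illustration only; these are not taken from the dataset.
sample_rows = [
    {"difficulty": "easy", "success": True, "steps": 2},
    {"difficulty": "easy", "success": False, "steps": 4},
    {"difficulty": "hard", "success": True, "steps": 6},
]

summary = summarize_by_difficulty(sample_rows)
```

Real rows could presumably be obtained with the Hugging Face `datasets` library by passing the repository ID above to `load_dataset`; the actual column names should be checked against the loaded data before reusing this function.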
Provided by:
kshitijthakkar



