DCAgent2/swebench_verified_OpenThinker3_7B_20260426_065018
收藏Hugging Face2026-04-27 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/DCAgent2/swebench_verified_OpenThinker3_7B_20260426_065018
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多轮对话记录,涉及AI代理与模型的交互,用于任务执行和结果验证。每个示例包括对话内容(conversations,含角色和消息)、代理类型(agent)、模型名称(model)、模型提供者(model_provider)、日期(date)、任务类型(task)、回合编号(episode)、运行ID(run_id)、试验名称(trial_name)、结果(result)和验证器输出(verifier_output)。数据集可能用于评估AI模型在特定任务中的表现,例如通过对话完成操作或解决问题。
This dataset contains multi-turn conversation records involving interactions between AI agents and models, designed for task execution and result verification. Each example includes conversations (with role and content), agent type, model name, model provider, date, task type, episode number, run ID, trial name, result, and verifier output. It is likely used to evaluate the performance of AI models on specific tasks, such as completing operations or solving problems through dialogue.
提供机构:
DCAgent2



