DCAgent2/medagentbench_SWE_agent_LM_7B_20260429_173747
收藏Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/DCAgent2/medagentbench_SWE_agent_LM_7B_20260429_173747
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含896个训练样本,用于记录AI代理在特定任务中的多轮对话交互和性能评估。每个样本包括对话内容(conversations,含角色和消息)、代理类型(agent)、模型信息(model和model_provider)、日期(date)、任务类型(task)、运行标识(episode、run_id、trial_name)、结果(result)和验证输出(verifier_output)。数据集适用于AI代理训练、对话系统评估和任务性能分析。
This dataset contains 896 training examples for recording multi-turn conversational interactions and performance evaluations of AI agents on specific tasks. Each example includes conversations (with roles and content), agent type, model information (model and model_provider), date, task type, run identifiers (episode, run_id, trial_name), result, and verifier output. It is suitable for AI agent training, dialogue system evaluation, and task performance analysis.
提供机构:
DCAgent2



