DCAgent2/medagentbench_SWE_agent_LM_7B_20260429_173747

Name: DCAgent2/medagentbench_SWE_agent_LM_7B_20260429_173747
Creator: DCAgent2
Published: 2026-04-29 19:38:00
License: 暂无描述

Hugging Face2026-04-29 更新2026-05-03 收录

下载链接：

https://hf-mirror.com/datasets/DCAgent2/medagentbench_SWE_agent_LM_7B_20260429_173747

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含896个训练样本，用于记录AI代理在特定任务中的多轮对话交互和性能评估。每个样本包括对话内容（conversations，含角色和消息）、代理类型（agent）、模型信息（model和model_provider）、日期（date）、任务类型（task）、运行标识（episode、run_id、trial_name）、结果（result）和验证输出（verifier_output）。数据集适用于AI代理训练、对话系统评估和任务性能分析。

This dataset contains 896 training examples for recording multi-turn conversational interactions and performance evaluations of AI agents on specific tasks. Each example includes conversations (with roles and content), agent type, model information (model and model_provider), date, task type, run identifiers (episode, run_id, trial_name), result, and verifier output. It is suitable for AI agent training, dialogue system evaluation, and task performance analysis.

提供机构：

DCAgent2

5,000+

优质数据集

54 个

任务类型

进入经典数据集