five

searchsim/agentsim-atc

收藏
Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/searchsim/agentsim-atc
下载链接
链接失效反馈
官方服务:
资源简介:
AgentSim Agent-Trace Corpus (ATC)是一个基于检索增强的问答代理的推理轨迹数据集,包含103,567个推理步骤,覆盖了Quasar-T、CausalQA和MSMARCO三个IR基准测试。数据集提供了20,548个监督查询-文档-答案三元组用于微调,以及199,968个独特的检索文档。每个推理步骤都可以追溯到源语料库中的特定文档,提供了步骤级可审计性。数据集适用于行为分析、链式思维微调、模仿学习等多种用途,但需遵守上游数据集的许可限制。

The AgentSim Agent-Trace Corpus (ATC) is a dataset of grounded reasoning traces for retrieval-augmented question-answering agents, containing 103,567 reasoning steps spanning three IR benchmarks (Quasar-T, CausalQA, and MSMARCO). It includes 20,548 supervised query-document-answer triples for fine-tuning and 199,968 unique retrieved documents. Each reasoning step is traceable to specific documents in the source corpus, enabling step-level auditability. The dataset is suitable for various uses such as behavioral analysis, chain-of-thought fine-tuning, and imitation learning, but users must adhere to the licensing restrictions of the upstream source datasets.
提供机构:
searchsim
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作