DeepDive

Name: DeepDive
Creator: maas
Published: 2026-01-09 18:44:21
License: 暂无描述

魔搭社区2026-01-09 更新2025-09-20 收录

下载链接：

https://modelscope.cn/datasets/ZhipuAI/DeepDive

下载链接

链接失效反馈

官方服务：

资源简介：

# DeepDive Dataset <div align="center"> [![GitHub](https://img.shields.io/github/stars/THUDM/DeepDive?style=social)](https://github.com/THUDM/DeepDive) [![arXiv](https://img.shields.io/badge/arXiv-2509.10446-b31b1b.svg)](https://arxiv.org/pdf/2509.10446) [![Dataset](https://img.shields.io/badge/🤗%20Dataset-DeepDive-blueviolet)](https://huggingface.co/datasets/zai-org/DeepDive) [![Model](https://img.shields.io/badge/🤗%20Model-Coming%20soon-ffcc00)](#) </div> ## Overview This is the training dataset for [DeepDive](https://github.com/THUDM/DeepDive), an automated approach for training deep search agents with complex, multi-step reasoning capabilities. The dataset is constructed through automated knowledge graph random walks, entity obfuscation, and difficulty filtering to create challenging questions that require sophisticated search and retrieval skills. <div align="center"> <img src="./assets/kg_data_pipeline.svg" alt="Data Synthesis Pipeline" width="75%"> <p><em></em></p> </div> ## Dataset Statistics | Component | Split | Size | Description | | :--------------------- | :--------------- | :---- | :-------------------------------------------------------- | | **Total Dataset** | qa_sft, qa_rl | 3,250 | Complete collection of QA pairs | | **SFT Portion** | qa_sft | 1,016 | Question-answer pairs for Supervised Fine-Tuning | | ↳ **SFT Trajectories** | trajectories_sft | 858 | Search trajectories from SFT QA pairs via reject sampling | | **RL Portion** | qa_rl | 2,234 | Question-answer pairs for Reinforcement Learning | ## Data Structure ### `qa_sft` and `qa_rl` Split **Fields:** - `id`: Unique identifier for the QA pair - `question`: Multi-hop reasoning question requiring search - `answer`: Ground truth answer - `conversation`: [] (empty) ### `trajectories_sft` Split **Fields:** - `id`: Unique identifier for the trajectory - `question`: The original question - `answer`: Ground truth answer - `conversation`: List of conversation turns showing the search process (role, content) ## Usage ```python from datasets import load_dataset dataset = load_dataset("zai-org/DeepDive") # Access splits sft_data = dataset["qa_sft"] rl_data = dataset["qa_rl"] trajectories = dataset["trajectories_sft"] ``` ## Citation If you find DeepDive useful for your research, please cite our paper: ```bibtex @misc{lu2025deepdiveadvancingdeepsearch, title={DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL}, author={Rui Lu and Zhenyu Hou and Zihan Wang and Hanchen Zhang and Xiao Liu and Yujiang Li and Shi Feng and Jie Tang and Yuxiao Dong}, year={2025}, eprint={2509.10446}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2509.10446}, } ```

# DeepDive 数据集 <div align="center"> [![GitHub](https://img.shields.io/github/stars/THUDM/DeepDive?style=social)](https://github.com/THUDM/DeepDive) [![arXiv](https://img.shields.io/badge/arXiv-2509.10446-b31b1b.svg)](https://arxiv.org/pdf/2509.10446) [![Dataset](https://img.shields.io/badge/%F0%9F%A4%97%20Dataset-DeepDive-blueviolet)](https://huggingface.co/datasets/zai-org/DeepDive) [![Model](https://img.shields.io/badge/%F0%9F%A4%97%20Model-Coming%20soon-ffcc00)](#) </div> ## 概览本数据集为[DeepDive](https://github.com/THUDM/DeepDive)的训练数据集，后者是一种用于训练具备复杂多步推理能力的深度搜索智能体（AI Agent）的自动化方法。本数据集通过自动化知识图谱（Knowledge Graph）随机游走、实体混淆与难度筛选构建，生成需要运用高级搜索与检索技能的挑战性问题。 <div align="center"> <img src="./assets/kg_data_pipeline.svg" alt="数据合成流程" width="75%"> <p><em></em></p> </div> ## 数据集统计 | 组件 | 拆分集 | 规模 | 描述 | | :--------------------- | :--------------- | :---- | :-------------------------------------------------------- | | **总数据集** | qa_sft, qa_rl | 3,250 | 完整的问答对集合 | | **监督微调（Supervised Fine-Tuning，SFT）子集** | qa_sft | 1,016 | 用于监督微调的问答对 | | ↳ **SFT轨迹** | trajectories_sft | 858 | 通过拒绝采样从SFT问答对中获取的搜索轨迹 | | **强化学习（Reinforcement Learning，RL）子集** | qa_rl | 2,234 | 用于强化学习的问答对 | ## 数据结构 ### `qa_sft` 与 `qa_rl` 拆分集 **字段说明：** - `id`：问答对的唯一标识符 - `question`：需要进行多跳推理的搜索类问题 - `answer`：标准答案 - `conversation`：[]（空列表） ### `trajectories_sft` 拆分集 **字段说明：** - `id`：轨迹的唯一标识符 - `question`：原始问题 - `answer`：标准答案 - `conversation`：展示搜索过程的对话轮次列表（包含角色与内容） ## 使用方法 python from datasets import load_dataset dataset = load_dataset("zai-org/DeepDive") # 加载拆分集 sft_data = dataset["qa_sft"] rl_data = dataset["qa_rl"] trajectories = dataset["trajectories_sft"] ## 引用声明如果您的研究中用到了DeepDive数据集，请引用我们的论文： bibtex @misc{lu2025deepdiveadvancingdeepsearch, title={DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL}, author={Rui Lu and Zhenyu Hou and Zihan Wang and Hanchen Zhang and Xiao Liu and Yujiang Li and Shi Feng and Jie Tang and Yuxiao Dong}, year={2025}, eprint={2509.10446}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2509.10446}, }

提供机构：

maas

创建时间：

2025-09-18

5,000+

优质数据集

54 个

任务类型

进入经典数据集