DuReader
Source: ModelScope community (updated 2025-11-18, indexed 2024-08-31)
Download link:
https://modelscope.cn/datasets/OmniData/DuReader
Resource description:
displayName: DuReader
license:
- DuReader Custom
mediaTypes:
- Text
paperUrl: https://arxiv.org/pdf/1711.05073v4.pdf
publishDate: "2018"
publishUrl: http://ai.baidu.com/broad/subordinate?dataset=dureader
publisher:
- Baidu
tags:
- Question And Answer
taskTypes:
- Reading Comprehension
- Open-Domain Question Answering
- Reading Comprehension Zero Shot
- Reading Comprehension One Shot
- Reading Comprehension Few Shot
---
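# Dataset Introduction
## Overview
DuReader is a large-scale, open-domain Chinese machine reading comprehension dataset. It contains 200K questions, 420K answers, and 1M documents. The questions and documents are drawn from Baidu Search and Baidu Zhidao, and the answers are manually generated. The dataset also provides question-type annotations: each question is manually labeled as Entity, Description, or YesNo, and additionally as either Fact or Opinion.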
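The question-type annotation described above has two independent axes, so each question falls into one of 3 × 2 = 6 combined classes. The following is a minimal sketch of that label space; the class names, field names, and string values (`QuestionType`, `Factuality`, `"YES_NO"`, and so on) are illustrative assumptions, not the dataset's confirmed schema.

```python
from dataclasses import dataclass
from enum import Enum
from itertools import product


class QuestionType(Enum):
    # Hypothetical string values; check the released files for the real ones.
    ENTITY = "ENTITY"
    DESCRIPTION = "DESCRIPTION"
    YES_NO = "YES_NO"


class Factuality(Enum):
    # Second annotation axis: is the question about a fact or an opinion?
    FACT = "FACT"
    OPINION = "OPINION"


@dataclass(frozen=True)
class QuestionLabel:
    """One combined annotation: a question type plus a fact/opinion tag."""
    question_type: QuestionType
    factuality: Factuality


def all_labels() -> list[QuestionLabel]:
    """Enumerate every combination of the two annotation axes."""
    return [QuestionLabel(q, f) for q, f in product(QuestionType, Factuality)]


if __name__ == "__main__":
    for label in all_labels():
        print(label.question_type.value, label.factuality.value)
```

Keeping the two axes as separate enums, rather than one flat six-way label, makes it straightforward to filter or group questions along either axis on its own.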
## Citation
```
@article{he2017dureader,
title={Dureader: a chinese machine reading comprehension dataset from real-world applications},
author={He, Wei and Liu, Kai and Liu, Jing and Lyu, Yajuan and Zhao, Shiqi and Xiao, Xinyan and Liu, Yuan and Wang, Yizhong and Wu, Hua and She, Qiaoqiao and others},
journal={arXiv preprint arXiv:1711.05073},
year={2017}
}
```
## Download dataset
:modelscope-code[]{type="git"}
Provider:
maas
Created:
2024-07-11



