---
pretty_name: '`beir/nfcorpus/dev`'
viewer: false
source_datasets: ['irds/beir_nfcorpus']
task_categories:
- text-retrieval
---
# Dataset Card for `beir/nfcorpus/dev`
The `beir/nfcorpus/dev` dataset, provided by the [ir-datasets](https://ir-datasets.com/) package.
For more information about the dataset, see the [documentation](https://ir-datasets.com/beir#beir/nfcorpus/dev).
# Data
This dataset provides:
- `queries` (i.e., topics); count=324
- `qrels`: (relevance assessments); count=11,385
- For `docs`, use [`irds/beir_nfcorpus`](https://huggingface.co/datasets/irds/beir_nfcorpus)
## Usage
```python
from datasets import load_dataset
queries = load_dataset('irds/beir_nfcorpus_dev', 'queries')
for record in queries:
record # {'query_id': ..., 'text': ...}
qrels = load_dataset('irds/beir_nfcorpus_dev', 'qrels')
for record in qrels:
record # {'query_id': ..., 'doc_id': ..., 'relevance': ..., 'iteration': ...}
```
Note that calling `load_dataset` will download the dataset (or provide access instructions when it's not public) and make a copy of the
data in 🤗 Dataset format.
## Citation Information
```
@inproceedings{Boteva2016Nfcorpus,
title="A Full-Text Learning to Rank Dataset for Medical Information Retrieval",
author = "Vera Boteva and Demian Gholipour and Artem Sokolov and Stefan Riezler",
booktitle = "Proceedings of the European Conference on Information Retrieval ({ECIR})",
location = "Padova, Italy",
publisher = "Springer",
year = 2016
}
@article{Thakur2021Beir,
title = "BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models",
author = "Thakur, Nandan and Reimers, Nils and Rücklé, Andreas and Srivastava, Abhishek and Gurevych, Iryna",
journal= "arXiv preprint arXiv:2104.08663",
month = "4",
year = "2021",
url = "https://arxiv.org/abs/2104.08663",
}
```
漂亮名称:`beir/nfcorpus/dev`
查看器:否
源数据集:['irds/beir_nfcorpus']
任务类别:
- 文本检索
# `beir/nfcorpus/dev` 数据集卡片
本数据集由 [ir-datasets](https://ir-datasets.com/) 包提供。如需了解该数据集的更多详情,请参阅 [官方文档](https://ir-datasets.com/beir#beir/nfcorpus/dev)。
## 数据详情
本数据集包含以下内容:
- `queries`(即查询主题):共计324条
- `qrels`(相关性评估标注):共计11385条
- 如需获取`docs`(文档集),请使用 [`irds/beir_nfcorpus`](https://huggingface.co/datasets/irds/beir_nfcorpus) 数据集。
## 使用方法
python
from datasets import load_dataset
queries = load_dataset('irds/beir_nfcorpus_dev', 'queries')
for record in queries:
record # {'query_id': ..., 'text': ...}
qrels = load_dataset('irds/beir_nfcorpus_dev', 'qrels')
for record in qrels:
record # {'query_id': ..., 'doc_id': ..., 'relevance': ..., 'iteration': ...}
注:调用`load_dataset`函数将下载该数据集(若数据集未公开则会提供访问指引),并将数据转换为 🤗 数据集格式。
## 引用信息
@inproceedings{Boteva2016Nfcorpus,
title="A Full-Text Learning to Rank Dataset for Medical Information Retrieval",
author = "Vera Boteva and Demian Gholipour and Artem Sokolov and Stefan Riezler",
booktitle = "Proceedings of the European Conference on Information Retrieval ({ECIR})",
location = "Padova, Italy",
publisher = "Springer",
year = 2016
}
@article{Thakur2021Beir,
title = "BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models",
author = "Thakur, Nandan and Reimers, Nils and Rücklé, Andreas and Srivastava, Abhishek and Gurevych, Iryna",
journal= "arXiv preprint arXiv:2104.08663",
month = "4",
year = "2021",
url = "https://arxiv.org/abs/2104.08663",
}