irds/beir_hotpotqa_dev
Dataset Card for beir/hotpotqa/dev
Dataset Overview
The beir/hotpotqa/dev dataset is provided by the ir-datasets package.
Data Contents
This dataset provides:
queries (i.e., topics): 5,447
qrels (relevance assessments): 10,894
For docs, use irds/beir_hotpotqa.
Usage
The following example code loads and iterates over the dataset:
```python
from datasets import load_dataset

# Load the queries (topics) configuration
queries = load_dataset('irds/beir_hotpotqa_dev', 'queries')
for record in queries:
    record # {'query_id': ..., 'text': ...}

# Load the qrels (relevance assessments) configuration
qrels = load_dataset('irds/beir_hotpotqa_dev', 'qrels')
for record in qrels:
    record # {'query_id': ..., 'doc_id': ..., 'relevance': ..., 'iteration': ...}
```
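Evaluation code usually needs the qrels grouped by query rather than as a flat stream of records. The following is a minimal sketch of that grouping, assuming each record is a dict with the `query_id`, `doc_id`, and `relevance` fields shown above. The helper name `build_qrels_index` and the sample records are hypothetical illustrations, not part of the dataset or of any library, and the sketch runs without downloading anything.

```python
from collections import defaultdict


def build_qrels_index(records):
    """Group qrel records into {query_id: {doc_id: relevance}}.

    Assumes each record is a mapping with 'query_id', 'doc_id',
    and 'relevance' keys, as in the record shape shown above.
    """
    index = defaultdict(dict)
    for rec in records:
        index[rec["query_id"]][rec["doc_id"]] = int(rec["relevance"])
    return dict(index)


# Hypothetical sample records (made-up IDs, not real dataset entries).
sample_qrels = [
    {"query_id": "q1", "doc_id": "d10", "relevance": 1, "iteration": "0"},
    {"query_id": "q1", "doc_id": "d11", "relevance": 1, "iteration": "0"},
    {"query_id": "q2", "doc_id": "d20", "relevance": 1, "iteration": "0"},
]

qrels_index = build_qrels_index(sample_qrels)
print(qrels_index["q1"])
```

The same function should accept any iterable of records with those three fields, so a loaded qrels split could be passed in place of `sample_qrels`.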
Citation Information
@inproceedings{Yang2018Hotpotqa,
  title = "{H}otpot{QA}: A Dataset for Diverse, Explainable Multi-hop Question Answering",
  author = "Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William and Salakhutdinov, Ruslan and Manning, Christopher D.",
  booktitle = "Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing",
  month = oct # "-" # nov,
  year = "2018",
  address = "Brussels, Belgium",
  publisher = "Association for Computational Linguistics",
  url = "https://www.aclweb.org/anthology/D18-1259",
  doi = "10.18653/v1/D18-1259",
  pages = "2369--2380"
}
@article{Thakur2021Beir,
  title = "BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models",
  author = "Thakur, Nandan and Reimers, Nils and Rücklé, Andreas and Srivastava, Abhishek and Gurevych, Iryna",
  journal = "arXiv preprint arXiv:2104.08663",
  month = "4",
  year = "2021",
  url = "https://arxiv.org/abs/2104.08663",
}



