irds/disks45_nocr_trec-robust-2004_fold3
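Dataset card for disks45/nocr/trec-robust-2004/fold3
Dataset overview
The disks45/nocr/trec-robust-2004/fold3 dataset is provided by the ir-datasets package.
Data contents
This dataset contains the following:
queries (i.e., topics): 50
qrels (relevance assessments): 62,901
For docs, use irds/disks45_nocr.
Usage
The following example code loads and uses the dataset: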
```python
from datasets import load_dataset

# Load the queries (topics) configuration
queries = load_dataset('irds/disks45_nocr_trec-robust-2004_fold3', 'queries')
for record in queries:
    record  # {'query_id': ..., 'text': ...}

# Load the qrels (relevance assessments) configuration
qrels = load_dataset('irds/disks45_nocr_trec-robust-2004_fold3', 'qrels')
for record in qrels:
    record  # {'query_id': ..., 'doc_id': ..., 'relevance': ...}
```
Note: calling load_dataset downloads the dataset (or provides access instructions if the dataset is not public) and creates a copy of the data in 🤗 Dataset format.
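Once the qrels records are available, it is often convenient to group them by query before evaluating a ranking. The following is a minimal sketch of one way to do that, assuming each record is a dict with the query_id, doc_id, and relevance fields shown above. The helper name build_qrels_index and the sample records are hypothetical, made up for illustration, and are not part of the dataset or of any library.

```python
from collections import defaultdict


def build_qrels_index(qrels_records):
    """Group relevance judgments by query: {query_id: {doc_id: relevance}}.

    Hypothetical helper; assumes each record exposes the keys
    'query_id', 'doc_id', and 'relevance'.
    """
    index = defaultdict(dict)
    for record in qrels_records:
        index[record["query_id"]][record["doc_id"]] = int(record["relevance"])
    return dict(index)


# Made-up sample records standing in for real qrels rows.
sample_qrels = [
    {"query_id": "q1", "doc_id": "d1", "relevance": 1},
    {"query_id": "q1", "doc_id": "d2", "relevance": 0},
    {"query_id": "q2", "doc_id": "d3", "relevance": 2},
]

qrels_index = build_qrels_index(sample_qrels)

# Documents judged relevant (relevance > 0) for query "q1".
relevant_for_q1 = [d for d, rel in qrels_index["q1"].items() if rel > 0]
```

The nested-dict layout keeps per-query lookups cheap. The same function should work on any iterable of dict-like records, so in principle it can be fed the rows obtained from the qrels configuration.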
Citation information
@misc{Voorhees1996Disks45,
  title = {NIST TREC Disks 4 and 5: Retrieval Test Collections Document Set},
  author = {Ellen M. Voorhees},
  doi = {10.18434/t47g6m},
  year = {1996},
  publisher = {National Institute of Standards and Technology}
}
@inproceedings{Voorhees2004Robust,
  title = {Overview of the TREC 2004 Robust Retrieval Track},
  author = {Ellen Voorhees},
  booktitle = {TREC},
  year = {2004}
}
@inproceedings{Huston2014ACO,
  title = {A Comparison of Retrieval Models using Term Dependencies},
  author = {Samuel Huston and W. Bruce Croft},
  booktitle = {CIKM},
  year = {2014}
}



