irds/disks45_nocr_trec-robust-2004
收藏数据集概述
数据集名称
disks45/nocr/trec-robust-2004
数据来源
- 原始数据集:
irds/disks45_nocr
数据内容
queries: 查询(主题),数量为250。qrels: 相关性评估,数量为311,410。docs: 文档数据,需从irds/disks45_nocr获取。
使用示例
python from datasets import load_dataset
queries = load_dataset(irds/disks45_nocr_trec-robust-2004, queries) for record in queries: record # {query_id: ..., title: ..., description: ..., narrative: ...}
qrels = load_dataset(irds/disks45_nocr_trec-robust-2004, qrels) for record in qrels: record # {query_id: ..., doc_id: ..., relevance: ..., iteration: ...}
引用信息
@misc{Voorhees1996Disks45, title = {NIST TREC Disks 4 and 5: Retrieval Test Collections Document Set}, author = {Ellen M. Voorhees}, doi = {10.18434/t47g6m}, year = {1996}, publisher = {National Institute of Standards and Technology} } @inproceedings{Voorhees2004Robust, title={Overview of the TREC 2004 Robust Retrieval Track}, author={Ellen Voorhees}, booktitle={TREC}, year={2004} } @inproceedings{Huston2014ACO, title={A Comparison of Retrieval Models using Term Dependencies}, author={Samuel Huston and W. Bruce Croft}, booktitle={CIKM}, year={2014} }




