irds/nyt_trec-core-2017
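Dataset Card for nyt/trec-core-2017
Dataset Overview
The nyt/trec-core-2017 dataset is provided by the ir-datasets package.
Data Contents
This dataset contains the following:
- queries (i.e., topics): 50 in total.
- qrels (relevance assessments): 30,030 in total.
For the documents (docs), use irds/nyt.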
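Because the documents live in a separate dataset, it can help to narrow them down to only those that the qrels actually reference. The following is a minimal sketch under stated assumptions: the helper names `judged_doc_ids` and `filter_judged_docs` are hypothetical, the document records are assumed to carry a `doc_id` field matching the one in the qrels, and the sample records are made up for illustration rather than taken from the dataset.

```python
from typing import Dict, Iterable, Iterator, Set


def judged_doc_ids(qrels: Iterable[Dict]) -> Set[str]:
    """Collect every doc_id that appears in at least one qrels record."""
    return {record["doc_id"] for record in qrels}


def filter_judged_docs(docs: Iterable[Dict], wanted: Set[str]) -> Iterator[Dict]:
    """Yield only the document records whose doc_id is in `wanted`."""
    for doc in docs:
        if doc["doc_id"] in wanted:
            yield doc


# Made-up records for illustration only; real records would come from
# the qrels of this dataset and the docs of irds/nyt.
sample_qrels = [
    {"query_id": "q1", "doc_id": "d1", "relevance": 1, "iteration": "0"},
    {"query_id": "q1", "doc_id": "d2", "relevance": 0, "iteration": "0"},
    {"query_id": "q2", "doc_id": "d1", "relevance": 2, "iteration": "0"},
]
sample_docs = [{"doc_id": "d1"}, {"doc_id": "d2"}, {"doc_id": "d3"}]

wanted_ids = judged_doc_ids(sample_qrels)
judged_docs = list(filter_judged_docs(sample_docs, wanted_ids))
```

Both helpers accept any iterable of dictionaries, so they should work equally on in-memory lists and on records streamed from a loaded dataset.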
Usage
The following example code loads the dataset and iterates over its records:
    from datasets import load_dataset

    # Queries (topics)
    queries = load_dataset('irds/nyt_trec-core-2017', 'queries')
    for record in queries:
        record  # {'query_id': ..., 'title': ..., 'description': ..., 'narrative': ...}

    # Relevance assessments
    qrels = load_dataset('irds/nyt_trec-core-2017', 'qrels')
    for record in qrels:
        record  # {'query_id': ..., 'doc_id': ..., 'relevance': ..., 'iteration': ...}
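Evaluation code often wants the qrels as a nested mapping from query ID to document ID to relevance grade, rather than as a flat list of records. The sketch below shows one way to build that mapping from records with the fields listed above. The function names, the `min_relevance=1` threshold for counting a document as relevant, and the sample records are all assumptions made for illustration, not part of the dataset or of ir-datasets.

```python
from collections import defaultdict
from typing import Dict, Iterable


def qrels_to_nested_dict(qrels: Iterable[Dict]) -> Dict[str, Dict[str, int]]:
    """Group flat qrels records as {query_id: {doc_id: relevance}}."""
    nested = defaultdict(dict)
    for record in qrels:
        nested[record["query_id"]][record["doc_id"]] = int(record["relevance"])
    return dict(nested)


def relevant_counts(nested: Dict[str, Dict[str, int]], min_relevance: int = 1) -> Dict[str, int]:
    """Count, per query, the documents graded at or above min_relevance."""
    return {
        query_id: sum(1 for rel in judged.values() if rel >= min_relevance)
        for query_id, judged in nested.items()
    }


# Made-up records for illustration only; they mirror the field names
# shown in the usage example, not actual dataset content.
sample_qrels = [
    {"query_id": "q1", "doc_id": "d1", "relevance": 1, "iteration": "0"},
    {"query_id": "q1", "doc_id": "d2", "relevance": 0, "iteration": "0"},
    {"query_id": "q2", "doc_id": "d1", "relevance": 2, "iteration": "0"},
]

nested_qrels = qrels_to_nested_dict(sample_qrels)
counts = relevant_counts(nested_qrels)
```

The `iteration` field is dropped here because the nested form only needs the relevance grade. Keep it if downstream code depends on it.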
Citation Information
@inproceedings{Allan2017TrecCore,
  author = {James Allan and Donna Harman and Evangelos Kanoulas and Dan Li and Christophe Van Gysel and Ellen Voorhees},
  title = {TREC 2017 Common Core Track Overview},
  booktitle = {TREC},
  year = {2017}
}
@article{Sandhaus2008Nyt,
  title = {The new york times annotated corpus},
  author = {Sandhaus, Evan},
  journal = {Linguistic Data Consortium, Philadelphia},
  volume = {6},
  number = {12},
  pages = {e26752},
  year = {2008}
}



