
ir_metadata: An Extensible Metadata Schema for Information Retrieval Experiments

Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/5997490
Description:
This dataset accompanies our work introducing a metadata schema for TREC run files based on the PRIMAD model. PRIMAD identifies, on a conceptual level, the essential components of a computational experiment that can affect reproducibility. We propose aligning the metadata annotations with the PRIMAD components. To demonstrate the potential of metadata annotations, we curated a dataset of run files derived from experiments with different instantiations of the PRIMAD components and annotated them with the corresponding metadata. With this work, we hope to encourage IR researchers to annotate run files and further improve the reuse value of experimental artifacts.

This archive contains the following data:
- demo.tar.xz: Selected annotated run files that are used in the Colab demonstration.
- metadata.zip: YAML files containing only the metadata annotations for each run.
- runs.zip: The entire set of run files with annotations.

The annotated runs result from the following experiments:
- Grossman and Cormack @ TREC Common Core 2017 (Paper | Source)
- Grossman and Cormack @ TREC Common Core 2018 (Paper | Source)
- Yu et al. @ TREC Common Core 2018 (Paper | Source)
- Yu et al. @ ECIR 2019 (Paper | Source)
- Breuer et al. @ SIGIR 2020 (Paper | Source)
- Breuer et al. @ CLEF 2021 (Paper | Source)
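As a rough illustration of how an annotated run file might be consumed, the sketch below separates annotation lines from ranking lines. It assumes the annotations are stored as '#'-prefixed comment lines inside the run file and that the top-level keys are named after the PRIMAD components. The description above does not spell out this layout, so treat both as assumptions. The function names and the sample values are hypothetical. Only the six-column TREC run format (query id, Q0, document id, rank, score, run tag) is standard.

```python
from typing import List, Tuple

# PRIMAD: Platform, Research goal, Implementation, Method, Actor, Data.
RunRow = Tuple[str, str, str, int, float, str]


def split_annotated_run(text: str, comment_prefix: str = "#") -> Tuple[List[str], List[RunRow]]:
    """Separate comment-prefixed annotation lines from TREC run lines.

    Assumption: annotations are comment lines; everything else is a
    six-column TREC run line.
    """
    header: List[str] = []
    rows: List[RunRow] = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith(comment_prefix):
            body = stripped[len(comment_prefix):]
            # Drop at most one space after the prefix so that any
            # YAML-style indentation of nested keys is preserved.
            if body.startswith(" "):
                body = body[1:]
            header.append(body)
            continue
        qid, q0, docid, rank, score, tag = stripped.split()
        rows.append((qid, q0, docid, int(rank), float(score), tag))
    return header, rows


def annotated_components(header: List[str]) -> List[str]:
    """Return the unindented 'key:' lines, i.e. the top-level sections."""
    return [
        line[:-1]
        for line in header
        if line.endswith(":") and not line.startswith(" ")
    ]


# Hypothetical annotated run; keys and values are made up for illustration.
example = """\
# platform:
#   os: hypothetical-os
# actor:
#   team: hypothetical-team
321 Q0 doc-a 1 12.5 demo_run
321 Q0 doc-b 2 11.0 demo_run
"""

header, rows = split_annotated_run(example)
components = annotated_components(header)
```

The recovered header lines could then be passed to a YAML parser, which is omitted here to keep the sketch free of third-party dependencies.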
Created:
2022-02-21