ir_metadata: An Extensible Metadata Schema for Information Retrieval Experiments
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5997490
下载链接
链接失效反馈官方服务:
资源简介:
This dataset accompanies our work that introduces a metadata schema for TREC run files based on the PRIMAD model. PRIMAD considers essential components of computational experiments that possibly can affect reproducibility on a conceptual level. We propose to align the metadata annotations to the PRIMAD components. In order to demonstrate the potential of metadata annotations, we curated a dataset with run files derived from experiments with different instantiations of PRIMAD components and annotated these with the corresponding metadata. With this work, we hope to stimulate IR researchers to annotate run files and improve the reuse value of experimental artifacts even further.
This archive contains the following data:
demo.tar.xz : Selected annotated runs files that are used in the Colab demonstration.
metadata.zip : YAML files containing only the metadata annotations for each run.
runs.zip : The entire set of run files with annotations.
The annotated runs result from the following experiments:
Grossman and Cormack @ TREC Common Core 2017 Paper | Source
Grossman and Cormack @ TREC Common Core 2018 Paper | Source
Yu et al. @ TREC Common Core 2018 Paper | Source
Yu et al. @ ECIR 2019 Paper | Source
Breuer et al. @ SIGIR 2020 Paper | Source
Breuer et al. @ CLEF 2021 Paper | Source
创建时间:
2022-02-21



