Experimental Data for "What Makes a Top-Performing Precision Medicine Search Engine? Tracing Main System Features in a Systematic Way" at SIGIR2020
Source: NIAID Data Ecosystem (indexed 2026-03-11)
Download link:
https://zenodo.org/record/3854457
Description:
This deposit contains data used for the experiments reported in the paper "What Makes a Top-Performing Precision Medicine Search Engine? Tracing Main System Features in a Systematic Way", most notably the ElasticSearch 5.4 indices used for the reported experiments.
To load the indices into an ElasticSearch cluster of your own, use the restore function described in the ElasticSearch documentation.
The names of the index snapshots contained here are:
ct1718 for the indexed ClinicalTrials data used in the TREC-PM challenges in 2017 and 2018.
ct19 for the indexed ClinicalTrials data used in the TREC-PM challenge in 2019.
ba1718 for the indexed PubMed data used in the TREC-PM challenges in 2017 and 2018.
ba19 for the indexed PubMed data used in the TREC-PM challenge in 2019.
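The restore step can be sketched with the standard Elasticsearch snapshot REST endpoints (registering a shared file system repository, then calling `_restore` per snapshot). This is only a minimal sketch under assumptions that the deposit does not confirm: the cluster URL, the repository name `trecpm_repo`, and the repository location are hypothetical placeholders, and the four names above are assumed to be snapshot names inside a single repository. The location must also be listed under `path.repo` in `elasticsearch.yml`.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"             # assumption: a local cluster
REPO = "trecpm_repo"                         # hypothetical repository name
REPO_LOCATION = "/path/to/unpacked/deposit"  # hypothetical; must be allowed via path.repo
SNAPSHOTS = ["ct1718", "ct19", "ba1718", "ba19"]


def build_request(method, path, body=None):
    """Build (but do not send) an HTTP request against the cluster."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    return urllib.request.Request(
        ES_URL + path,
        data=data,
        method=method,
        headers={"Content-Type": "application/json"},
    )


def register_repo_request():
    """Request that registers a shared file system ("fs") snapshot repository."""
    body = {"type": "fs", "settings": {"location": REPO_LOCATION}}
    return build_request("PUT", f"/_snapshot/{REPO}", body)


def restore_request(snapshot):
    """Request that restores one snapshot and waits until the restore is done."""
    path = f"/_snapshot/{REPO}/{snapshot}/_restore?wait_for_completion=true"
    return build_request("POST", path)


def restore_all():
    """Send the repository registration, then one restore request per snapshot."""
    requests = [register_repo_request()] + [restore_request(s) for s in SNAPSHOTS]
    for req in requests:
        with urllib.request.urlopen(req) as resp:
            print(req.get_method(), req.full_url, resp.status)
```

Calling `restore_all()` sends the requests to the cluster. If the deposit stores each snapshot in its own repository directory, the repository would need to be registered once per directory instead.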
The other file in this deposit contains the original output that SMAC wrote to disk during the parameter optimization process. It holds directories for the biomedical abstracts (BA) and the clinical trials (CT), and within them for each split of the respective 10-fold cross-validation. The live-runXX.json files in these directories record the exact parameter configurations and their evaluation scores (measured with the infNDCG metric).
The code for these files is located in this Zenodo deposit.
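As a starting point for inspecting the SMAC output, the following sketch collects every live-runXX.json file below an unpacked directory and parses it. It assumes that each file is a single JSON document; the internal field names are not documented here, so the code deliberately does not interpret the content. The function names are illustrative, not part of the deposit.

```python
import json
from pathlib import Path


def find_run_files(root):
    """Return all live-run*.json files below root, sorted by path."""
    return sorted(Path(root).rglob("live-run*.json"))


def load_runs(root):
    """Map each run file's path (relative to root) to its parsed JSON content."""
    root = Path(root)
    runs = {}
    for path in find_run_files(root):
        # Assumption: one JSON document per file.
        with path.open(encoding="utf-8") as handle:
            runs[path.relative_to(root).as_posix()] = json.load(handle)
    return runs
```

Because the relative path is kept as the key, the collection (BA or CT) and the cross-validation split of each run can be read off the directory components.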
Created:
2020-06-12



