Evaluation results of the xMEN entity linking toolkit for multiple benchmark datasets

DataONE2024-12-21 更新2025-04-26 收录

下载链接：

https://search.dataone.org/view/sha256:bcef93e6b308bf7fbfbe043688429b847f1aa5e9b622d4dce87e52c0eef381a8

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset contains the benchmark results of the xMEN toolkit for cross-lingual medical entity linking on the following, publicly available benchmark datasets: Mantra Gold Standard Corpus (multilingual) Quaero (French) BRONCO150 (German) DisTEMIST (Spanish) MedMentions (English + machine-translated multilingual versions) For each dataset, we evaluate the default xMEN pipeline with different steps of candidate generation and weakly-supervised and fully-supervised re-ranking on the test sets or 5-fold-cross-validation (for BRONCO150).Users of xMEN can use these data to compare their own results to the current state-of-the-art performance on these benchmarks, when loaded through the BigBIO library., Evaluation of xMEN on datasets loaded from BigBIO dataloaders., , # xMEN Benchmark Results [https://doi.org/10.5061/dryad.15dv41p6h](https://doi.org/10.5061/dryad.15dv41p6h) ## Description of the data and file structure Evaluation of xMEN candidate generation + re-ranking (weakly and fully supervised) on various benchmark datasets. ### Files and variables Each file refers to a subset of a particular benchmark dataset. For each subset, we run candidate generation + weakly-supervised ([filename]_ws.csv) or fully-supervised ([filename]_fs.csv)Â | Benchmark | Subset | file\_name | | :---------- | :---------- | :------------------ | | Mantra | German | mantra\_de | | | English | mantra\_en | | | Spanish | mantra\_es | | | French | mantra\_fr | | | Dutch | mantra\_nl | | Quaero | - | quaero | | BRONCO | Diagnoses | bronco\_diagnoses | | | Medications | bronco\_medica...

创建时间：

2024-12-22

5,000+

优质数据集

54 个

任务类型

进入经典数据集