Evaluation results of the xMEN entity linking toolkit for multiple benchmark datasets
收藏DataONE2024-12-21 更新2025-04-26 收录
下载链接:
https://search.dataone.org/view/sha256:bcef93e6b308bf7fbfbe043688429b847f1aa5e9b622d4dce87e52c0eef381a8
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains the benchmark results of the xMEN toolkit for cross-lingual medical entity linking on the following, publicly available benchmark datasets:
Mantra Gold Standard Corpus (multilingual)
Quaero (French)
BRONCO150 (German)
DisTEMIST (Spanish)
MedMentions (English + machine-translated multilingual versions)
For each dataset, we evaluate the default xMEN pipeline with different steps of candidate generation and weakly-supervised and fully-supervised re-ranking on the test sets or 5-fold-cross-validation (for BRONCO150).Users of xMEN can use these data to compare their own results to the current state-of-the-art performance on these benchmarks, when loaded through the BigBIO library., Evaluation of xMEN on datasets loaded from BigBIO dataloaders., , # xMEN Benchmark Results
[https://doi.org/10.5061/dryad.15dv41p6h](https://doi.org/10.5061/dryad.15dv41p6h)
## Description of the data and file structure
Evaluation of xMEN candidate generation + re-ranking (weakly and fully supervised) on various benchmark datasets.
### Files and variables
Each file refers to a subset of a particular benchmark dataset.
For each subset, we run candidate generation + weakly-supervised ([filename]_ws.csv) or fully-supervised ([filename]_fs.csv)Â
| Benchmark | Subset | file\_name |
| :---------- | :---------- | :------------------ |
| Mantra | German | mantra\_de |
| | English | mantra\_en |
| | Spanish | mantra\_es |
| | French | mantra\_fr |
| | Dutch | mantra\_nl |
| Quaero | - | quaero |
| BRONCO | Diagnoses | bronco\_diagnoses |
| | Medications | bronco\_medica...
创建时间:
2024-12-22



