COMETA
Source: arXiv. Indexed 2025-09-30.
Download link:
https://github.com/cambridgeltl/cometa
Summary:
COMETA is one of the largest public social media datasets with SNOMED annotations. It provides four split schemes, each with its own training, development, and test sets: stratified general (SG), stratified specific (SS), zero-shot general (ZG), and zero-shot specific (ZS). The core task it supports is zero-shot medical entity retrieval.
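The summary names the split schemes without defining them. As a rough illustration only, the sketch below assumes the usual reading of "zero-shot": concepts in the test set never appear in the training set, whereas a stratified split lets them overlap. The function names, the `(term, concept_id)` layout, and the toy IDs are all hypothetical and are not taken from the COMETA repository.

```python
# Minimal sketch, assuming each example is a (term, concept_id) pair.
# This is NOT the COMETA file format; the IDs are placeholders, not SNOMED codes.

def concept_overlap(train, test):
    """Return the set of concept IDs that occur in both splits."""
    train_ids = {concept_id for _, concept_id in train}
    test_ids = {concept_id for _, concept_id in test}
    return train_ids & test_ids


def is_zero_shot(train, test):
    """True if no test concept was seen in training (assumed zero-shot criterion)."""
    return not concept_overlap(train, test)


toy_train = [("tummy ache", "C1"), ("belly pain", "C1"), ("migraine", "C2")]
toy_test_stratified = [("stomach ache", "C1"), ("bad headache", "C2")]
toy_test_zero_shot = [("can't sleep", "C3")]

print(is_zero_shot(toy_train, toy_test_stratified))  # overlapping concepts
print(is_zero_shot(toy_train, toy_test_zero_shot))   # disjoint concepts
```

A check like this could be run against whichever split files are downloaded, once their actual columns are known, to confirm that a split marked ZG or ZS really has disjoint concept sets.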



