bigbio/symptemist
收藏Hugging Face2024-07-18 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/bigbio/symptemist
下载链接
链接失效反馈官方服务:
资源简介:
SympTEMIST语料库是一个包含1000份西班牙语临床病例报告的集合,这些报告标注了症状、体征和发现的提及,并标准化为SNOMED CT。
The SympTEMIST corpus is a collection of 1,000 clinical case reports in Spanish annotated with symptoms, signs and findings mentions and normalized to SNOMED CT. This dataset supports tasks including Named Entity Recognition (NER) and Named Entity Disambiguation (NED).
提供机构:
bigbio
原始信息汇总
数据集概述
基本信息
- 名称: SympTEMIST
- 语言: 西班牙语
- 许可证: CC BY 4.0
- 多语言性: 单语种
- 主页: https://temu.bsc.es/symptemist/
- 公开性: 公开
- PubMed: 否
数据集描述
- 内容: 包含1,000份西班牙语临床病例报告,标注了症状、体征和发现,并归一化为SNOMED CT。
- 任务:
- 命名实体识别 (NER)
- 命名实体消歧 (NED)
引用信息
@inproceedings{lima2023overview, title={Overview of SympTEMIST at BioCreative VIII: corpus, guidelines and evaluation of systems for the detection and normalization of symptoms, signs and findings from text}, author={Lima-L{o}pez, Salvador and Farr{e}-Maduell, Eul{`a}lia and Gasco-S{a}nchez, Luis and Rodr{\i}guez-Miret, Jan and Krallinger, Martin}, booktitle={Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the era of Generative Models}, year={2023} }



