five

FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8355628
下载链接
链接失效反馈
官方服务:
资源简介:
The French Annotated Resource with Semantic Information for Medical Entities Detection (FRASIMED) contains 2'051 synthetic clinical cases in French, with 24'037 annotated entities. The dataset contains two subsets: CANTEMIST-FR: Originally from CANTEMIST (Miranda-Escalada et al. (2020)), it contains 1'301 oncological notes, with 15'978 annotations linked to an ICD-O-3.1 morphology code. Additionally, 15’457 of them are linked to a SNOMED-CT code. DISTEMIST-FR: Originally from DISTEMIST's training set (Miranda-Escalada et al. (2022)), it contains 750 clinical cases, with 8'059 annotations, with 5'132 of them linked to a SNOMED-CT code. Please, cite us: Zaghir, J., Bjelogrlic, M., Goldman, J.-P., Aananou, S., Gaudet-Blavignac, & Lovis, C. (2023). FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection. arXiv preprint http://arxiv.org/abs/2309.10770
创建时间:
2023-09-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作