Custom German Medical NER Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/frankkramer-lab/GERNERMED
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个自定义的数据集,包含了8599个句子对,这些句子对被标注了30233个注释,涵盖了9种不同的类别标签。这些标签源自原始数据集,并已被翻译成德语。此外,该数据集是通过将英文句子翻译成德语创建的,并采用了多种匿名化技术以确保数据隐私。在规模上,数据集包含了8599个句子对和30233个注释,其任务是命名实体识别。
This is a custom dataset that contains 8599 sentence pairs annotated with a total of 30233 annotations, covering 9 distinct category labels. These labels are derived from the original dataset and have been translated into German. Moreover, this dataset was developed by translating English sentences into German, and a range of anonymization techniques were employed to safeguard data privacy. With 8599 sentence pairs and 30233 annotations in total, the dataset is designed for the task of named entity recognition (NER).
提供机构:
Frank Kramer Lab



