Medical Case Report Corpus

Source: arXiv | Updated: 2020-03-29 | Indexed: 2024-06-21
Download link:
https://github.com/adahealth/medical_case_report_corpus
Description:

Medical Case Report Corpus is an annotated dataset of 53 medical case reports, created jointly by the German Research Center for Artificial Intelligence and Ada Health GmbH. The annotations cover medical entities (cases, conditions, findings, factors, and negation modifiers) and the relations between them. The dataset was developed to support automated information extraction from medical texts with natural language processing, in particular through tasks such as named entity recognition, relation extraction, and relevance detection. It is presented as the first English-language resource of its kind available to the scientific community, which makes it valuable for NLP research in the medical field.
Provider:
German Research Center for Artificial Intelligence
Created:
2020-03-29