SemClinBr
收藏arXiv2020-01-28 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2001.10071v1
下载链接
链接失效反馈官方服务:
资源简介:
SemClinBr是一个多机构、多专业的葡萄牙语临床NLP任务语义标注语料库。该数据集包含1000份临床笔记,标注了65,117个实体和11,263个关系,支持多种临床NLP任务,旨在推动葡萄牙语电子健康记录的二次使用。数据集创建过程中,采用了精细的标注方案和基于网络的标注工具,确保了标注的一致性和效率。该数据集的应用领域包括临床信息提取、医学概念识别和医疗决策支持系统等。
SemClinBr is a multi-institutional, multi-disciplinary semantic annotation corpus for Portuguese clinical natural language processing (NLP) tasks. It contains 1,000 clinical notes, with 65,117 annotated entities and 11,263 annotated relations, supporting a variety of clinical NLP tasks. The corpus aims to promote the secondary utilization of Portuguese electronic health records (EHRs). During its development, a rigorous annotation scheme and web-based annotation tool were adopted to ensure annotation consistency and efficiency. Its application fields include clinical information extraction, medical concept recognition, medical decision support systems and other related areas.
提供机构:
巴西帕拉纳天主教大学健康技术项目
创建时间:
2020-01-28



