i2b2 De-identification Dataset
Source: arXiv · Indexed 2025-09-30
Download link:
https://www.i2b2.org/NLP/DataSets/Main.php
Description:
This dataset supports a medical named entity recognition (NER) task: extracting mentions of problems, tests, and treatments from de-identified clinical notes. These clinical concepts are central to healthcare natural language processing (NLP). It is a standard clinical dataset whose designated task is named entity recognition.
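As a rough illustration of what extracting problem, test, and treatment mentions can look like in practice, the sketch below converts token-level spans into BIO tags, a common encoding for NER. The sentence, the spans, and the helper name `spans_to_bio` are all invented for illustration; they are not taken from the dataset and do not reflect its actual annotation file format.

```python
from typing import List, Tuple

# Hypothetical sentence and spans, invented for illustration only;
# they are not drawn from the dataset.
tokens = ["Chest", "x-ray", "showed", "pneumonia", ";",
          "started", "on", "antibiotics", "."]

# Each span is (start_token, end_token_exclusive, label).
spans = [(0, 2, "test"), (3, 4, "problem"), (7, 8, "treatment")]


def spans_to_bio(n_tokens: int, spans: List[Tuple[int, int, str]]) -> List[str]:
    """Convert token-level spans into one BIO tag per token."""
    tags = ["O"] * n_tokens
    for start, end, label in spans:
        tags[start] = f"B-{label}"          # first token of the mention
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"          # remaining tokens of the mention
    return tags


bio_tags = spans_to_bio(len(tokens), spans)
for token, tag in zip(tokens, bio_tags):
    print(f"{token}\t{tag}")
```

Tokens outside any mention receive the tag `O`, so a tagger trained on this encoding predicts one label per token.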
Provided by:
i2b2
Collected summary
Dataset introduction

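Background and challenges
Background overview
The i2b2 De-identification Dataset has moved to the n2c2 project. It contains de-identified patient discharge summaries. Access requires registering through the DBMI Data Portal and submitting a data use agreement (DUA).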
The content above was collected and summarized by 遇见数据集.



