five

Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization

收藏
DataCite Commons2024-12-14 更新2025-04-16 收录
下载链接:
https://physionet.org/content/task-1-3-soap-note-tag/1.0.0/
下载链接
链接失效反馈
官方服务:
资源简介:
Applying methods in natural language processing on electronic health records (EHR) data is a growing field. Existing corpus and annotation focus on modelling textual features and relation prediction [1] . However, there is a paucity of annotated corpus built to model clinical diagnostic reasoning, a process that involves text understanding, domain knowledge abstraction and reasoning, and clinical text generation. The datasets here support a hierarchical annotation schema with two out of the three stages available to address clinical text understanding and text generation. The datasets provided here are for individual tasks in Stages 1 and 3. The task for Stage 2 was previously accepted as part of the National NLP Clinical Challenges (n2c2) and may be retrieved from the n2c2 challenge website. The annotated corpus is based on an extensive collection of Intensive Care Unit progress notes, a type of EHR documentation that is collected in time series in a problem-oriented format. The progress notes were sourced from MIMIC-III. The conventional format for a progress note follows a Subjective, Objective, Assessment and Plan heading (SOAP). The novel suite of tasks was designed to train and evaluate future NLP models for clinical text understanding, clinical knowledge representation, inference, and summarization. The ultimate goal of these datasets is to advance the development and evaluation of NLP models for clinical applications that lead to AI-assisted clinical decision support and reduce medical errors.
提供机构:
PhysioNet
创建时间:
2022-09-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作