MIMIC-IV-Ext-Instr: A Dataset of 450K+ EHR-Grounded Instruction-Following Examples
收藏DataCite Commons2025-09-09 更新2026-05-04 收录
下载链接:
https://physionet.org/content/mimic-iv-ext-instr/1.0.0/
下载链接
链接失效反馈官方服务:
资源简介:
Large language models (LLMs) have shown impressive capabilities in solving a
wide range of tasks based on human instructions. However, developing a
conversational AI assistant for electronic health record (EHR) data remains
challenging due to the lack of large-scale instruction-following datasets. To
address this, we present **MIMIC-IV-Ext-Instr** , a dataset containing over
450K open-ended, instruction-following examples generated using GPT-3.5 on a
HIPAA-compliant platform. Derived from the MIMIC-IV EHR database, **MIMIC-IV-
Ext-Instr** spans a wide range of topics and is specifically designed to
support instruction-tuning of general-purpose LLMs for diverse clinical
applications.
提供机构:
PhysioNet
创建时间:
2025-04-18



