MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters
Source: DataCite Commons (updated 2025-05-05, indexed 2025-05-18)
Download link:
https://physionet.org/content/medisumqa/
Description:
While increasing patients' access to medical documents improves medical care,
this benefit is limited by varying health literacy levels and complex medical
terminology. Large language models (LLMs) offer solutions by simplifying
medical information. However, evaluating LLMs for safe and patient-friendly
text generation is difficult due to the lack of standardized evaluation
resources. To fill this gap, we developed MeDiSumQA, a dataset created from
MIMIC-IV discharge summaries through an automated pipeline that combines
LLM-based question-answer generation with manual quality checks. We
use this dataset to evaluate various LLMs on patient-oriented question-
answering. Our findings reveal that general-purpose LLMs frequently surpass
biomedical-adapted models, while automated metrics correlate with human
judgment. By releasing MeDiSumQA on PhysioNet, we aim to advance the
development of LLMs to enhance patient understanding and ultimately improve
care outcomes.
Provider:
PhysioNet
Created:
2025-04-22



