BlueZeros/EHR-Bench
Source: Hugging Face | Updated: 2025-11-03 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/BlueZeros/EHR-Bench
Resource description:
EHR-Bench is a comprehensive benchmark for rigorously evaluating Large Language Models (LLMs) on Electronic Health Record (EHR) analysis tasks. It is derived from the MIMIC-IV dataset and consists of 42 tasks in two groups: Decision-Making Tasks and Risk-Prediction Tasks. Decision-Making Tasks cover diagnosis, treatment, and service recommendation; Risk-Prediction Tasks forecast whether a significant medical event will occur within a specified time horizon.
Provider:
BlueZeros



