BRIDGE
收藏BRIDGE 数据集概述
数据集背景
- 全称:Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text
- 目的:评估大型语言模型(LLMs)在医疗领域的可靠性和临床有效性
- 特点:
- 包含87个真实世界临床文本任务
- 涵盖9种语言
- 超过一百万样本
- 评估52个最先进的LLMs(截至2025/04/29)
关键特征
- 临床文本:真实世界临床文本
- 语言:9种
- 任务类型:9种
- 临床专业:14种
- 临床文档类型:7种
- 临床应用:20种,覆盖患者护理的6个临床阶段
数据集获取与使用
- 开放访问数据集:可通过BRIDGE-Open下载
- 受限访问数据集:由于隐私和安全考虑,无法直接发布,但任务描述和数据来源详见BRIDGE论文
使用方法
- 下载数据集:从BRIDGE-Open获取开放访问数据
- LLM推理:
- 将数据放入
dataset_raw文件夹 - 编辑
BRIDGE.yaml和run.sh文件 - 运行
run.sh进行推理
- 将数据放入
- 评估:
- 结果保存在
result文件夹 - 使用
evaluate_BRIDGE.py进行评估 - 性能结果保存在
performance文件夹
- 结果保存在
更新与贡献
- 最新更新:
- 2025/04/28:BRIDGE Leaderboard V1.0.0上线
- 2025/04/28:BRIDGE论文在arXiv发布
- 贡献:欢迎社区贡献临床文本数据集
引用
bibtex @article{BRIDGE-benchmark, title={BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text}, author={Wu, Jiageng and Gu, Bowen and Zhou, Ren and Xie, Kevin and Snyder, Doug and Jiang, Yixing and Carducci, Valentina and Wyss, Richard and Desai, Rishi J and Alsentzer, Emily and Celi, Leo Anthony and Rodman, Adam and Schneeweiss, Sebastian and Chen, Jonathan H. and Romero-Brufau, Santiago and Lin, Kueiyu Joshua and Yang, Jie}, year={2025}, journal={arXiv preprint arXiv: 2504.19467}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2504.19467}, } @article{clinical-text-review, title={Clinical text datasets for medical artificial intelligence and large language models—a systematic review}, author={Wu, Jiageng and Liu, Xiaocong and Li, Minghui and Li, Wanxin and Su, Zichang and Lin, Shixu and Garay, Lucas and Zhang, Zhiyun and Zhang, Yujie and Zeng, Qingcheng and Shen, Jie and Yuan, Changzheng and Yang, Jie}, journal={NEJM AI}, volume={1}, number={6}, pages={AIra2400012}, year={2024}, publisher={Massachusetts Medical Society} }
联系方式
- Leaderboard Managers:Jiageng Wu (jiwu7@bwh.harvard.edu), Kevin Xie (kevinxie@mit.edu), Bowen Gu (bogu@bwh.harvard.edu)
- Benchmark Managers:Jiageng Wu, Bowen Gu
- Project Lead:Jie Yang (jyang66@bwh.harvard.edu)




