MMLU-Pro Benchmark Questions
Source: Figshare | Updated: 2025-04-08 | Indexed: 2026-04-08
Download link:
https://figshare.com/articles/dataset/MMLU-Pro_Benchmark_Questions/28751756/1
Description:
We investigated DeepSeek R1's ability to diagnose 162 medical scenarios drawn from the MMLU-Pro question-and-answer dataset.
Provided by:
Haider, Maruf; Bajwa, Maria; Hoyt, Robert; Knight, Dacre
Created:
2025-04-08



