KISTI-KONI/ScholarBench

Source: Hugging Face. Updated: 2025-06-30. Indexed: 2025-07-05.
Download link:
https://hf-mirror.com/datasets/KISTI-KONI/ScholarBench
Description:
ScholarBench is a bilingual benchmark, in Korean and English, for evaluating the academic reasoning capabilities of large language models (LLMs) in domain-specific contexts. It covers eight research domains: Business Studies, Chemical Biosciences, Engineering, Physics & Mathematics, Earth & Life Sciences, Medical Science, Socio-Professional Studies, and Liberal Arts & Social Sciences. The dataset includes five task types: summarization, short answer, multiple choice, multiple selection, and true/false. It is designed to assess LLMs' abilities in abstraction, comprehension, and logical inference within specific academic fields.
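
The languages, domains, and task types listed in the description can be laid out as an evaluation grid. The sketch below is illustrative only: the record keys (`language`, `domain`, `task_type`) and the sample records are hypothetical and are not the dataset's actual schema, and the description does not state that every combination is populated, so the grid size is an upper bound.

```python
from collections import Counter
from itertools import product

# Values taken from the dataset description.
LANGUAGES = ("Korean", "English")
DOMAINS = (
    "Business Studies",
    "Chemical Biosciences",
    "Engineering",
    "Physics & Mathematics",
    "Earth & Life Sciences",
    "Medical Science",
    "Socio-Professional Studies",
    "Liberal Arts & Social Sciences",
)
TASK_TYPES = (
    "summarization",
    "short answer",
    "multiple choice",
    "multiple selection",
    "true/false",
)


def evaluation_grid():
    """Return every (language, domain, task type) combination."""
    return list(product(LANGUAGES, DOMAINS, TASK_TYPES))


def count_by_cell(records):
    """Count records per grid cell.

    `records` are dicts with the hypothetical keys 'language', 'domain',
    and 'task_type' (an assumption, not the dataset's real field names).
    """
    return Counter((r["language"], r["domain"], r["task_type"]) for r in records)


# Hypothetical records, for illustration only.
sample = [
    {"language": "English", "domain": "Engineering", "task_type": "true/false"},
    {"language": "English", "domain": "Engineering", "task_type": "true/false"},
    {"language": "Korean", "domain": "Medical Science", "task_type": "summarization"},
]

grid = evaluation_grid()
counts = count_by_cell(sample)
print(len(grid))  # 2 languages x 8 domains x 5 task types = 80
```

Reporting results per cell rather than as one aggregate makes it easier to see whether a model is weak in a particular language, domain, or task type.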
Provider:
KISTI-KONI