PrismaX/SFE
Source: Hugging Face | Updated: 2025-08-11 | Indexed: 2025-07-05
Download link:
https://hf-mirror.com/datasets/PrismaX/SFE
Description:
The Scientists' First Exam (SFE) benchmark is designed to comprehensively evaluate the scientific cognitive abilities of multimodal large language models (MLLMs) across three cognitive levels: scientific signal perception, scientific attribute understanding, and scientific comparative reasoning. The dataset comprises 66 expert-curated, high-value multimodal tasks spanning five disciplines: Astronomy, Chemistry, Earth, Life, and Materials Sciences. Each task is built from native scientific raw data formats and formulated as visual question answering (VQA) pairs that probe a specific level of scientific cognition. All tasks are bilingual (English and Chinese) to support broad accessibility.
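The taxonomy in the description (three cognitive levels, five disciplines, bilingual VQA pairs) can be sketched as a small data model. This is only an illustration: the `VQAItem` record layout, its field names, and the exact-match scoring in `accuracy_by_level` are assumptions, not the dataset's actual schema or the benchmark's official metric, and the sample items are placeholders rather than real SFE questions.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Level(Enum):
    # The three cognitive levels named in the description.
    SIGNAL_PERCEPTION = "scientific signal perception"
    ATTRIBUTE_UNDERSTANDING = "scientific attribute understanding"
    COMPARATIVE_REASONING = "scientific comparative reasoning"


class Discipline(Enum):
    # The five disciplines named in the description.
    ASTRONOMY = "Astronomy"
    CHEMISTRY = "Chemistry"
    EARTH = "Earth"
    LIFE = "Life"
    MATERIALS = "Materials"


@dataclass(frozen=True)
class VQAItem:
    # Hypothetical record layout; the real dataset's fields may differ.
    discipline: Discipline
    level: Level
    language: str  # "en" or "zh"
    question: str
    answer: str


def accuracy_by_level(items, predictions):
    """Exact-match accuracy per cognitive level (an assumed metric)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for item, pred in zip(items, predictions):
        total[item.level] += 1
        if pred.strip().lower() == item.answer.strip().lower():
            correct[item.level] += 1
    return {level: correct[level] / total[level] for level in total}


# Placeholder items, not taken from the dataset.
items = [
    VQAItem(Discipline.ASTRONOMY, Level.SIGNAL_PERCEPTION, "en", "placeholder question 1", "A"),
    VQAItem(Discipline.CHEMISTRY, Level.SIGNAL_PERCEPTION, "en", "placeholder question 2", "B"),
    VQAItem(Discipline.LIFE, Level.COMPARATIVE_REASONING, "zh", "placeholder question 3", "C"),
]
scores = accuracy_by_level(items, ["a", "C", "C"])
```

Grouping scores by level rather than reporting one overall number mirrors the benchmark's stated aim of probing each cognitive level separately; the same function could be keyed on `discipline` or `language` instead.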
Provider:
PrismaX



