glm-simple-evals-dataset
收藏魔搭社区2026-01-06 更新2025-08-09 收录
下载链接:
https://modelscope.cn/datasets/ZhipuAI/glm-simple-evals-dataset
下载链接
链接失效反馈官方服务:
资源简介:
# glm-simple-evals-dataset
This repository is dedicated to storing various evaluation data required for the [glm-simple-evals](https://github.com/zai-org/glm-simple-evals/tree/main) evaluation project, to enable industry researchers and developers to reproduce the performance of the GLM-4.5 series models on reported benchmarks.
Currently, this repository covers the data required for the following evaluation tasks:
- AIME
- GPQA
- HLE
- LiveCodeBench
- MATH 500
- SciCode
- MMLU Pro
## Usage Instructions
To use these evaluation datasets, please refer to the detailed guidelines in the [glm-simple-evals](https://github.com/zai-org/glm-simple-evals/tree/main) project.
# glm-simple-evals 数据集
本仓库用于存储[glm-simple-evals](https://github.com/zai-org/glm-simple-evals/tree/main)评估项目所需的各类评估数据,助力工业界研究者与开发者复现GLM-4.5系列模型在公开基准测试中的性能表现。
目前本仓库涵盖以下评估任务所需的数据:
- AIME
- GPQA
- HLE
- LiveCodeBench
- MATH 500
- SciCode
- MMLU Pro
## 使用说明
若需使用本仓库中的评估数据集,请参阅[glm-simple-evals](https://github.com/zai-org/glm-simple-evals/tree/main)项目中的详细操作指南。
提供机构:
maas
创建时间:
2025-08-06



