openthoughts_math_30k_eval_08c7
收藏魔搭社区2025-10-14 更新2025-10-11 收录
下载链接:
https://modelscope.cn/datasets/mlfoundations-dev/openthoughts_math_30k_eval_08c7
下载链接
链接失效反馈官方服务:
资源简介:
# mlfoundations-dev/openthoughts_math_30k_eval_08c7
Precomputed model outputs for evaluation.
## Evaluation Results
### HMMT
- **Average Accuracy**: 15.33% ± 1.96%
- **Number of Runs**: 10
| Run | Accuracy | Questions Solved | Total Questions |
|-----|----------|-----------------|----------------|
| 1 | 26.67% | 8 | 30 |
| 2 | 6.67% | 2 | 30 |
| 3 | 13.33% | 4 | 30 |
| 4 | 20.00% | 6 | 30 |
| 5 | 10.00% | 3 | 30 |
| 6 | 10.00% | 3 | 30 |
| 7 | 10.00% | 3 | 30 |
| 8 | 16.67% | 5 | 30 |
| 9 | 16.67% | 5 | 30 |
| 10 | 23.33% | 7 | 30 |
# mlfoundations-dev/openthoughts_math_30k_eval_08c7
用于评估的预计算模型输出。
## 评估结果
### HMMT
- **平均准确率**:15.33% ± 1.96%
- **运行次数**:10
| 运行序号 | 准确率 | 答对题目数 | 总题目数 |
|-----|----------|-----------------|----------------|
| 1 | 26.67% | 8 | 30 |
| 2 | 6.67% | 2 | 30 |
| 3 | 13.33% | 4 | 30 |
| 4 | 20.00% | 6 | 30 |
| 5 | 10.00% | 3 | 30 |
| 6 | 10.00% | 3 | 30 |
| 7 | 10.00% | 3 | 30 |
| 8 | 16.67% | 5 | 30 |
| 9 | 16.67% | 5 | 30 |
| 10 | 23.33% | 7 | 30 |
提供机构:
maas
创建时间:
2025-10-03



