mlfoundations-dev/OpenThinker-7B_eval_03-07-25_16-29_2870
Source: Hugging Face | Updated: 2025-03-07 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/mlfoundations-dev/OpenThinker-7B_eval_03-07-25_16-29_2870
Summary:
This dataset contains precomputed outputs from the OpenThinker-7B model on the AIME24 benchmark, used to evaluate the model's performance. It reports accuracy results for 5 runs, with a mean accuracy of 14.67%.
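As a rough illustration of how a mean accuracy over several runs can be computed, here is a minimal Python sketch. The per-run correct counts and the function name `mean_accuracy` are hypothetical and are not taken from the dataset; the sketch also assumes AIME24 comprises 30 problems. The counts were chosen only so that the result matches the reported 14.67%.

```python
def mean_accuracy(correct_per_run, num_problems):
    """Return the mean accuracy (in percent) across evaluation runs."""
    accuracies = [100.0 * c / num_problems for c in correct_per_run]
    return sum(accuracies) / len(accuracies)


# Hypothetical per-run correct counts (NOT taken from the dataset).
hypothetical_correct = [4, 5, 4, 5, 4]
NUM_PROBLEMS = 30  # assumption: AIME24 has 30 problems

avg = mean_accuracy(hypothetical_correct, NUM_PROBLEMS)
print(f"{avg:.2f}%")  # prints "14.67%"
```

The actual per-run figures should be read from the dataset itself rather than inferred from this sketch.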
Provider:
mlfoundations-dev



