mlfoundations-dev/openr1_codeforces_eval_2870
收藏Hugging Face2025-06-30 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/openr1_codeforces_eval_2870
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含预计算模型输出的数据集,用于评估。具体是针对AIME24的评估,提供了10次独立运行的准确率以及解决的问题数量和总问题数量。
This dataset includes precomputed model outputs for evaluation purposes, specifically for the AIME24 evaluation, providing accuracy rates for 10 independent runs along with the number of questions solved and the total number of questions.
提供机构:
mlfoundations-dev



