mlfoundations-dev/SCP_40k_R1_with_OT_unverified_eval_03-19-25_01-03-32_0981
收藏Hugging Face2025-03-19 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/SCP_40k_R1_with_OT_unverified_eval_03-19-25_01-03-32_0981
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用于评估的预计算模型输出。具体包括在不同数学测试集上的准确度表现,如AIME24、AIME25、AMC23等。
This dataset contains precomputed model outputs for evaluation purposes, including performance metrics on various mathematics test sets such as AIME24, AIME25, AMC23, etc.
提供机构:
mlfoundations-dev



