mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_qwen3_science_eval_8179
Source: Hugging Face. Updated: 2025-07-01. Indexed: 2025-07-05.
Download link:
https://hf-mirror.com/datasets/mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_qwen3_science_eval_8179
Description:
This dataset contains the precomputed model outputs for mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_qwen3_science_eval_8179, used to evaluate the model's performance on several mathematical and programming tasks. It includes accuracy results on benchmarks such as AIME24, AMC23, and MATH500, along with the number of problems solved and the total number of problems in each run.
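As a rough illustration of how per-run solved and total counts relate to an accuracy figure, the sketch below computes a percentage for each run and averages across runs. The function names and the example counts are hypothetical and are not taken from the dataset; the actual column names and aggregation used by the dataset authors may differ.

```python
from statistics import mean


def run_accuracy(solved: int, total: int) -> float:
    """Return the accuracy of a single run as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * solved / total


def summarize(runs):
    """Summarize one benchmark from a list of (solved, total) pairs."""
    per_run = [run_accuracy(solved, total) for solved, total in runs]
    return {"per_run": per_run, "mean": mean(per_run)}


# Made-up counts for illustration only, not real results.
example = summarize([(3, 30), (6, 30)])
print(example)
```

Averaging per-run percentages is only one possible convention; pooling all solved and total counts before dividing gives the same result when every run has the same number of problems.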
Provider:
mlfoundations-dev



