mlfoundations-dev/Llama-3.1-Nemotron-Nano-8B-v1_1743011943_eval_0981
Source: Hugging Face. Updated: 2025-03-26. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/mlfoundations-dev/Llama-3.1-Nemotron-Nano-8B-v1_1743011943_eval_0981
Description:
This dataset contains precomputed model outputs for evaluating the Llama-3.1-Nemotron-Nano-8B-v1 model on mathematical and programming tasks: AIME24, AIME25, AMC23, MATH500, GPQADiamond, and LiveCodeBench. For each task, it records the accuracy and the problems solved across multiple runs.
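Because each task is reported over multiple runs, a reader will usually want to collapse the per-run accuracies into a mean and a spread. The sketch below shows one way to do that. The function name `summarize_runs`, the task labels, and the numbers are hypothetical placeholders for illustration; they are not taken from the dataset, and the dataset's actual field layout is not described here.

```python
from math import sqrt
from statistics import mean, stdev


def summarize_runs(run_accuracies):
    """Return (mean, standard error) of the per-run accuracies for one task."""
    n = len(run_accuracies)
    if n == 0:
        raise ValueError("need at least one run")
    avg = mean(run_accuracies)
    # The sample standard deviation needs at least two runs;
    # report 0.0 as the standard error when only one run is available.
    stderr = stdev(run_accuracies) / sqrt(n) if n > 1 else 0.0
    return avg, stderr


# Placeholder values for illustration only, NOT results from the dataset.
example_runs = {
    "TaskA": [0.5, 0.6, 0.7],
    "TaskB": [0.25, 0.25],
}

summary = {task: summarize_runs(accs) for task, accs in example_runs.items()}
for task, (avg, stderr) in summary.items():
    print(f"{task}: mean={avg:.3f} stderr={stderr:.3f}")
```

Reporting the standard error alongside the mean matters most for the small benchmarks in the list, where a single problem can shift the accuracy of one run noticeably.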
Provider:
mlfoundations-dev



