Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted
收藏Hugging Face2025-03-29 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted
下载链接
链接失效反馈官方服务:
资源简介:
在模型Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted评估过程中自动创建的数据集,包含三个配置,每个配置对应一个评估任务。数据集由九次运行结果组成,每次运行结果作为一个特定分割存储,并以时间戳命名。train 分割始终指向最新结果。另外,results 配置存储了所有运行的汇总结果。
Dataset automatically created during the evaluation of model Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted, consisting of three configurations, each corresponding to one evaluation task. The dataset is composed of results from nine runs, with each runs results stored as a specific split named with a timestamp. The train split always points to the latest results. Additionally, the results configuration stores aggregated results from all runs.
提供机构:
Lansechen



