Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync
下载链接
链接失效反馈官方服务:
资源简介:
在评估模型Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync时自动创建的数据集。包含三种配置,每种配置对应一个评估任务。数据集由12次运行构成,每次运行在各个配置中均有对应的分割,分割名以运行时间戳命名。train分割指向最新结果。另外有一个results配置,用于存储所有运行的汇总结果。
Dataset automatically created during the evaluation of model Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync. It consists of 3 configurations, each corresponding to one of the evaluated tasks. The dataset is composed of 12 runs, with each run having a specific split in each configuration, named using the runs timestamp. The train split always points to the latest results. Additionally, there is a results configuration that stores the aggregated results of all runs.
提供机构:
Lansechen



