Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat
收藏Hugging Face2025-04-03 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat
下载链接
链接失效反馈官方服务:
资源简介:
在模型[Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat](https://huggingface.co/Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat)的评估运行期间自动创建的数据集。该数据集包含三个配置,每个配置对应于一个评估任务。数据集由9次运行的结果组成,每次运行都以其时间戳命名的分割形式存在于每个配置中。train分割始终指向最新结果。还有一个名为results的额外配置,用于存储所有运行的聚合结果。
Dataset automatically created during the evaluation run of model [Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat](https://huggingface.co/Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat). The dataset consists of three configurations, each corresponding to one of the evaluation tasks. The dataset is composed of results from 9 runs, with each run found as a specific split within each configuration, named using the timestamp of the run. The train split always points to the latest results. An additional configuration results stores all aggregated results of the run.
提供机构:
Lansechen



