Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync

Name: Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync
Creator: Lansechen
Published: 2025-04-02 06:23:35
License: 暂无描述

Hugging Face2025-04-02 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync

下载链接

链接失效反馈

官方服务：

资源简介：

在评估模型Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync时自动创建的数据集。包含三种配置，每种配置对应一个评估任务。数据集由12次运行构成，每次运行在各个配置中均有对应的分割，分割名以运行时间戳命名。train分割指向最新结果。另外有一个results配置，用于存储所有运行的汇总结果。

Dataset automatically created during the evaluation of model Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync. It consists of 3 configurations, each corresponding to one of the evaluated tasks. The dataset is composed of 12 runs, with each run having a specific split in each configuration, named using the runs timestamp. The train split always points to the latest results. Additionally, there is a results configuration that stores the aggregated results of all runs.

提供机构：

Lansechen

5,000+

优质数据集

54 个

任务类型

进入经典数据集